Improving the Accuracy-Latency Trade-off of Edge-Cloud Computation Offloading for Deep Learning Services

被引：3

作者：

Zhao, Xiaobo ^{[1
]}

Hosseinzadeh, Minoo ^{[2
]}

Hudson, Nathaniel ^{[2
]}

Khamfroush, Hana ^{[2
]}

Lucani, Daniel E. ^{[1
]}

机构：

[1] Aarhus Univ, Dept Engn, DIGIT, Aarhus, Denmark

[2] Univ Kentucky, Dept Comp Sci, Lexington, KY 40506 USA

来源：

2020 IEEE GLOBECOM WORKSHOPS (GC WKSHPS) | 2020年

关键词：

D O I：

10.1109/GCWkshps50303.2020.9367470

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Offloading tasks to the edge or the Cloud has the potential to improve accuracy of classification and detection tasks as more powerful hardware and machine learning models can be used. The downside is the added delay introduced for sending the data to the Edge/Cloud. In delay-sensitive applications, it is usually necessary to strike a balance between accuracy and latency. However, the state of the art typically considers offloading all-or-nothing decisions, e.g., process locally or send all available data to the Edge (Cloud). Our goal is to expand the options in the accuracy-latency trade-off by allowing the source to send a fraction of the total data for processing. We evaluate the performance of image classifiers when faced with images that have been purposely reduced in quality in order to reduce traffic costs. Using three common models (SqueezeNet, GoogleNet, ResNet) and two data sets (Caltech101, ImageNet) we show that the Gompertz function provides a good approximation to determine the accuracy of a model given the fraction of the data of the image that is actually conveyed to the model. We formulate the offloading decision process using this new flexibility and show that a better overall accuracy-latency trade-off is attained: 58% traffic reduction, 25% latency reduction, as well as 12% accuracy improvement.

引用

页数：6

共 36 条

[1] Cloud and Edge Computation Offloading for Latency Limited Services
Kovacevic, Ivana
Harjula, Erkki
Glisic, Savo
Lorenzo, Beatriz
Ylianttila, Mika
[J]. IEEE ACCESS, 2021, 9 : 55764 - 55776
[2] Energy-Latency Trade-off for Multiuser Wireless Computation Offloading
Munoz, Olga
Pascual Iserte, Antonio
Vidal, Josep
Molina, Marc
[J]. 2014 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE WORKSHOPS (WCNCW), 2014, : 29 - 33
[3] Optimal Accuracy-Time Trade-off for Deep Learning Services in Edge Computing Systems
Hosseinzadeh, Minoo
Wachal, Andrew
Khamfroush, Hana
Lucani, Daniel E.
[J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
[4] Edge-Cloud Collaborative Computation Offloading for Federated Learning in Smart City
Peng, Kai
Zhang, Haoqi
Zhao, Bohai
Liu, Peichen
[J]. 2022 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2022, : 706 - 712
[5] One Size Does Not Fit All: Quantifying and Exposing the Accuracy-Latency Trade-off in Machine Learning Cloud Service APIs via Tolerance Tiers
Halpern, Matthew
Boroujerdian, Behzad
Mummert, Todd
Duesterwald, Evelyn
Reddi, Vijay Janapa
[J]. 2019 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2019, : 34 - 47
[6] Hierarchical Deep Reinforcement Learning for Joint Service Caching and Computation Offloading in Mobile Edge-Cloud Computing
Sun, Chuan
Li, Xiuhua
Wang, Chenyang
He, Qiang
Wang, Xiaofei
Leung, Victor C. M.
[J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (04) : 1548 - 1564
[7] A Trade-off Analysis of Latency, Accuracy, and Energy in Task Offloading Strategies for UAVs
Erbayat, Egemen
Zou, Rujia
Wei, Xianglin
Venkataramani, Guru
Subramaniam, Suresh
[J]. 2024 IEEE CLOUD SUMMIT, CLOUD SUMMIT 2024, 2024, : 48 - 53
[8] TRADE-OFF BETWEEN SERVICE DELAY AND POWER CONSUMPTION IN EDGE-CLOUD COMPUTING
Wang, Xu
Ni, Hong
Han, Rui
Huang, Xingwang
[J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2018, 14 (06): : 2011 - 2024
[9] Improving latency performance trade-off in keyword spotting applications at the edge
Paissan, Francesco
Sahabdeen, Anisha Mohamed
Ancilotto, Alberto
Farella, Elisabetta
[J]. 2023 9TH INTERNATIONAL WORKSHOP ON ADVANCES IN SENSORS AND INTERFACES, IWASI, 2023, : 299 - 304
[10] Joint Computation Offloading and Resource Allocation for Edge-Cloud Collaboration in Internet of Vehicles via Deep Reinforcement Learning
Huang, Jiwei
Wan, Jiangyuan
Lv, Bofeng
Ye, Qiang
Chen, Ying
[J]. IEEE SYSTEMS JOURNAL, 2023, 17 (02): : 2500 - 2511

← 1 2 3 4 →