Improving the Accuracy-Latency Trade-off of Edge-Cloud Computation Offloading for Deep Learning Services

被引:3
|
作者
Zhao, Xiaobo [1 ]
Hosseinzadeh, Minoo [2 ]
Hudson, Nathaniel [2 ]
Khamfroush, Hana [2 ]
Lucani, Daniel E. [1 ]
机构
[1] Aarhus Univ, Dept Engn, DIGIT, Aarhus, Denmark
[2] Univ Kentucky, Dept Comp Sci, Lexington, KY 40506 USA
关键词
D O I
10.1109/GCWkshps50303.2020.9367470
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Offloading tasks to the edge or the Cloud has the potential to improve accuracy of classification and detection tasks as more powerful hardware and machine learning models can be used. The downside is the added delay introduced for sending the data to the Edge/Cloud. In delay-sensitive applications, it is usually necessary to strike a balance between accuracy and latency. However, the state of the art typically considers offloading all-or-nothing decisions, e.g., process locally or send all available data to the Edge (Cloud). Our goal is to expand the options in the accuracy-latency trade-off by allowing the source to send a fraction of the total data for processing. We evaluate the performance of image classifiers when faced with images that have been purposely reduced in quality in order to reduce traffic costs. Using three common models (SqueezeNet, GoogleNet, ResNet) and two data sets (Caltech101, ImageNet) we show that the Gompertz function provides a good approximation to determine the accuracy of a model given the fraction of the data of the image that is actually conveyed to the model. We formulate the offloading decision process using this new flexibility and show that a better overall accuracy-latency trade-off is attained: 58% traffic reduction, 25% latency reduction, as well as 12% accuracy improvement.
引用
收藏
页数:6
相关论文
共 36 条
  • [1] Cloud and Edge Computation Offloading for Latency Limited Services
    Kovacevic, Ivana
    Harjula, Erkki
    Glisic, Savo
    Lorenzo, Beatriz
    Ylianttila, Mika
    [J]. IEEE ACCESS, 2021, 9 : 55764 - 55776
  • [2] Energy-Latency Trade-off for Multiuser Wireless Computation Offloading
    Munoz, Olga
    Pascual Iserte, Antonio
    Vidal, Josep
    Molina, Marc
    [J]. 2014 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE WORKSHOPS (WCNCW), 2014, : 29 - 33
  • [3] Optimal Accuracy-Time Trade-off for Deep Learning Services in Edge Computing Systems
    Hosseinzadeh, Minoo
    Wachal, Andrew
    Khamfroush, Hana
    Lucani, Daniel E.
    [J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [4] Edge-Cloud Collaborative Computation Offloading for Federated Learning in Smart City
    Peng, Kai
    Zhang, Haoqi
    Zhao, Bohai
    Liu, Peichen
    [J]. 2022 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2022, : 706 - 712
  • [5] One Size Does Not Fit All: Quantifying and Exposing the Accuracy-Latency Trade-off in Machine Learning Cloud Service APIs via Tolerance Tiers
    Halpern, Matthew
    Boroujerdian, Behzad
    Mummert, Todd
    Duesterwald, Evelyn
    Reddi, Vijay Janapa
    [J]. 2019 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2019, : 34 - 47
  • [6] Hierarchical Deep Reinforcement Learning for Joint Service Caching and Computation Offloading in Mobile Edge-Cloud Computing
    Sun, Chuan
    Li, Xiuhua
    Wang, Chenyang
    He, Qiang
    Wang, Xiaofei
    Leung, Victor C. M.
    [J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (04) : 1548 - 1564
  • [7] A Trade-off Analysis of Latency, Accuracy, and Energy in Task Offloading Strategies for UAVs
    Erbayat, Egemen
    Zou, Rujia
    Wei, Xianglin
    Venkataramani, Guru
    Subramaniam, Suresh
    [J]. 2024 IEEE CLOUD SUMMIT, CLOUD SUMMIT 2024, 2024, : 48 - 53
  • [8] TRADE-OFF BETWEEN SERVICE DELAY AND POWER CONSUMPTION IN EDGE-CLOUD COMPUTING
    Wang, Xu
    Ni, Hong
    Han, Rui
    Huang, Xingwang
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2018, 14 (06): : 2011 - 2024
  • [9] Improving latency performance trade-off in keyword spotting applications at the edge
    Paissan, Francesco
    Sahabdeen, Anisha Mohamed
    Ancilotto, Alberto
    Farella, Elisabetta
    [J]. 2023 9TH INTERNATIONAL WORKSHOP ON ADVANCES IN SENSORS AND INTERFACES, IWASI, 2023, : 299 - 304
  • [10] Joint Computation Offloading and Resource Allocation for Edge-Cloud Collaboration in Internet of Vehicles via Deep Reinforcement Learning
    Huang, Jiwei
    Wan, Jiangyuan
    Lv, Bofeng
    Ye, Qiang
    Chen, Ying
    [J]. IEEE SYSTEMS JOURNAL, 2023, 17 (02): : 2500 - 2511