Improving the Accuracy-Latency Trade-off of Edge-Cloud Computation Offloading for Deep Learning Services

Cited by: 3
Authors
Zhao, Xiaobo [1]
Hosseinzadeh, Minoo [2]
Hudson, Nathaniel [2]
Khamfroush, Hana [2]
Lucani, Daniel E. [1]
Affiliations
[1] Aarhus Univ, Dept Engn, DIGIT, Aarhus, Denmark
[2] Univ Kentucky, Dept Comp Sci, Lexington, KY 40506 USA
DOI
10.1109/GCWkshps50303.2020.9367470
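Chinese Library Classification (CLC)
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809
Abstract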
Offloading tasks to the Edge or the Cloud can improve the accuracy of classification and detection tasks, because more powerful hardware and machine learning models can be used. The downside is the added delay of sending the data to the Edge/Cloud. Delay-sensitive applications usually need to strike a balance between accuracy and latency. However, the state of the art typically considers all-or-nothing offloading decisions, e.g., process locally or send all available data to the Edge (Cloud). Our goal is to expand the options in the accuracy-latency trade-off by allowing the source to send a fraction of the total data for processing. We evaluate the performance of image classifiers on images whose quality has been purposely reduced in order to reduce traffic costs. Using three common models (SqueezeNet, GoogLeNet, ResNet) and two data sets (Caltech101, ImageNet), we show that the Gompertz function provides a good approximation of a model's accuracy given the fraction of the image data that is actually conveyed to the model. We formulate the offloading decision process using this new flexibility and show that a better overall accuracy-latency trade-off is attained: 58% traffic reduction, 25% latency reduction, and 12% accuracy improvement.
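As a rough illustration of the idea in the abstract, the sketch below models accuracy as a Gompertz curve of the fraction of image data sent and picks the largest fraction that fits a latency budget. Everything beyond the Gompertz form itself is an assumption made for illustration: the parameter values `a`, `b`, `c`, the simple latency model (transmission time plus a fixed compute time), and the helper name `best_fraction` are hypothetical and are not taken from the paper.

```python
import math


def gompertz(x, a, b, c):
    """Gompertz curve a * exp(-b * exp(-c * x)); increasing in x for a, b, c > 0."""
    return a * math.exp(-b * math.exp(-c * x))


def best_fraction(size_bytes, bandwidth_bps, compute_s, budget_s, a, b, c, steps=100):
    """Return (fraction, predicted_accuracy, latency_s) or None if nothing fits.

    Hypothetical latency model: time to transmit `fraction` of the data
    plus a fixed remote compute time. Because the Gompertz curve is
    increasing, the largest feasible fraction gives the best predicted accuracy.
    """
    best = None
    for i in range(1, steps + 1):
        frac = i / steps
        latency = frac * size_bytes * 8 / bandwidth_bps + compute_s
        if latency > budget_s:
            break  # latency grows with frac, so larger fractions cannot fit either
        best = (frac, gompertz(frac, a, b, c), latency)
    return best


if __name__ == "__main__":
    # Illustrative numbers only: a 100 kB image, an 8 Mbit/s link, 20 ms compute.
    choice = best_fraction(100_000, 8e6, 0.02, 0.0725, a=0.9, b=5.0, c=6.0)
    print(choice)
```

In practice the curve parameters would be fitted per model and data set from measured accuracy at several data fractions; the values used here are placeholders.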
Pages: 6
Related papers
36 in total
  • [31] Computation Offloading and Resource Management for Energy and Cost Trade-Offs with Deep Reinforcement Learning in Mobile Edge Computing
    Mo, Ruichao
    Xu, Xiaolong
    Zhang, Xuyun
    Qi, Lianyong
    Liu, Qi
    [J]. SERVICE-ORIENTED COMPUTING (ICSOC 2021), 2021, 13121 : 563 - 577
  • [32] Optimal trade-off between accuracy and network cost of distributed learning in Mobile Edge Computing: An analytical approach
    Valerio, Lorenzo
    Passarella, Andrea
    Conti, Marco
    [J]. 2017 IEEE 18TH INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS, MOBILE AND MULTIMEDIA NETWORKS (WOWMOM), 2017
  • [33] DeepBrain: Experimental Evaluation of Cloud-Based Computation Offloading and Edge Computing in the Internet-of-Drones for Deep Learning Applications
    Koubaa, Anis
    Ammar, Adel
    Alahdab, Mahmoud
    Kanhouch, Anas
    Azar, Ahmad Taher
    [J]. SENSORS, 2020, 20 (18) : 1 - 25
  • [34] Exploring the Power - Prediction Accuracy Trade-Off in a Deep Learning Neural Network using Wide Compliance RRAM Device
    Prabhu, Nagaraj Lakshmana
    Jun, Desmond Loy Jia
    Dananjaya, Putu Andhita
    Toh, Eng Huat
    Lew, Wen Siang
    Raghavan, Nagarajan
    [J]. 2019 8TH INTERNATIONAL SYMPOSIUM ON NEXT GENERATION ELECTRONICS (ISNE), 2019
  • [35] Tackling the Accuracy-Interpretability Trade-off: Interpretable Deep Learning Models for Satellite Image-based Real Estate Appraisal
    Kucklick, Jan-Peter
    Mueller, Oliver
    [J]. ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2023, 14 (01)
  • [36] Deep reinforcement learning based computation offloading for xURLLC services with UAV-assisted IoT-based multi-access edge computing system
    Fatima, Nida
    Saxena, Paresh
    Giambene, Giovanni
    [J]. WIRELESS NETWORKS, 2023, 30 (09) : 7275 - 7291