Energy and Performance Efficient Computation Offloading for Deep Neural Networks in a Mobile Cloud Computing Environment

被引:71
|
作者
Eshratifar, Amir Erfan [1 ]
Pedram, Massoud [1 ]
机构
[1] Univ Southern Calif, Dept Elect Engn, Los Angeles, CA 90007 USA
关键词
computation offloading; mobile cloud computing; deep neural networks; energy efficient computing; high performance computing;
D O I
10.1145/3194554.3194565
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In today's computing technology scene, mobile devices are considered to be computationally weak, while large cloud servers are capable of handling expensive workloads, therefore, intensive computing tasks are typically offloaded to the cloud. Recent advances in learning techniques have enabled Deep Neural Networks (DNNs) to be deployed in a wide range of applications. Commercial speech based intelligent personal assistants (IPA) like Apple's Siri, which employs DNN as its recognition model, operate solely over the cloud. The cloud-only approach may require a large amount of data transfer between the cloud and the mobile device. The mobile-only approach may lack performance efficiency. In addition, the cloud server may be slow at times due to the congestion and limited subscription and mobile devices may have battery usage constraints. In this paper, we investigate the efficiency of offloading only some parts of the computations in DNNs to the cloud. We have formulated an optimal computation offloading framework for forward propagation in DNNs, which adapts to battery usage constraints on the mobile side and limited available resources on the cloud. Our simulation results show that our framework can achieve 1.42x on average and up to 3.07x speedup in the execution time on the mobile device. In addition, it results in 2.11x on average and up to 4.26x reduction in mobile energy consumption.
引用
收藏
页码:111 / 116
页数:6
相关论文
共 50 条
  • [31] A survey of research on computation offloading in mobile cloud computing
    Xiaomin Jin
    Wenqiang Hua
    Zhongmin Wang
    Yanping Chen
    Wireless Networks, 2022, 28 : 1563 - 1585
  • [32] Computation Offloading for Service Workflow in Mobile Cloud Computing
    Deng, Shuiguang
    Huang, Longtao
    Taheri, Javid
    Zomaya, Albert Y.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (12) : 3317 - 3329
  • [33] Stochastic Computation Offloading Game for Mobile Cloud Computing
    Zheng, Jianchao
    Cai, Yueming
    Wu, Yuan
    Shen, Xuemin
    2016 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2016,
  • [34] Computation Offloading for Mobile Cloud Computing Frameworks and Techniques
    Abusaimeh, Hesham
    TEM JOURNAL-TECHNOLOGY EDUCATION MANAGEMENT INFORMATICS, 2022, 11 (03): : 1042 - 1046
  • [35] Computation Offloading Frameworks in Mobile Cloud Computing : A Survey
    Deshmukh, Shantanu
    Shah, Rinku
    2016 IEEE INTERNATIONAL CONFERENCE ON CURRENT TRENDS IN ADVANCED COMPUTING (ICCTAC), 2016,
  • [36] Energy-Efficient Task Offloading for Multiuser Mobile Cloud Computing
    Zhao, Yun
    Zhou, Sheng
    Zhao, Tianchu
    Niu, Zhisheng
    2015 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2015,
  • [37] Distributed Computation Offloading in Mobile Fog Computing: A Deep Neural Network Approach
    Yang, Zhongjun
    Bai, Wenle
    IEEE COMMUNICATIONS LETTERS, 2022, 26 (03) : 696 - 700
  • [38] A Constrained Multi-objective Computation Offloading Algorithm in the Mobile Cloud Computing Environment
    Liu, Li
    Du, Yuanyuan
    Fan, Qi
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (09) : 4329 - 4348
  • [39] Energy-efficient computation offloading model for mobile phone environment
    Fekete, Krisztian
    Csorba, Kristof
    Forstner, Bertalan
    Feher, Marcell
    Vajk, Tamas
    2012 IEEE 1ST INTERNATIONAL CONFERENCE ON CLOUD NETWORKING (CLOUDNET), 2012,
  • [40] Energy-efficient computation offloading for vehicular edge computing networks
    Gu, Xiaohui
    Zhang, Guoan
    COMPUTER COMMUNICATIONS, 2021, 166 : 244 - 253