Energy and Performance Efficient Computation Offloading for Deep Neural Networks in a Mobile Cloud Computing Environment

被引:71
|
作者
Eshratifar, Amir Erfan [1 ]
Pedram, Massoud [1 ]
机构
[1] Univ Southern Calif, Dept Elect Engn, Los Angeles, CA 90007 USA
来源
PROCEEDINGS OF THE 2018 GREAT LAKES SYMPOSIUM ON VLSI (GLSVLSI'18) | 2018年
关键词
computation offloading; mobile cloud computing; deep neural networks; energy efficient computing; high performance computing;
D O I
10.1145/3194554.3194565
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In today's computing technology scene, mobile devices are considered to be computationally weak, while large cloud servers are capable of handling expensive workloads, therefore, intensive computing tasks are typically offloaded to the cloud. Recent advances in learning techniques have enabled Deep Neural Networks (DNNs) to be deployed in a wide range of applications. Commercial speech based intelligent personal assistants (IPA) like Apple's Siri, which employs DNN as its recognition model, operate solely over the cloud. The cloud-only approach may require a large amount of data transfer between the cloud and the mobile device. The mobile-only approach may lack performance efficiency. In addition, the cloud server may be slow at times due to the congestion and limited subscription and mobile devices may have battery usage constraints. In this paper, we investigate the efficiency of offloading only some parts of the computations in DNNs to the cloud. We have formulated an optimal computation offloading framework for forward propagation in DNNs, which adapts to battery usage constraints on the mobile side and limited available resources on the cloud. Our simulation results show that our framework can achieve 1.42x on average and up to 3.07x speedup in the execution time on the mobile device. In addition, it results in 2.11x on average and up to 4.26x reduction in mobile energy consumption.
引用
收藏
页码:111 / 116
页数:6
相关论文
共 50 条
  • [41] Efficient Multi-User Computation Offloading for Mobile-Edge Cloud Computing
    Chen, Xu
    Jiao, Lei
    Li, Wenzhong
    Fu, Xiaoming
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2016, 24 (05) : 2827 - 2840
  • [42] Computation Offloading and Resource Allocation For Cloud Assisted Mobile Edge Computing in Vehicular Networks
    Zhao, Junhui
    Li, Qiuping
    Gong, Yi
    Zhang, Ke
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (08) : 7944 - 7956
  • [43] Energy-Efficient Computation Offloading in Mobile Edge Computing Systems With Uncertainties
    Ji, Tianxi
    Luo, Changqing
    Yu, Lixing
    Wang, Qianlong
    Chen, Siheng
    Thapa, Arun
    Li, Pan
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (08) : 5717 - 5729
  • [44] Time and Energy Saving through Computation Offloading with Bandwidth Consideration for Mobile Cloud Computing
    Pawar, Apurva
    Jagtap, Vandana
    Bhamare, Mamta
    PROCEEDING OF THE THIRD INTERNATIONAL SYMPOSIUM ON WOMEN IN COMPUTING AND INFORMATICS (WCI-2015), 2015, : 527 - 532
  • [45] Energy-Efficient Dynamic Task Offloading for Energy Harvesting Mobile Cloud Computing
    Zhang, Yongqiang
    He, Jianbo
    Guo, Songtao
    2018 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, ARCHITECTURE AND STORAGE (NAS), 2018,
  • [46] An Adaptive Approach Towards Computation Offloading for Mobile Cloud Computing
    Kero, Archana
    Khanna, Abhirup
    Kumar, Devendra
    Agarwal, Amit
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY AND WEB ENGINEERING, 2019, 14 (02) : 52 - 73
  • [47] Energy Efficient Joint Computation Offloading and Service Caching for Mobile Edge Computing: A Deep Reinforcement Learning Approach
    Zhou, Huan
    Zhang, Zhenyu
    Wu, Yuan
    Dong, Mianxiong
    Leung, Victor C. M.
    IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2023, 7 (02): : 950 - 961
  • [48] Computation offloading in mobile edge computing networks: A survey
    Feng, Chuan
    Han, Pengchao
    Zhang, Xu
    Yang, Bowen
    Liu, Yejun
    Guo, Lei
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2022, 202
  • [49] Phone2Cloud: Exploiting computation offloading for energy saving on smartphones in mobile cloud computing
    Xia, Feng
    Ding, Fangwei
    Li, Jie
    Kong, Xiangjie
    Yang, Laurence T.
    Ma, Jianhua
    INFORMATION SYSTEMS FRONTIERS, 2014, 16 (01) : 95 - 111
  • [50] Phone2Cloud: Exploiting computation offloading for energy saving on smartphones in mobile cloud computing
    Feng Xia
    Fangwei Ding
    Jie Li
    Xiangjie Kong
    Laurence T. Yang
    Jianhua Ma
    Information Systems Frontiers, 2014, 16 : 95 - 111