Energy and Performance Efficient Computation Offloading for Deep Neural Networks in a Mobile Cloud Computing Environment

被引:71
|
作者
Eshratifar, Amir Erfan [1 ]
Pedram, Massoud [1 ]
机构
[1] Univ Southern Calif, Dept Elect Engn, Los Angeles, CA 90007 USA
关键词
computation offloading; mobile cloud computing; deep neural networks; energy efficient computing; high performance computing;
D O I
10.1145/3194554.3194565
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In today's computing technology scene, mobile devices are considered to be computationally weak, while large cloud servers are capable of handling expensive workloads, therefore, intensive computing tasks are typically offloaded to the cloud. Recent advances in learning techniques have enabled Deep Neural Networks (DNNs) to be deployed in a wide range of applications. Commercial speech based intelligent personal assistants (IPA) like Apple's Siri, which employs DNN as its recognition model, operate solely over the cloud. The cloud-only approach may require a large amount of data transfer between the cloud and the mobile device. The mobile-only approach may lack performance efficiency. In addition, the cloud server may be slow at times due to the congestion and limited subscription and mobile devices may have battery usage constraints. In this paper, we investigate the efficiency of offloading only some parts of the computations in DNNs to the cloud. We have formulated an optimal computation offloading framework for forward propagation in DNNs, which adapts to battery usage constraints on the mobile side and limited available resources on the cloud. Our simulation results show that our framework can achieve 1.42x on average and up to 3.07x speedup in the execution time on the mobile device. In addition, it results in 2.11x on average and up to 4.26x reduction in mobile energy consumption.
引用
收藏
页码:111 / 116
页数:6
相关论文
共 50 条
  • [1] Energy efficient computing task offloading strategy for deep neural networks in mobile edge computing
    Gao H.
    Li X.
    Zhou B.
    Liu X.
    Xu J.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2020, 26 (06): : 1607 - 1615
  • [2] A survey on computation offloading in the mobile cloud computing environment
    Liu, Li
    Du, Yuanyuan
    Fan, Qi
    Zhang, Weicun
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2019, 59 (02) : 106 - 113
  • [3] Efficient Multisite Computation Offloading for Mobile Cloud Computing
    Goudarzi, Mohammad
    Movahedi, Zeinab
    Nazari, Masoud
    2016 INT IEEE CONFERENCES ON UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING AND COMMUNICATIONS, CLOUD AND BIG DATA COMPUTING, INTERNET OF PEOPLE, AND SMART WORLD CONGRESS (UIC/ATC/SCALCOM/CBDCOM/IOP/SMARTWORLD), 2016, : 1131 - 1138
  • [4] Efficient Computation Offloading Strategies for Mobile Cloud Computing
    Tao, Yaling
    Zhang, Yongbing
    Ji, Yusheng
    2015 IEEE 29TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (IEEE AINA 2015), 2015, : 626 - 633
  • [5] Energy-efficient computation offloading strategy for the terminal in mobile cloud environment
    Zhang W.
    Cao B.
    Zhou X.
    1600, Science Press (44): : 175 - 180
  • [6] Energy Efficient Computation Offloading in Mobile Edge Computing
    Rong, Bo
    Chen, Ying
    Zhang, Ning
    Wu, Yuan
    Shen, Sherman
    IEEE WIRELESS COMMUNICATIONS, 2023, 30 (02) : 8 - 8
  • [7] Energy-Efficient Computation Offloading for Wearable Devices and Smartphones in Mobile Cloud Computing
    Ragona, Claudio
    Granelli, Fabrizio
    Fiandrino, Claudio
    Kliazovich, Dzmitry
    Bouvry, Pascal
    2015 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2015,
  • [8] Energy-Efficient Dynamic Computation Offloading and Cooperative Task Scheduling in Mobile Cloud Computing
    Guo, Songtao
    Liu, Jiadi
    Yang, Yuanyuan
    Xiao, Bin
    Li, Zhetao
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2019, 18 (02) : 319 - 333
  • [9] Mobile Cloud Computing Architecture for Computation Offloading
    Khanna, Abhirup
    Kero, Archana
    Kumar, Devendra
    PROCEEDINGS ON 2016 2ND INTERNATIONAL CONFERENCE ON NEXT GENERATION COMPUTING TECHNOLOGIES (NGCT), 2016, : 639 - 643
  • [10] Framework for Computation Offloading in Mobile Cloud Computing
    Kovachev, Dejan
    Klamma, Ralf
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2012, 1 (07): : 6 - 15