Energy and Performance Efficient Computation Offloading for Deep Neural Networks in a Mobile Cloud Computing Environment

被引：71

作者：

Eshratifar, Amir Erfan ^{[1
]}

Pedram, Massoud ^{[1
]}

机构：

[1] Univ Southern Calif, Dept Elect Engn, Los Angeles, CA 90007 USA

来源：

PROCEEDINGS OF THE 2018 GREAT LAKES SYMPOSIUM ON VLSI (GLSVLSI'18) | 2018年

关键词：

computation offloading; mobile cloud computing; deep neural networks; energy efficient computing; high performance computing;

D O I：

10.1145/3194554.3194565

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In today's computing technology scene, mobile devices are considered to be computationally weak, while large cloud servers are capable of handling expensive workloads, therefore, intensive computing tasks are typically offloaded to the cloud. Recent advances in learning techniques have enabled Deep Neural Networks (DNNs) to be deployed in a wide range of applications. Commercial speech based intelligent personal assistants (IPA) like Apple's Siri, which employs DNN as its recognition model, operate solely over the cloud. The cloud-only approach may require a large amount of data transfer between the cloud and the mobile device. The mobile-only approach may lack performance efficiency. In addition, the cloud server may be slow at times due to the congestion and limited subscription and mobile devices may have battery usage constraints. In this paper, we investigate the efficiency of offloading only some parts of the computations in DNNs to the cloud. We have formulated an optimal computation offloading framework for forward propagation in DNNs, which adapts to battery usage constraints on the mobile side and limited available resources on the cloud. Our simulation results show that our framework can achieve 1.42x on average and up to 3.07x speedup in the execution time on the mobile device. In addition, it results in 2.11x on average and up to 4.26x reduction in mobile energy consumption.

引用

页码：111 / 116

页数：6

共 50 条

[31] A survey of research on computation offloading in mobile cloud computing
Xiaomin Jin
Wenqiang Hua
Zhongmin Wang
Yanping Chen
Wireless Networks, 2022, 28 : 1563 - 1585
[32] Computation Offloading for Service Workflow in Mobile Cloud Computing
Deng, Shuiguang
Huang, Longtao
Taheri, Javid
Zomaya, Albert Y.
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (12) : 3317 - 3329
[33] Stochastic Computation Offloading Game for Mobile Cloud Computing
Zheng, Jianchao
Cai, Yueming
Wu, Yuan
Shen, Xuemin
2016 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2016,
[34] Computation Offloading for Mobile Cloud Computing Frameworks and Techniques
Abusaimeh, Hesham
TEM JOURNAL-TECHNOLOGY EDUCATION MANAGEMENT INFORMATICS, 2022, 11 (03): : 1042 - 1046
[35] Computation Offloading Frameworks in Mobile Cloud Computing : A Survey
Deshmukh, Shantanu
Shah, Rinku
2016 IEEE INTERNATIONAL CONFERENCE ON CURRENT TRENDS IN ADVANCED COMPUTING (ICCTAC), 2016,
[36] Energy-Efficient Task Offloading for Multiuser Mobile Cloud Computing
Zhao, Yun
Zhou, Sheng
Zhao, Tianchu
Niu, Zhisheng
2015 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2015,
[37] Distributed Computation Offloading in Mobile Fog Computing: A Deep Neural Network Approach
Yang, Zhongjun
Bai, Wenle
IEEE COMMUNICATIONS LETTERS, 2022, 26 (03) : 696 - 700
[38] A Constrained Multi-objective Computation Offloading Algorithm in the Mobile Cloud Computing Environment
Liu, Li
Du, Yuanyuan
Fan, Qi
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (09) : 4329 - 4348
[39] Energy-efficient computation offloading model for mobile phone environment
Fekete, Krisztian
Csorba, Kristof
Forstner, Bertalan
Feher, Marcell
Vajk, Tamas
2012 IEEE 1ST INTERNATIONAL CONFERENCE ON CLOUD NETWORKING (CLOUDNET), 2012,
[40] Energy-efficient computation offloading for vehicular edge computing networks
Gu, Xiaohui
Zhang, Guoan
COMPUTER COMMUNICATIONS, 2021, 166 : 244 - 253

← 1 2 3 4 5 →