DNNTune: Automatic Benchmarking DNN Models for Mobile-cloud Computing

被引:20
|
作者
Xia, Chunwei [1 ,2 ]
Zhao, Jiacheng [1 ,2 ]
Cui, Huimin [1 ,2 ]
Feng, Xiaobing [1 ,2 ]
Xue, Jingling [3 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, State Key Lab Comp Architecture, 6 Kexueyuan South Rd Zhongguancun, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, 19 A Yuquan Rd, Beijing 100049, Peoples R China
[3] Univ New South Wales, Sch Comp Sci & Engn, Gate 14 Barker St, Sydney, NSW 2052, Australia
基金
国家重点研发计划; 中国国家自然科学基金; 澳大利亚研究理事会;
关键词
DNN; mobile-cloud computing; heterogeneous computing;
D O I
10.1145/3368305
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Neural Networks (DNNs) are now increasingly adopted in a variety of Artificial Intelligence (AI) applications. Meantime, more and more DNNs are moving from cloud to the mobile devices, as emerging AI chips are integrated into mobiles. Therefore, the DNN models can be deployed in the cloud, on the mobile devices, or even mobile-cloud coordinate processing, making it a big challenge to select an optimal deployment strategy under specific objectives. This article proposes a DNN tuning framework, i.e., DNNTune, that can provide layer-wise behavior analysis across a number of platforms. Using DNNTune, this article further selects 13 representative DNN models, including CNN, LSTM, and MLP, and three mobile devices ranging from low-end to high-end, and two AI accelerator chips to characterize the DNN models on these devices to further assist users finding opportunities for mobile-cloud coordinate computing. Our experimental results demonstrate that DNNTune can find a coordinated deployment achieving up to 1.66x speedup and 15% energy saving comparing with mobile-only and cloud-only deployment.
引用
收藏
页数:26
相关论文
共 50 条
  • [21] Collaborative Offloading for Distributed Mobile-Cloud Apps
    Debnath, Hillol
    Gezzi, Giacomo
    Corradi, Antonio
    Gehani, Narain
    Ding, Xiaoning
    Curtmola, Reza
    Borcea, Cristian
    [J]. 2018 6TH IEEE INTERNATIONAL CONFERENCE ON MOBILE CLOUD COMPUTING, SERVICES, AND ENGINEERING (MOBILECLOUD 2018), 2018, : 87 - 94
  • [22] RETRACTED ARTICLE: Mobile Cloud Computing: The Taxonomy and Comparison of Mobile Cloud Computing Application Models
    Raazia Sosan
    Choudhry Fahad Azim
    [J]. Wireless Personal Communications, 2016, 89 : 1435 - 1435
  • [23] A self-protecting agents based model for high-performance mobile-cloud computing
    Angin, Pelin
    Bhargava, Bharat
    Ranchal, Rohit
    [J]. COMPUTERS & SECURITY, 2018, 77 : 380 - 396
  • [24] AppMobiCloud: Improving mobile web applications by mobile-cloud convergence
    Wang, Xudong
    Liu, Xuanzhe
    Huang, Gang
    Liu, Yunxin
    [J]. ACM International Conference Proceeding Series, 2013,
  • [25] A Self-Cloning Agents Based Model for High-Performance Mobile-Cloud Computing
    Angin, Pelin
    Bhargava, Bharat
    Jin, Zhongjun
    [J]. 2015 IEEE 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, 2015, : 301 - 308
  • [26] Enriched Connectivity-as-a-Service for Dynamic Mobile-Cloud
    Bala, Lounes
    Suciu, Lucian
    Bonnin, Jean-Marie
    [J]. 2012 IEEE CONSUMER COMMUNICATIONS AND NETWORKING CONFERENCE (CCNC), 2012, : 655 - 660
  • [27] A Survey of Mobile Cloud Computing Application Models
    Khan, Atta Ur Rehman
    Othman, Mazliza
    Madani, Sajjad Ahmad
    Khan, Samee Ullah
    [J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2014, 16 (01): : 393 - 413
  • [28] Application Development Models for Mobile Cloud Computing
    Ali, Fawad
    Khan, Farhan Hassan
    Bashir, Saba
    [J]. 2018 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRONIC AND ELECTRICAL ENGINEERING (ICE CUBE), 2018,
  • [29] RETRACTED: Mobile Cloud Computing: The Taxonomy and Comparison of Mobile Cloud Computing Application Models (Retracted Article)
    Sosan, Raazia
    Azim, Choudhry Fahad
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2016, 89 (04) : 1435 - 1435
  • [30] Mobile-cloud data processing system on digital images
    Samoylov, Alexey
    Borodyansky, Yuri
    Kostyuk, Andrei
    Polovko, Ivan
    [J]. 2019 IEEE 17TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2019, : 1674 - 1678