DNNTune: Automatic Benchmarking DNN Models for Mobile-cloud Computing

被引:20
|
作者
Xia, Chunwei [1 ,2 ]
Zhao, Jiacheng [1 ,2 ]
Cui, Huimin [1 ,2 ]
Feng, Xiaobing [1 ,2 ]
Xue, Jingling [3 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, State Key Lab Comp Architecture, 6 Kexueyuan South Rd Zhongguancun, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, 19 A Yuquan Rd, Beijing 100049, Peoples R China
[3] Univ New South Wales, Sch Comp Sci & Engn, Gate 14 Barker St, Sydney, NSW 2052, Australia
基金
澳大利亚研究理事会; 国家重点研发计划; 中国国家自然科学基金;
关键词
DNN; mobile-cloud computing; heterogeneous computing;
D O I
10.1145/3368305
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Neural Networks (DNNs) are now increasingly adopted in a variety of Artificial Intelligence (AI) applications. Meantime, more and more DNNs are moving from cloud to the mobile devices, as emerging AI chips are integrated into mobiles. Therefore, the DNN models can be deployed in the cloud, on the mobile devices, or even mobile-cloud coordinate processing, making it a big challenge to select an optimal deployment strategy under specific objectives. This article proposes a DNN tuning framework, i.e., DNNTune, that can provide layer-wise behavior analysis across a number of platforms. Using DNNTune, this article further selects 13 representative DNN models, including CNN, LSTM, and MLP, and three mobile devices ranging from low-end to high-end, and two AI accelerator chips to characterize the DNN models on these devices to further assist users finding opportunities for mobile-cloud coordinate computing. Our experimental results demonstrate that DNNTune can find a coordinated deployment achieving up to 1.66x speedup and 15% energy saving comparing with mobile-only and cloud-only deployment.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] The mobile-cloud computing (MCC) roadblocks
    20144400145442
    [J]. (1) Siemens Technology and Services, Bangalore, India, 2014, (IEEE Computer Society):
  • [2] The Mobile-Cloud Computing (MCC) Roadblocks
    Hazarika, Pinku
    Baliga, Vinod
    Tolety, Seshubabu
    [J]. 2014 ELEVENTH INTERNATIONAL CONFERENCE ON WIRELESS AND OPTICAL COMMUNICATIONS NETWORKS (WOCN), 2014,
  • [3] The Moitree middleware for distributed mobile-cloud computing
    Debnath, Hillol
    Khan, Mohammad A.
    Paiker, Nafize R.
    Ding, Xiaoning
    Gehani, Narain
    Curtmola, Reza
    Borcea, Cristian
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2019, 157
  • [4] COMPSs-Mobile: parallel programming for Mobile-Cloud Computing
    Lordan, Francesc
    Badia, Rosa M.
    [J]. 2016 16TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2016, : 497 - 500
  • [5] Java']JavaScript Offloading for Web Applications in Mobile-Cloud Computing
    Yu, Meihua
    Huang, Gang
    Wang, Xudong
    Zhang, Ying
    Chen, Xiangqun
    [J]. 2015 IEEE THIRD INTERNATIONAL CONFERENCE ON MOBILE SERVICES MS 2015, 2015, : 269 - 276
  • [6] Process Batch Offloading Method for Mobile-Cloud Computing Platform
    Thakur, Pawan Kumar
    Verma, Amandeep
    [J]. JOURNAL OF CASES ON INFORMATION TECHNOLOGY, 2015, 17 (03) : 1 - 13
  • [7] Collaborative Mobile-Cloud Computing for Civil Infrastructure Condition Inspection
    Chen, ZhiQiang
    Chen, Jianfei
    Shen, Feichen
    Lee, Yugyung
    [J]. JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2015, 29 (05)
  • [8] A Computation Task Offloading Scheme based on Mobile-Cloud and Edge Computing for WBANs
    Zhang, Rongrong
    Zhou, Chen
    [J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 4504 - 4509
  • [9] Tamper-Resistant Autonomous Agents-Based Mobile-Cloud Computing
    Angin, Pelin
    Bhargava, Bharat
    Ranchal, Rohit
    [J]. NOMS 2016 - 2016 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, 2016, : 843 - 847
  • [10] Characterizing DNN Models for Edge-Cloud Computing
    Xia, Chunwei
    Zhao, Jiacheng
    Cui, Huimin
    Feng, Xiaobing
    [J]. 2018 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC), 2018, : 82 - 83