Adaptive Parallel Execution of Deep Neural Networks on Heterogeneous Edge Devices

被引:62
|
作者
Zhou, Li [1 ]
Samavatian, Mohammad Hossein [1 ]
Bacha, Anys [2 ]
Majumdar, Saikat [1 ]
Teodorescu, Radu [1 ]
机构
[1] Ohio State Univ, Columbus, OH 43210 USA
[2] Univ Michigan Dearborn, Dearborn, MI USA
关键词
deep learning; edge devices; parallel execution; inference; IOT;
D O I
10.1145/3318216.3363312
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
New applications such as smart homes, smart cities, and autonomous vehicles are driving an increased interest in deploying machine learning on edge devices. Unfortunately, deploying deep neural networks (DNNs) on resource-constrained devices presents significant challenges. These workloads are computationally intensive and often require cloud-like resources. Prior solutions attempted to address these challenges by either introducing more design efforts or by relying on cloud resources for assistance. In this paper, we propose a runtime adaptive convolutional neural network (CNN) acceleration framework that is optimized for heterogeneous Internet of Things (IoT) environments. The framework leverages spatial partitioning techniques through fusion of the convolution layers and dynamically selects the optimal degree of parallelism according to the availability of computational resources, as well as network conditions. Our evaluation shows that our framework outperforms state-of-art approaches by improving the inference speed and reducing communication costs while running on wirelessly-connected Raspberry-Pi3 devices. Experimental evaluation shows up to 1.9x similar to 3.7x speedup using 8 devices for three popular CNN models.
引用
收藏
页码:195 / 208
页数:14
相关论文
共 50 条
  • [1] Characterizing the Execution of Deep Neural Networks on Collaborative Robots and Edge Devices
    Merck, Matthew L.
    Wang, Bingyao
    Liu, Lixing
    Jia, Chunjun
    Siqueira, Arthur
    Huang, Qiusen
    Saraha, Abhijeet
    Lim, Dongsuk
    Cao, Jiashen
    Hadidi, Ramyad
    Kim, Hyesoon
    [J]. PEARC '19: PROCEEDINGS OF THE PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING ON RISE OF THE MACHINES (LEARNING), 2019,
  • [2] Efficient Execution of Deep Neural Networks on Mobile Devices with NPU
    Tan, Tianxiang
    Cao, Guohong
    [J]. IPSN'21: PROCEEDINGS OF THE 20TH ACM/IEEE CONFERENCE ON INFORMATION PROCESSING IN SENSOR NETWORKS, 2021, : 283 - 298
  • [3] Verifiable Deep Learning Inference on Heterogeneous Edge Devices With Trusted Execution Environment
    Liao, Longlong
    Zheng, Yuqiang
    Lu, Hong
    Liu, Xinqi
    Chen, Shuguang
    Yu, Yuanlong
    [J]. IEEE SENSORS JOURNAL, 2024, 24 (17) : 28351 - 28362
  • [4] DeepEdgeBench: Benchmarking Deep Neural Networks on Edge Devices
    Baller, Stephan Patrick
    Jindal, Anshul
    Chadha, Mohak
    Gerndt, Michael
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING, IC2E 2021, 2021, : 20 - 30
  • [5] The Case for Adaptive Deep Neural Networks in Edge Computing
    McNamee, Francis
    Dustdar, Schahram
    Kilpatrick, Peter
    Shi, Weisong
    Spence, Ivor
    Varghese, Blesson
    [J]. 2021 IEEE 14TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2021), 2021, : 43 - 52
  • [6] Deploying Deep Neural Networks on Edge Devices for Grape Segmentation
    Roesler, Mathias
    Mohimont, Lucas
    Alin, Francois
    Gaveau, Nathalie
    Steffenel, Luiz Angelo
    [J]. SMART AND SUSTAINABLE AGRICULTURE, SSA 2021, 2021, 1470 : 30 - 43
  • [7] Characterizing the Deployment of Deep Neural Networks on Commercial Edge Devices
    Hadidi, Ramyad
    Cao, Jiashen
    Xie, Yilun
    Asgari, Bahar
    Krishna, Tushar
    Kim, Hyesoon
    [J]. PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2019), 2019, : 35 - 48
  • [8] Distributed Deep Neural Networks over the Cloud, the Edge and End Devices
    Teerapittayanon, Surat
    McDanel, Bradley
    Kung, H. T.
    [J]. 2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, : 328 - 339
  • [9] Measurement Methods for Software Execution Time on Heterogeneous Edge Devices
    Rupprecht, Bernhard
    Vogel-Heuser, Birgit
    Neumann, Eva-Maria
    [J]. 2023 IEEE 21ST INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS, INDIN, 2023,
  • [10] Networks-on-Chip based Deep Neural Networks Accelerators for IoT Edge Devices
    Ascia, Giuseppe
    Catania, Vincenzo
    Monteleone, Salvatore
    Palesi, Maurizio
    Patti, Davide
    Jose, John
    [J]. 2019 SIXTH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS: SYSTEMS, MANAGEMENT AND SECURITY (IOTSMS), 2019, : 227 - 234