Partitioning and Placement of Deep Neural Networks on Distributed Edge Devices to Maximize Inference Throughput

被引:2
|
作者
Parthasarathy, Arjun [1 ]
Krishnamachari, Bhaskar [2 ]
机构
[1] Crystal Springs Uplands Sch, Hillsborough, CA 94010 USA
[2] Univ Southern Calif, Los Angeles, CA 90007 USA
关键词
D O I
10.1109/ITNAC55475.2022.9998427
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Edge inference has become more widespread, as its diverse applications range from retail to wearable technology. Clusters of networked resource-constrained edge devices are becoming common, yet no system exists to split a DNN across these clusters while maximizing the inference throughput of the system. We present an algorithm which partitions DNNs and distributes them across a set of edge devices with the goal of minimizing the bottleneck latency and therefore maximizing inference throughput. The system scales well to systems of different node memory capacities and numbers of nodes. We find that we can reduce the bottleneck latency by 10x over a random algorithm and 35% over a greedy joint partitioning-placement algorithm. Furthermore we find empirically that for the set of representative models we tested, the algorithm produces results within 9.2% of the optimal bottleneck latency.
引用
收藏
页码:239 / 246
页数:8
相关论文
共 50 条
  • [21] Deploying Deep Neural Networks on Edge Devices for Grape Segmentation
    Roesler, Mathias
    Mohimont, Lucas
    Alin, Francois
    Gaveau, Nathalie
    Steffenel, Luiz Angelo
    [J]. SMART AND SUSTAINABLE AGRICULTURE, SSA 2021, 2021, 1470 : 30 - 43
  • [22] Characterizing the Deployment of Deep Neural Networks on Commercial Edge Devices
    Hadidi, Ramyad
    Cao, Jiashen
    Xie, Yilun
    Asgari, Bahar
    Krishna, Tushar
    Kim, Hyesoon
    [J]. PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2019), 2019, : 35 - 48
  • [23] DISSEC: A distributed deep neural network inference scheduling strategy for edge clusters
    Li, Qiang
    Huang, Liang
    Tong, Zhao
    Du, Ting-Ting
    Zhang, Jin
    Wang, Sheng-Chun
    [J]. NEUROCOMPUTING, 2022, 500 (449-460) : 449 - 460
  • [24] Improving QoE of Deep Neural Network Inference on Edge Devices: A Bandit Approach
    Lu, Bingqian
    Yang, Jianyi
    Xu, Jie
    Ren, Shaolei
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (21) : 21409 - 21420
  • [25] Inference and Energy Efficient Design of Deep Neural Networks for Embedded Devices
    Galanis, Ioannis
    Anagnostopoulos, Iraklis
    Nguyen, Chinh
    Bares, Guillermo
    Burkard, Dona
    [J]. 2020 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2020), 2020, : 36 - 41
  • [26] Efficient Distributed Inference of Deep Neural Networks via Restructuring and Pruning
    Abdi, Afshin
    Rashidi, Saeed
    Fekri, Faramarz
    Krishna, Tushar
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 6640 - 6648
  • [27] Smart Classrooms aided by Deep Neural Networks inference on Mobile Devices
    Pacheco, Alberto
    Flores, Ever
    Sanchez, Raul
    Almanza-Garcia, Salvador
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2018, : 605 - 609
  • [28] OnceNAS: Discovering efficient on-device inference neural networks for edge devices
    Zhang, Yusen
    Qin, Yunchuan
    Zhang, Yufeng
    Zhou, Xu
    Jian, Songlei
    Tan, Yusong
    Li, Kenli
    [J]. INFORMATION SCIENCES, 2024, 669
  • [29] Distributed DNN Inference With Fine-Grained Model Partitioning in Mobile Edge Computing Networks
    Li, Hui
    Li, Xiuhua
    Fan, Qilin
    He, Qiang
    Wang, Xiaofei
    Leung, Victor C. M.
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (10) : 9060 - 9074
  • [30] Characterizing the Execution of Deep Neural Networks on Collaborative Robots and Edge Devices
    Merck, Matthew L.
    Wang, Bingyao
    Liu, Lixing
    Jia, Chunjun
    Siqueira, Arthur
    Huang, Qiusen
    Saraha, Abhijeet
    Lim, Dongsuk
    Cao, Jiashen
    Hadidi, Ramyad
    Kim, Hyesoon
    [J]. PEARC '19: PROCEEDINGS OF THE PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING ON RISE OF THE MACHINES (LEARNING), 2019,