DISSEC: A distributed deep neural network inference scheduling strategy for edge clusters

Cited by: 0

Authors:

Li, Qiang [1]
Huang, Liang [1]
Tong, Zhao [1]
Du, Ting-Ting [1]
Zhang, Jin [1]
Wang, Sheng-Chun [1]

Affiliation:

[1] Hunan Normal Univ, Coll Informat Sci & Engn, Changsha 410081, Peoples R China

Source:

NEUROCOMPUTING | 2022 / Vol. 500 / pp. 449-460

Funding:

National Natural Science Foundation of China

Keywords:

Edge computing; Deep neural network; Internet of Things; Distributed inference;

DOI:

10.1016/j.neucom.2022.05.084

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

New applications such as intelligent manufacturing, autonomous vehicles and smart cities drive large-scale deep learning models deployed in the Internet of Things (IoT) edge environments. However, deep learning models require substantial computations, storage and communication resources to run. It is generally difficult to deploy and execute a complete deep neural network (DNN) on a resource-constrained edge device. One possible solution is to slice the DNN into multiple tiles distributed to different edge devices, which can reduce the number of computations and quantity of data on each edge device. In this paper, we propose DISSEC, a distributed scheduling strategy for DNN inference on IoT edge clusters. DISSEC leverages spatial partitioning techniques through fusing the convolutional layers and dividing them into multiple partitions that can be executed independently, and proposes a method to express the dependencies between partitions. It further proposes a search algorithm based on heuristics to produce a distributed parallel strategy with the best overall inference execution latency. The evaluation shows that our strategy can fully utilize the edge device resources by cooperating with multiple edge devices to perform partitioning tasks in parallel. Furthermore, compared to the existing work scheduling strategy, our strategy reduces communication overhead by 20% and overall execution latency by 9% under different partitioning granularities and numbers of edge devices. (C) 2022 Elsevier B.V. All rights reserved.
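The spatial partitioning idea mentioned in the abstract can be illustrated with a minimal sketch. It is not DISSEC's algorithm or code: the names `Conv`, `output_size`, `input_range`, and `required_input` are hypothetical, and the sketch assumes square kernels and looks at a single spatial axis only. It walks backward through a stack of fused convolutional layers to find which input rows one output tile needs, which is also where overlap (and therefore dependency or extra communication) between neighbouring partitions comes from.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Conv:
    """Hypothetical description of one convolutional layer along one axis."""
    kernel: int
    stride: int
    padding: int


def output_size(in_size: int, layer: Conv) -> int:
    """Standard convolution output length along one axis."""
    return (in_size + 2 * layer.padding - layer.kernel) // layer.stride + 1


def input_range(out_lo: int, out_hi: int, layer: Conv, in_size: int):
    """Inclusive input index range needed to compute outputs out_lo..out_hi."""
    lo = out_lo * layer.stride - layer.padding
    hi = out_hi * layer.stride - layer.padding + layer.kernel - 1
    # Indices that fall into the zero padding are clipped to the real input.
    return max(lo, 0), min(hi, in_size - 1)


def required_input(out_lo: int, out_hi: int, layers, in_size: int):
    """Walk backward through the fused layers, from the last to the first."""
    sizes = [in_size]  # sizes[i] is the input length of layers[i]
    for layer in layers:
        sizes.append(output_size(sizes[-1], layer))
    lo, hi = out_lo, out_hi
    for layer, size in zip(reversed(layers), reversed(sizes[:-1])):
        lo, hi = input_range(lo, hi, layer, size)
    return lo, hi


# Two fused 3x3 convolutions (stride 1, padding 1) on an 8-row input,
# with the final output split into two tiles of four rows each.
fused = [Conv(3, 1, 1), Conv(3, 1, 1)]
top = required_input(0, 3, fused, 8)
bottom = required_input(4, 7, fused, 8)
print(top, bottom)
```

In this example the two tiles need input rows 0 to 5 and 2 to 7, so rows 2 to 5 are required by both. Either each device recomputes on the shared rows or the devices exchange intermediate data, which is the kind of trade-off a scheduling strategy for partitioned inference has to weigh.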

Pages: 449-460

Page count: 12

50 items in total

[1] Adaptive Distributed Convolutional Neural Network Inference at the Network Edge with ADCNN
Zhang, Sai Qian
Lin, Jieyu
Zhang, Qi
[J]. PROCEEDINGS OF THE 49TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2020, 2020,
[2] Distributed Deep Neural Network Training on Edge Devices
Benditkis, Daniel
Keren, Aviv
Mor-Yosef, Liron
Avidor, Tomer
Shoham, Neta
Tal-Israel, Nadav
[J]. SEC'19: PROCEEDINGS OF THE 4TH ACM/IEEE SYMPOSIUM ON EDGE COMPUTING, 2019, : 304 - 306
[3] Automating Deep Neural Network Model Selection for Edge Inference
Lu, Bingqian
Yang, Jianyi
Chen, Lydia Y.
Ren, Shaolei
[J]. 2019 IEEE FIRST INTERNATIONAL CONFERENCE ON COGNITIVE MACHINE INTELLIGENCE (COGMI 2019), 2019, : 184 - 193
[4] Computation Offloading Scheduling for Deep Neural Network Inference in Mobile Computing
Duan, Yubin
Wu, Jie
[J]. 2021 IEEE/ACM 29TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2021,
[5] Cooperative Distributed Deep Neural Network Deployment with Edge Computing
Yang, Cian-You
Kuo, Jian-Jhih
Sheu, Jang-Ping
Zheng, Ke-Jun
[J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
[6] Low Latency Deep Learning Inference Model for Distributed Intelligent IoT Edge Clusters
Naveen, Soumyalatha
Kounte, Manjunath R.
Ahmed, Mohammed Riyaz
[J]. IEEE ACCESS, 2021, 9 : 160607 - 160621
[7] Poster: Scaling Up Deep Neural Network Optimization for Edge Inference
Lu, Bingqian
Yang, Jianyi
Ren, Shaolei
[J]. 2020 IEEE/ACM SYMPOSIUM ON EDGE COMPUTING (SEC 2020), 2020, : 170 - 172
[8] Modeling of Deep Neural Network (DNN) Placement and Inference in Edge Computing
Bensalem, Mounir
Dizdarevic, Jasenka
Jukan, Admela
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2020,
[9] DistrEdge: Speeding up Convolutional Neural Network Inference on Distributed Edge Devices
Hou, Xueyu
Guan, Yongjie
Han, Tao
Zhang, Ning
[J]. 2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 1097 - 1107
[10] DENNI: Distributed Neural Network Inference on Severely Resource Constrained Edge Devices
Sahu, Rohit
Toepfer, Ryan
Sinclair, Mathew D.
Duwe, Henry
[J]. 2021 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE (IPCCC), 2021,
