DISSEC: A distributed deep neural network inference scheduling strategy for edge clusters

被引:0
|
作者
Li, Qiang [1 ]
Huang, Liang [1 ]
Tong, Zhao [1 ]
Du, Ting-Ting [1 ]
Zhang, Jin [1 ]
Wang, Sheng-Chun [1 ]
机构
[1] Hunan Normal Univ, Coll Informat Sci & Engn, Changsha 410081, Peoples R China
基金
中国国家自然科学基金;
关键词
Edge computing; Deep neural network; Internet of Things; Distributed inference;
D O I
10.1016/j.neucom.2022.05.084
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
New applications such as intelligent manufacturing, autonomous vehicles and smart cities drive large-scale deep learning models deployed in the Internet of Things (IoT) edge environments. However, deep learning models require substantial computations, storage and communication resources to run. It is generally difficult to deploy and execute a complete deep neural network (DNN) on a resource-constrained edge device. One possible solution is to slice the DNN into multiple tiles distributed to different edge devices, which can reduce the number of computations and quantity of data on each edge device. In this paper, we propose DISSEC, a distributed scheduling strategy for DNN inference on IoT edge clusters. DISSEC leverages spatial partitioning techniques through fusing the convolutional layers and dividing them into multiple partitions that can be executed independently, and proposes a method to express the dependencies between partitions. It further proposes a search algorithm based on heuristics to produce a distributed parallel strategy with the best overall inference execution latency. The evaluation shows that our strategy can fully utilize the edge device resources by cooperating with multiple edge devices to perform partitioning tasks in parallel. Furthermore, compared to the existing work scheduling strategy, our strategy reduces communication overhead by 20% and overall execution latency by 9% under different partitioning granularities and numbers of edge devices. (C) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:449 / 460
页数:12
相关论文
共 50 条
  • [1] Adaptive Distributed Convolutional Neural Network Inference at the Network Edge with ADCNN
    Zhang, Sai Qian
    Lin, Jieyu
    Zhang, Qi
    [J]. PROCEEDINGS OF THE 49TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2020, 2020,
  • [2] Distributed Deep Neural Network Training on Edge Devices
    Benditkis, Daniel
    Keren, Aviv
    Mor-Yosef, Liron
    Avidor, Tomer
    Shoham, Neta
    Tal-Israel, Nadav
    [J]. SEC'19: PROCEEDINGS OF THE 4TH ACM/IEEE SYMPOSIUM ON EDGE COMPUTING, 2019, : 304 - 306
  • [3] Automating Deep Neural Network Model Selection for Edge Inference
    Lu, Bingqian
    Yang, Jianyi
    Chen, Lydia Y.
    Ren, Shaolei
    [J]. 2019 IEEE FIRST INTERNATIONAL CONFERENCE ON COGNITIVE MACHINE INTELLIGENCE (COGMI 2019), 2019, : 184 - 193
  • [4] Computation Offloading Scheduling for Deep Neural Network Inference in Mobile Computing
    Duan, Yubin
    Wu, Jie
    [J]. 2021 IEEE/ACM 29TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2021,
  • [5] Cooperative Distributed Deep Neural Network Deployment with Edge Computing
    Yang, Cian-You
    Kuo, Jian-Jhih
    Sheu, Jang-Ping
    Zheng, Ke-Jun
    [J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [6] Low Latency Deep Learning Inference Model for Distributed Intelligent IoT Edge Clusters
    Naveen, Soumyalatha
    Kounte, Manjunath R.
    Ahmed, Mohammed Riyaz
    [J]. IEEE ACCESS, 2021, 9 : 160607 - 160621
  • [7] Poster: Scaling Up Deep Neural Network Optimization for Edge Inference
    Lu, Bingqian
    Yang, Jianyi
    Ren, Shaolei
    [J]. 2020 IEEE/ACM SYMPOSIUM ON EDGE COMPUTING (SEC 2020), 2020, : 170 - 172
  • [8] Modeling of Deep Neural Network (DNN) Placement and Inference in Edge Computing
    Bensalem, Mounir
    Dizdarevic, Jasenka
    Jukan, Admela
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2020,
  • [9] DistrEdge: Speeding up Convolutional Neural Network Inference on Distributed Edge Devices
    Hou, Xueyu
    Guan, Yongjie
    Han, Tao
    Zhang, Ning
    [J]. 2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 1097 - 1107
  • [10] DENNI: Distributed Neural Network Inference on Severely Resource Constrained Edge Devices
    Sahu, Rohit
    Toepfer, Ryan
    Sinclair, Mathew D.
    Duwe, Henry
    [J]. 2021 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE (IPCCC), 2021,