Distributed Deep Neural Network Deployment for Smart Devices from the Edge to the Cloud

被引:9
|
作者
Lin, Chang-You [1 ]
Wang, Tzu-Chen [1 ]
Chen, Kuan-Chih [1 ]
Lee, Bor-Yan [1 ]
Kuo, Jian-Jhih [1 ]
机构
[1] Natl Chung Cheng Univ, Chiayi, Taiwan
关键词
distributed deep neural network deployment; hierarchical mobile network; edge computing; cloud computing;
D O I
10.1145/3331052.3332477
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Traditionally, deep learning acceleration mostly focuses on the trade-off between accuracy and training time but seldom addresses the deployment over hierarchical 5G networks to maximize the inference throughput. By contrast, computing offloading research emphasizes whether to offload the tasks to the cloud to reduce computing time and achieve a lower response time, and thus, the optimal deployment to maximize throughput has not been explored. In this paper, we explore Distributed Deep Neural Network Deployment Problem with Constrained Completion Time (TREND-WANT) to solve the deployment problem considering both response time and inference throughput. Due to the intractability of TREND-WANT, we first design a new algorithm, named Stage-Time-Aware Layer Deployment Algorithm (STEED), to maximize the throughput. Afterward, an extension termed STEED with Adaptable Completion Time (STEED-ADAPT) is developed to tailor the solution to achieve a lower responsible time. Simulation results manifest our algorithms outperform the traditional methods by at least 200%.
引用
收藏
页码:43 / 48
页数:6
相关论文
共 50 条
  • [1] Analyzing Distributed Deep Neural Network Deployment on Edge and Cloud Nodes in IoT Systems
    Ashouri, Majid
    Lorig, Fabian
    Davidsson, Paul
    Spalazzese, Romina
    Svorobej, Sergej
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON EDGE COMPUTING (EDGE 2020), 2020, : 59 - 66
  • [2] Cooperative Distributed Deep Neural Network Deployment with Edge Computing
    Yang, Cian-You
    Kuo, Jian-Jhih
    Sheu, Jang-Ping
    Zheng, Ke-Jun
    [J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [3] Distributed Deep Neural Network Training on Edge Devices
    Benditkis, Daniel
    Keren, Aviv
    Mor-Yosef, Liron
    Avidor, Tomer
    Shoham, Neta
    Tal-Israel, Nadav
    [J]. SEC'19: PROCEEDINGS OF THE 4TH ACM/IEEE SYMPOSIUM ON EDGE COMPUTING, 2019, : 304 - 306
  • [4] Distributed Deep Neural Networks over the Cloud, the Edge and End Devices
    Teerapittayanon, Surat
    McDanel, Bradley
    Kung, H. T.
    [J]. 2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, : 328 - 339
  • [5] Characterizing the Deployment of Deep Neural Networks on Commercial Edge Devices
    Hadidi, Ramyad
    Cao, Jiashen
    Xie, Yilun
    Asgari, Bahar
    Krishna, Tushar
    Kim, Hyesoon
    [J]. PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2019), 2019, : 35 - 48
  • [6] Communication Failure Resilient Distributed Neural Network for Edge Devices
    Jeong, Jonghun
    Park, Jong Sung
    Yang, Hoeseok
    [J]. ELECTRONICS, 2021, 10 (14)
  • [7] A Cloud-Edge-Smart IoT Architecture for Speeding Up the Deployment of Neural Network Models with Transfer Learning Techniques
    Hsu, Tz-Heng
    Wang, Zhi-Hao
    See, Aaron Raymond
    [J]. ELECTRONICS, 2022, 11 (14)
  • [8] Deploying Deep Neural Network on Edge-Cloud environment
    Kum, Seungwoo
    Kim, Youngkee
    Moon, Jaewon
    [J]. 2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 242 - 244
  • [9] Distributed Deep Learning Optimized System over the Cloud and Smart Phone Devices
    Jiang, Haotian
    Starkman, James
    Lee, Yu-Ju
    Chen, Huan
    Qian, Xiaoye
    Huang, Ming-Chun
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2021, 20 (01) : 147 - 161
  • [10] A collaborative cloud-edge computing framework in distributed neural network
    Xu, Shihao
    Zhang, Zhenjiang
    Kadoch, Michel
    Cheriet, Mohamed
    [J]. EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2020, 2020 (01)