Distributed Deep Neural Networks over the Cloud, the Edge and End Devices

被引:504
|
作者
Teerapittayanon, Surat [1 ]
McDanel, Bradley [1 ]
Kung, H. T. [1 ]
机构
[1] Harvard Univ, Cambridge, MA 02138 USA
关键词
distributed deep neural networks; deep neural networks; dnn; ddnn; embedded dnn; sensor fusion; distributed computing hierarchies; edge computing; cloud computing;
D O I
10.1109/ICDCS.2017.226
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We propose distributed deep neural networks (DDNNs) over distributed computing hierarchies, consisting of the cloud, the edge (fog) and end devices. While being able to accommodate inference of a deep neural network (DNN) in the cloud, a DDNN also allows fast and localized inference using shallow portions of the neural network at the edge and end devices. When supported by a scalable distributed computing hierarchy, a DDNN can scale up in neural network size and scale out in geographical span. Due to its distributed nature, DDNNs enhance sensor fusion, system fault tolerance and data privacy for DNN applications. In implementing a DDNN, we map sections of a DNN onto a distributed computing hierarchy. By jointly training these sections, we minimize communication and resource usage for devices and maximize usefulness of extracted features which are utilized in the cloud. The resulting system has built-in support for automatic sensor fusion and fault tolerance. As a proof of concept, we show a DDNN can exploit geographical diversity of sensors to improve object recognition accuracy and reduce communication cost. In our experiment, compared with the traditional method of offloading raw sensor data to be processed in the cloud, DDNN locally processes most sensor data on end devices while achieving high accuracy and is able to reduce the communication cost by a factor of over 20x.
引用
收藏
页码:328 / 339
页数:12
相关论文
共 50 条
  • [31] Content Delivery Networks in the Cloud with Distributed Edge Servers
    Panchal, Parag
    Ramaswamy, Nikhil Meenakshaiah
    Su, Xiao
    Dong, Yi
    [J]. 2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 526 - 532
  • [32] EasyDist: An End-to-End Distributed Deep Learning Tool for Cloud
    Natu, Varun
    Ghosh, Rahul
    [J]. PROCEEDINGS OF THE 6TH ACM IKDD CODS AND 24TH COMAD, 2019, : 265 - 268
  • [33] Cloud-Edge-End Collaborative Task Offloading in Vehicular Edge Networks: A Multilayer Deep Reinforcement Learning Approach
    Wu, Jiaqi
    Tang, Ming
    Jiang, Changkun
    Gao, Lin
    Cao, Bin
    [J]. IEEE Internet of Things Journal, 2024, 11 (22) : 36272 - 36290
  • [34] Functions as a service for distributed deep neural network inference over the cloud-to-things continuum
    Bueno, Altair
    Rubio, Bartolome
    Martin, Cristian
    Diaz, Manuel
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2024, 54 (08): : 1297 - 1311
  • [35] Editorial: Deep neural networks with cloud computing
    Chan, Kit Yan
    Abu-Salih, Bilal
    Muhammad, Khan
    Palade, Vasile
    Chai, Rifai
    [J]. NEUROCOMPUTING, 2023, 521 : 189 - 190
  • [36] Shifting Capsule Networks from the Cloud to the Deep Edge
    Costa, Miguel
    Costa, Diogo
    Gomes, Tiago
    Pinto, Sandro
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2022, 13 (06)
  • [37] Efficient Deep Neural Networks for Edge Computing
    Alnemari, Mohammed
    Bagherzadeh, Nader
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON EDGE COMPUTING (IEEE EDGE), 2019, : 1 - 7
  • [38] Scaling for edge inference of deep neural networks
    Xu, Xiaowei
    Ding, Yukun
    Hu, Sharon Xiaobo
    Niemier, Michael
    Cong, Jason
    Hu, Yu
    Shi, Yiyu
    [J]. NATURE ELECTRONICS, 2018, 1 (04): : 216 - 222
  • [39] Update Compression for Deep Neural Networks on the Edge
    Chen, Bo
    Bakhshi, Ali
    Batista, Gustavo
    Ng, Brian
    Chin, Tat-Jun
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 3075 - 3085
  • [40] PhD Forum: Deep Neural Networks at the Edge
    Viramontes, Robert
    [J]. 2024 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING, SMARTCOMP 2024, 2024, : 260 - 261