Distributed Deep Learning in An Edge Computing System

被引:0
|
作者
Sen, Tanmoy [1 ]
Shen, Haiying [1 ]
Mehrab, Zakaria [1 ]
机构
[1] Univ Virginia, Dept Comp Sci, Charlottesville, VA 22903 USA
关键词
D O I
10.1109/MASS56207.2022.00095
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In many scenarios (e.g., hurricanes, earthquake, rural areas), edge devices cannot access the cloud, which makes the cloud deep learning (DL) training approach inapplicable. However, an edge device may not be able to train a large-scale DL model due to its resource constraints. Though there are mobile-friendly DL models (e.g., mobilnet, shufflenet), it cannot meet the needs for different Deep Neural Networks (DNNs) and also model compression sacrifices accuracy. Distributed DL training among multiple edge devices is a solution. However, it poses challenges about how to partition a DNN model and assign the partitions among edge devices considering the DNN features and the resource availability, and how to handle edge overload to reduce the overall job time and accuracy loss. To handle the challenges, we propose both heuristic and Reinforcement Learning (RL) based DL job schedulers by leveraging DL job features. Our container-based emulation and real device experiments show that our job schedulers achieve up to 82% improvement on training time and 70% on consumed energy over comparison methods. We also open sourced our source code.
引用
收藏
页码:645 / 653
页数:9
相关论文
共 50 条
  • [1] EDDL: A Distributed Deep Learning System for Resource-limited Edge Computing Environment
    Hao, Pengzhan
    Zhang, Yifan
    [J]. 2021 ACM/IEEE 6TH SYMPOSIUM ON EDGE COMPUTING (SEC 2021), 2021, : 1 - 13
  • [2] Parameter Servers Placement for Distributed Deep Learning in Edge computing
    Yan, Jiaquan
    Wu, Yalan
    Wu, Jigang
    Chen, Long
    [J]. 19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 398 - 404
  • [3] Grouping Synchronous to Eliminate Stragglers with Edge Computing in Distributed Deep Learning
    Gui, Zhiyi
    Yang, Xiang
    Yang, Hao
    Li, Wei
    Zhang, Lei
    Qi, Qi
    Wang, Jingyu
    Sun, Haifeng
    Liao, Jianxin
    [J]. 19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 429 - 436
  • [4] Efficient Parameter Server Placement for Distributed Deep Learning in Edge Computing
    Wu, Yalan
    Yan, Jiaquan
    Chen, Long
    Wu, Jigang
    Li, Yidong
    [J]. COMPUTER JOURNAL, 2023, 66 (03): : 678 - 691
  • [5] Distributed Edge Computing Offloading Algorithm Based on Deep Reinforcement Learning
    Li, Yunzhao
    Qi, Feng
    Wang, Zhili
    Yu, Xiuming
    Shao, Sujie
    [J]. IEEE ACCESS, 2020, 8 : 85204 - 85215
  • [6] Distributed hierarchical deep optimization for federated learning in mobile edge computing
    Zheng, Xiao
    Shah, Syed Bilal Hussain
    Bashir, Ali Kashif
    Nawaz, Raheel
    Rana, Umer
    [J]. COMPUTER COMMUNICATIONS, 2022, 194 : 321 - 328
  • [7] A Distributed Hierarchical Deep Computation Model for Federated Learning in Edge Computing
    Zheng, Haifeng
    Gao, Min
    Chen, Zhizhang
    Feng, Xinxin
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (12) : 7946 - 7956
  • [8] Research and Application of Edge Computing and Deep Learning in a Recommender System
    Hao, Xiaopei
    Shan, Xinghua
    Zhang, Junfeng
    Meng, Ge
    Jiang, Lin
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (23):
  • [9] A Bayesian Deep Learning Network System Based on Edge Computing
    Liu, Lei
    [J]. INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2023, 20 (02N03)
  • [10] Distributed Deep Learning-based Offloading for Mobile Edge Computing Networks
    Huang, Liang
    Feng, Xu
    Feng, Anqi
    Huang, Yupin
    Qian, Li Ping
    [J]. MOBILE NETWORKS & APPLICATIONS, 2022, 27 (03): : 1123 - 1130