Distributed Deep Learning in An Edge Computing System

被引：0

作者：

Sen, Tanmoy ^{[1
]}

Shen, Haiying ^{[1
]}

Mehrab, Zakaria ^{[1
]}

机构：

[1] Univ Virginia, Dept Comp Sci, Charlottesville, VA 22903 USA

来源：

2022 IEEE 19TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SMART SYSTEMS (MASS 2022) | 2022年

关键词：

D O I：

10.1109/MASS56207.2022.00095

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In many scenarios (e.g., hurricanes, earthquake, rural areas), edge devices cannot access the cloud, which makes the cloud deep learning (DL) training approach inapplicable. However, an edge device may not be able to train a large-scale DL model due to its resource constraints. Though there are mobile-friendly DL models (e.g., mobilnet, shufflenet), it cannot meet the needs for different Deep Neural Networks (DNNs) and also model compression sacrifices accuracy. Distributed DL training among multiple edge devices is a solution. However, it poses challenges about how to partition a DNN model and assign the partitions among edge devices considering the DNN features and the resource availability, and how to handle edge overload to reduce the overall job time and accuracy loss. To handle the challenges, we propose both heuristic and Reinforcement Learning (RL) based DL job schedulers by leveraging DL job features. Our container-based emulation and real device experiments show that our job schedulers achieve up to 82% improvement on training time and 70% on consumed energy over comparison methods. We also open sourced our source code.

引用

页码：645 / 653

页数：9

共 50 条

[1] EDDL: A Distributed Deep Learning System for Resource-limited Edge Computing Environment
Hao, Pengzhan
Zhang, Yifan
[J]. 2021 ACM/IEEE 6TH SYMPOSIUM ON EDGE COMPUTING (SEC 2021), 2021, : 1 - 13
[2] Parameter Servers Placement for Distributed Deep Learning in Edge computing
Yan, Jiaquan
Wu, Yalan
Wu, Jigang
Chen, Long
[J]. 19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 398 - 404
[3] Grouping Synchronous to Eliminate Stragglers with Edge Computing in Distributed Deep Learning
Gui, Zhiyi
Yang, Xiang
Yang, Hao
Li, Wei
Zhang, Lei
Qi, Qi
Wang, Jingyu
Sun, Haifeng
Liao, Jianxin
[J]. 19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 429 - 436
[4] Efficient Parameter Server Placement for Distributed Deep Learning in Edge Computing
Wu, Yalan
Yan, Jiaquan
Chen, Long
Wu, Jigang
Li, Yidong
[J]. COMPUTER JOURNAL, 2023, 66 (03): : 678 - 691
[5] Distributed Edge Computing Offloading Algorithm Based on Deep Reinforcement Learning
Li, Yunzhao
Qi, Feng
Wang, Zhili
Yu, Xiuming
Shao, Sujie
[J]. IEEE ACCESS, 2020, 8 : 85204 - 85215
[6] Distributed hierarchical deep optimization for federated learning in mobile edge computing
Zheng, Xiao
Shah, Syed Bilal Hussain
Bashir, Ali Kashif
Nawaz, Raheel
Rana, Umer
[J]. COMPUTER COMMUNICATIONS, 2022, 194 : 321 - 328
[7] A Distributed Hierarchical Deep Computation Model for Federated Learning in Edge Computing
Zheng, Haifeng
Gao, Min
Chen, Zhizhang
Feng, Xinxin
[J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (12) : 7946 - 7956
[8] Research and Application of Edge Computing and Deep Learning in a Recommender System
Hao, Xiaopei
Shan, Xinghua
Zhang, Junfeng
Meng, Ge
Jiang, Lin
[J]. APPLIED SCIENCES-BASEL, 2023, 13 (23):
[9] A Bayesian Deep Learning Network System Based on Edge Computing
Liu, Lei
[J]. INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2023, 20 (02N03)
[10] Distributed Deep Learning-based Offloading for Mobile Edge Computing Networks
Huang, Liang
Feng, Xu
Feng, Anqi
Huang, Yupin
Qian, Li Ping
[J]. MOBILE NETWORKS & APPLICATIONS, 2022, 27 (03): : 1123 - 1130

← 1 2 3 4 5 →