Adaptive Load Balancing for Parameter Servers in Distributed Machine Learning over Heterogeneous Networks

Cited by: 1
Authors
CAI Weibo [1]
YANG Shulin [1]
SUN Gang [1]
ZHANG Qiming [2]
YU Hongfang [1]
Affiliations
[1] University of Electronic Science and Technology of China
[2] ZTE Corporation
Funding
National Natural Science Foundation of China
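Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TN91 [Communications]; TP181 [Automated reasoning, machine learning]
Subject classification codes
0810; 081001; 081104; 0812; 0835; 1405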
Abstract
In distributed machine learning (DML) based on the parameter server (PS) architecture, an unbalanced distribution of communication load across PSs significantly slows model synchronization in heterogeneous networks, because the available bandwidth is poorly utilized. To address this problem, a network-aware adaptive PS load distribution scheme is proposed, which accelerates model synchronization by proactively adjusting the communication load on PSs according to network states. We evaluate the proposed scheme on MXNet, a real-world distributed training platform, and the results show that our scheme achieves up to a 2.68 times speed-up of model training in a dynamic and heterogeneous network environment.
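The idea of adjusting PS load according to network state can be illustrated with a minimal sketch. This is not the authors' algorithm, and it does not use any MXNet API; it only assumes that each PS has a measured bandwidth and that the model is split into shards of known size. The names `assign_shards`, `sync_time`, `shard_sizes`, and `bandwidth` are hypothetical, introduced here for illustration.

```python
from typing import Dict, List


def assign_shards(shard_sizes: Dict[str, int],
                  bandwidth: Dict[str, float]) -> Dict[str, List[str]]:
    """Greedy placement (illustrative assumption, not the paper's method).

    Shards are visited from largest to smallest, and each one is placed on
    the PS whose estimated transfer time (load / bandwidth) would be lowest
    after receiving it.
    """
    load = {ps: 0 for ps in bandwidth}
    placement: Dict[str, List[str]] = {ps: [] for ps in bandwidth}
    for name, size in sorted(shard_sizes.items(), key=lambda kv: -kv[1]):
        best = min(bandwidth, key=lambda ps: (load[ps] + size) / bandwidth[ps])
        load[best] += size
        placement[best].append(name)
    return placement


def sync_time(placement: Dict[str, List[str]],
              shard_sizes: Dict[str, int],
              bandwidth: Dict[str, float]) -> float:
    """Synchronization is gated by the slowest PS: max over load / bandwidth."""
    return max(
        sum(shard_sizes[n] for n in names) / bandwidth[ps]
        for ps, names in placement.items()
    )


# Hypothetical setup: eight equal shards, one fast link and one slow link.
shard_sizes = {f"s{i}": 100 for i in range(8)}
bandwidth = {"ps0": 3.0, "ps1": 1.0}

uniform = {"ps0": ["s0", "s1", "s2", "s3"], "ps1": ["s4", "s5", "s6", "s7"]}
adaptive = assign_shards(shard_sizes, bandwidth)

print("uniform :", sync_time(uniform, shard_sizes, bandwidth))
print("adaptive:", sync_time(adaptive, shard_sizes, bandwidth))
```

In this toy setting the even split leaves the slow link as the bottleneck, while the bandwidth-aware placement shifts load toward the faster PS. A dynamic scheme would rerun the placement whenever the measured bandwidths change.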
Pages: 72-80
Number of pages: 9