Adaptive Load Balancing for Parameter Servers in Distributed Machine Learning over Heterogeneous Networks

被引:1
|
作者
CAI Weibo [1 ]
YANG Shulin [1 ]
SUN Gang [1 ]
ZHANG Qiming [2 ]
YU Hongfang [1 ]
机构
[1] University of Electronic Science and Technology of China
[2] ZTE Corporation
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TN91 [通信]; TP181 [自动推理、机器学习];
学科分类号
0810 ; 081001 ; 081104 ; 0812 ; 0835 ; 1405 ;
摘要
In distributed machine learning(DML) based on the parameter server(PS) architecture, unbalanced communication load distribution of PSs will lead to a significant slowdown of model synchronization in heterogeneous networks due to low utilization of bandwidth. To address this problem, a network-aware adaptive PS load distribution scheme is proposed, which accelerates model synchronization by proactively adjusting the communication load on PSs according to network states. We evaluate the proposed scheme on MXNet, known as a realworld distributed training platform, and results show that our scheme achieves up to 2.68 times speed-up of model training in the dynamic and heterogeneous network environment.
引用
收藏
页码:72 / 80
页数:9
相关论文
共 50 条
  • [41] MMPacking: A load and storage balancing algorithm for distributed multimedia servers
    Serpanos, DN
    Georgiadis, L
    Bouloutas, T
    INTERNATIONAL CONFERENCE ON COMPUTER DESIGN - VLSI IN COMPUTERS AND PROCESSORS, PROCEEDINGS, 1996, : 170 - 174
  • [42] DISTRIBUTED COUPLED LEARNING OVER ADAPTIVE NETWORKS
    Alghunaim, Sulaiman A.
    Sayed, Ali H.
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6353 - 6357
  • [43] On fully distributed adaptive load balancing
    Breitgand, David
    Cohen, Rami
    Nahir, Amir
    Raz, Danny
    MANAGING VIRTUALIZATION OF NETWORKS AND SERVICES, PROCEEDINGS, 2007, 4785 : 74 - +
  • [44] Grid based load balancing algorithm over heterogeneous wireless networks
    Shi, Wen-Xiao
    Zhang, Ge
    Wang, Ji-Hong
    Zhao, Ying
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2013, 43 (03): : 788 - 793
  • [45] Load balancing over heterogeneous networks with gossip-based algorithms
    Franceschelli, Mauro
    Giua, Alessandro
    Seatzu, Carla
    2009 AMERICAN CONTROL CONFERENCE, VOLS 1-9, 2009, : 1987 - 1993
  • [46] Distributed machine learning load balancing strategy in cloud computing services
    Mingwei Li
    Jilin Zhang
    Jian Wan
    Yongjian Ren
    Li Zhou
    Baofu Wu
    Rui Yang
    Jue Wang
    Wireless Networks, 2020, 26 : 5517 - 5533
  • [47] Distributed machine learning load balancing strategy in cloud computing services
    Li, Mingwei
    Zhang, Jilin
    Wan, Jian
    Ren, Yongjian
    Zhou, Li
    Wu, Baofu
    Yang, Rui
    Wang, Jue
    WIRELESS NETWORKS, 2020, 26 (08) : 5517 - 5533
  • [48] DGLB: Distributed Stochastic Geographical Load Balancing over Cloud Networks
    Chen, Tianyi
    Marques, Antonio G.
    Giannakis, Georgios B.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (07) : 1866 - 1880
  • [49] Towards adaptive load balancing services for web application servers
    Fan, Guo-Chuang
    Zhu, Huan
    Huang, Tao
    Feng, Yu-Lin
    Ruan Jian Xue Bao/Journal of Software, 2003, 14 (06): : 1134 - 1141
  • [50] Autonomous load balancing of heterogeneous networks
    Kreuger, Per
    Gornerup, Olof
    Gillblad, Daniel
    Lundborg, Tomas
    Corcoran, Diarmuid
    Ermedahl, Andreas
    2015 IEEE 81ST VEHICULAR TECHNOLOGY CONFERENCE (VTC SPRING), 2015,