Adaptive Load Balancing for Parameter Servers in Distributed Machine Learning over Heterogeneous Networks

Cited by: 1
Authors
CAI Weibo [1]
YANG Shulin [1]
SUN Gang [1]
ZHANG Qiming [2]
YU Hongfang [1]
Affiliations
[1] University of Electronic Science and Technology of China
[2] ZTE Corporation
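Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TN91 [Communications]; TP181 [Automated reasoning, machine learning]
Discipline classification codes
0810; 081001; 081104; 0812; 0835; 1405
Abstract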
In distributed machine learning (DML) based on the parameter server (PS) architecture, an unbalanced distribution of communication load across PSs significantly slows model synchronization in heterogeneous networks because bandwidth is underutilized. To address this problem, a network-aware adaptive PS load distribution scheme is proposed, which accelerates model synchronization by proactively adjusting the communication load on PSs according to network states. We evaluate the proposed scheme on MXNet, a real-world distributed training platform, and the results show that our scheme achieves up to a 2.68-fold speed-up of model training in a dynamic and heterogeneous network environment.
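The abstract does not spell out the scheme itself, so the following is only an illustrative sketch of the general idea of network-aware load distribution, not the authors' algorithm. It assumes each PS has a measured bandwidth and that the time to synchronize is bounded by the slowest PS; the function names `split_load` and `sync_time` and all numbers are hypothetical.

```python
def split_load(total_bytes, bandwidths):
    """Split total_bytes across servers in proportion to their bandwidth.

    Assumption: bandwidths are positive integers (e.g. Mbit/s) measured
    elsewhere; this is not the scheme from the paper.
    """
    if not bandwidths or any(bw <= 0 for bw in bandwidths):
        raise ValueError("bandwidths must be a non-empty list of positive ints")
    total_bw = sum(bandwidths)
    # Floor of each server's proportional share.
    shares = [total_bytes * bw // total_bw for bw in bandwidths]
    # Hand the bytes lost to flooring to the fastest servers first.
    remainder = total_bytes - sum(shares)
    fastest_first = sorted(range(len(bandwidths)),
                           key=lambda i: bandwidths[i], reverse=True)
    for i in fastest_first[:remainder]:
        shares[i] += 1
    return shares


def sync_time(shares, bandwidths):
    """Synchronization finishes when the slowest server finishes."""
    return max(s / bw for s, bw in zip(shares, bandwidths))


if __name__ == "__main__":
    bandwidths = [10, 30, 60]          # hypothetical heterogeneous links
    uniform = [334, 333, 333]          # equal split of 1000 units
    balanced = split_load(1000, bandwidths)
    print("uniform :", uniform, sync_time(uniform, bandwidths))
    print("balanced:", balanced, sync_time(balanced, bandwidths))
```

In this toy setting the uniform split is held back by the slowest link, while the bandwidth-proportional split lets all servers finish at about the same time. An adaptive scheme would repeat such a split whenever the measured network state changes.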
Pages: 72-80
Page count: 9