Placement of Parameter Server in Wide Area Network Topology for Geo-Distributed Machine Learning

被引:3
|
作者
Li, Yongyao [1 ]
Fan, Chenyu [2 ]
Zhang, Xiaoning [2 ]
Chen, Yufeng [1 ]
机构
[1] Macau Univ Sci & Technol, Ringgold Std Inst, Macau, Peoples R China
[2] Univ Elect Sci & Technol China, Ringgold Stand Inst, Sch Informat & Commun Engn, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Geo-distributed machine learning; routing; wide area networks; ALGORITHMS;
D O I
10.23919/JCN.2023.000021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
learning (ML) is extensively used in a wide range of real-world applications that require data all around world to pursue high accuracy of a global model. Unfortunately, it is impossible to transmit all gathered raw data to a central data center for training due to data privacy, data sovereignty and high communication cost. This brings the idea of geodistributed machine learning (Geo-DML), which completes the training of the global ML model across multiple data centers with the bottleneck of high communication cost over the limited wide area networks (WAN) bandwidth. In this paper, we study on the problem of parameter server (PS) placement in PS architecture for communication efficiency of Geo-DML. Our optimization aims to select an appropriate data center as the PS for global training algorithm based on the communication cost. We prove the PS placement problem is NP-hard. Further, we develop an approximation algorithm to solve the problem using the randomized rounding method. In order to validate the performance of our proposed algorithm, we conduct large-scale simulations, and the simulation results on two typical carrier network topologies show that our proposed algorithm can reduce the communication cost up to 61.78% over B4 topology and 21.78% over Internet2 network topology.
引用
收藏
页码:370 / 380
页数:11
相关论文
共 50 条
  • [21] Predictive Placement of Geo-distributed Blockchain Nodes for Performance Guarantee
    Lee, Junseok
    Yoo, Yeonho
    Yoo, Chuck
    Yang, Gyeongsik
    2024 IEEE 17TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, CLOUD 2024, 2024, : 66 - 68
  • [22] Nitro: Network-Aware Virtual Machine Image Management in Geo-Distributed Clouds
    Darrous, Jad
    Ibrahim, Shadi
    Zhou, Amelie Chi
    Perez, Christian
    2018 18TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2018, : 553 - 562
  • [23] The Role of Network Topology for Distributed Machine Learning
    Neglia, Giovanni
    Calbi, Gianmarco
    Towsley, Don
    Vardoyan, Gayane
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2019), 2019, : 2350 - 2358
  • [24] Optimal Online Data Partitioning for Geo-Distributed Machine Learning in Edge of Wireless Networks
    Lyu, Xinchen
    Ren, Chenshan
    Ni, Wei
    Tian, Hui
    Liu, Ren Ping
    Dutkiewicz, Eryk
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (10) : 2393 - 2406
  • [25] Efficient Parameter Server Placement for Distributed Deep Learning in Edge Computing
    Wu, Yalan
    Yan, Jiaquan
    Chen, Long
    Wu, Jigang
    Li, Yidong
    COMPUTER JOURNAL, 2023, 66 (03): : 678 - 691
  • [26] DOSP: an optimal synchronization of parameter server for distributed machine learning
    Meiguang Zheng
    Dongbang Mao
    Liu Yang
    Yeming Wei
    Zhigang Hu
    The Journal of Supercomputing, 2022, 78 : 13865 - 13892
  • [27] DOSP: an optimal synchronization of parameter server for distributed machine learning
    Zheng, Meiguang
    Mao, Dongbang
    Yang, Liu
    Wei, Yeming
    Hu, Zhigang
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (12): : 13865 - 13892
  • [28] Investigation of Network Traffic in Geo-Distributed Data Centers
    Koshiba, Yutaka
    Chen, Wuhui
    Yamada, Yuichi
    Tanaka, Takazumi
    Paik, Incheon
    2015 IEEE 7TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE & TECHNOLOGY (ICAST), 2015, : 174 - 179
  • [29] Accelerating model synchronization for distributed machine learning in an optical wide area network
    Liu, Ling
    Song, Liangjun
    Chen, Xi
    Yu, Hongfang
    Sun, Gang
    JOURNAL OF OPTICAL COMMUNICATIONS AND NETWORKING, 2022, 14 (10) : 852 - 865
  • [30] Distributed Profitable Deployment of Network Services to Geo-distributed Edge Systems
    Chen, Yi-Chia
    Yen, Li-Hsing
    APNOMS 2020: 2020 21ST ASIA-PACIFIC NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (APNOMS), 2020, : 208 - 213