Placement of Parameter Server in Wide Area Network Topology for Geo-Distributed Machine Learning

被引:3
|
作者
Li, Yongyao [1 ]
Fan, Chenyu [2 ]
Zhang, Xiaoning [2 ]
Chen, Yufeng [1 ]
机构
[1] Macau Univ Sci & Technol, Ringgold Std Inst, Macau, Peoples R China
[2] Univ Elect Sci & Technol China, Ringgold Stand Inst, Sch Informat & Commun Engn, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Geo-distributed machine learning; routing; wide area networks; ALGORITHMS;
D O I
10.23919/JCN.2023.000021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
learning (ML) is extensively used in a wide range of real-world applications that require data all around world to pursue high accuracy of a global model. Unfortunately, it is impossible to transmit all gathered raw data to a central data center for training due to data privacy, data sovereignty and high communication cost. This brings the idea of geodistributed machine learning (Geo-DML), which completes the training of the global ML model across multiple data centers with the bottleneck of high communication cost over the limited wide area networks (WAN) bandwidth. In this paper, we study on the problem of parameter server (PS) placement in PS architecture for communication efficiency of Geo-DML. Our optimization aims to select an appropriate data center as the PS for global training algorithm based on the communication cost. We prove the PS placement problem is NP-hard. Further, we develop an approximation algorithm to solve the problem using the randomized rounding method. In order to validate the performance of our proposed algorithm, we conduct large-scale simulations, and the simulation results on two typical carrier network topologies show that our proposed algorithm can reduce the communication cost up to 61.78% over B4 topology and 21.78% over Internet2 network topology.
引用
收藏
页码:370 / 380
页数:11
相关论文
共 50 条
  • [31] Scalable and Adaptive Data Replica Placement for Geo-Distributed Cloud Storages
    Liu, Kaiyang
    Peng, Jun
    Wang, Jingrong
    Liu, Weirong
    Huang, Zhiwu
    Pan, Jianping
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (07) : 1575 - 1587
  • [32] Learning-based power prediction for geo-distributed Data Centers: weather parameter analysis
    Somayyeh Taheri
    Maziar Goudarzi
    Osamu Yoshie
    Journal of Big Data, 7
  • [33] Learning-based power prediction for geo-distributed Data Centers: weather parameter analysis
    Taheri, Somayyeh
    Goudarzi, Maziar
    Yoshie, Osamu
    JOURNAL OF BIG DATA, 2020, 7 (01)
  • [34] Achieving Cost Optimization for Tenant Task Placement in Geo-Distributed Clouds
    Luo, Luyao
    Zhao, Gongming
    Xu, Hongli
    Yu, Zhuolong
    Xie, Liguang
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2024, 32 (02) : 1391 - 1406
  • [35] Joint Data Purchasing and Data Placement in a Geo-Distributed Data Market
    Ren, Xiaoqi
    London, Palma
    Ziani, Juba
    Wierman, Adam
    SIGMETRICS/PERFORMANCE 2016: PROCEEDINGS OF THE SIGMETRICS/PERFORMANCE JOINT INTERNATIONAL CONFERENCE ON MEASUREMENT AND MODELING OF COMPUTER SCIENCE, 2016, : 383 - 384
  • [36] Placement of High Availability Geo-Distributed Data Centers in Emerging Economies
    Liu, Ruiyun
    Sun, Weiqiang
    Hu, Weisheng
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2023, 11 (03) : 3274 - 3288
  • [37] Topology-Aware Resource-Efficient Placement for High Availability Clusters Over Geo-Distributed Cloud Infrastructure
    Do, Truong-Xuan
    Kim, Younghan
    IEEE ACCESS, 2019, 7 : 107234 - 107246
  • [38] SNR: Network-aware Geo-Distributed Stream Analytics
    Mostafaei, Habib
    Afridi, Shafi
    Abawajy, Jemal H.
    21ST IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2021), 2021, : 820 - 827
  • [39] Efficiently Embedding Service Function Chains with Dynamic Virtual Network Function Placement in Geo-Distributed Cloud System
    Pei, Jianing
    Hong, Peilin
    Xue, Kaiping
    Li, Defang
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (10) : 2179 - 2192
  • [40] JointPS: Joint Parameter Server Placement and Flow Scheduling for Machine Learning Clusters
    Zhao, Yangming
    Yang, Cheng
    Zhao, Gongming
    Hou, Yunfei
    Wang, Ting
    Qiao, Chunming
    IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (12) : 3503 - 3518