Placement of Parameter Server in Wide Area Network Topology for Geo-Distributed Machine Learning

被引:3
|
作者
Li, Yongyao [1 ]
Fan, Chenyu [2 ]
Zhang, Xiaoning [2 ]
Chen, Yufeng [1 ]
机构
[1] Macau Univ Sci & Technol, Ringgold Std Inst, Macau, Peoples R China
[2] Univ Elect Sci & Technol China, Ringgold Stand Inst, Sch Informat & Commun Engn, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Geo-distributed machine learning; routing; wide area networks; ALGORITHMS;
D O I
10.23919/JCN.2023.000021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
learning (ML) is extensively used in a wide range of real-world applications that require data all around world to pursue high accuracy of a global model. Unfortunately, it is impossible to transmit all gathered raw data to a central data center for training due to data privacy, data sovereignty and high communication cost. This brings the idea of geodistributed machine learning (Geo-DML), which completes the training of the global ML model across multiple data centers with the bottleneck of high communication cost over the limited wide area networks (WAN) bandwidth. In this paper, we study on the problem of parameter server (PS) placement in PS architecture for communication efficiency of Geo-DML. Our optimization aims to select an appropriate data center as the PS for global training algorithm based on the communication cost. We prove the PS placement problem is NP-hard. Further, we develop an approximation algorithm to solve the problem using the randomized rounding method. In order to validate the performance of our proposed algorithm, we conduct large-scale simulations, and the simulation results on two typical carrier network topologies show that our proposed algorithm can reduce the communication cost up to 61.78% over B4 topology and 21.78% over Internet2 network topology.
引用
收藏
页码:370 / 380
页数:11
相关论文
共 50 条
  • [41] Scalable Data Placement of Data-intensive Services in Geo-distributed Clouds
    Atrey, Ankita
    Van Seghbroeck, Gregory
    Volckaert, Bruno
    De Turck, Filip
    CLOSER: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE, 2018, : 497 - 508
  • [42] A Framework of Hypergraph-Based Data Placement Among Geo-Distributed Datacenters
    Yu, Boyang
    Pan, Jianping
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2020, 13 (03) : 395 - 409
  • [43] Datum: Managing Data Purchasing and Data Placement in a Geo-Distributed Data Market
    Ren, Xiaoqi
    London, Palma
    Ziani, Juba
    Wierman, Adam
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2018, 26 (02) : 893 - 905
  • [44] Location-Aware Data Placement for Geo-distributed Online Social Networks
    Zhou, Jingya
    Fan, Jianxi
    Jia, Juncheng
    Cheng, Baolei
    Liu, Zhao
    2016 FOURTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD 2016), 2016, : 234 - 239
  • [45] Online Training Flow Scheduling for Geo-Distributed Machine Learning Jobs Over Heterogeneous and Dynamic Networks
    Fan, Lang
    Zhang, Xiaoning
    Zhao, Yangming
    Sood, Keshav
    Yu, Shui
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2024, 10 (01) : 277 - 291
  • [46] Self-Adaptive Gradient Quantization for Geo-Distributed Machine Learning Over Heterogeneous and Dynamic Networks
    Fan, Chenyu
    Zhang, Xiaoning
    Zhao, Yangming
    Liu, Yutao
    Yu, Shui
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2023, 11 (04) : 3483 - 3496
  • [47] A Network Cost-aware Geo-distributed Data Analytics System
    Oh, Kwangsung
    Chandra, Abhishek
    Weissman, Jon
    2020 20TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2020), 2020, : 649 - 658
  • [48] Simulated Analysis of Server Placement on Network Topology Designs
    Habib, Sami J.
    3RD ACS/IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, 2005, 2005,
  • [49] Smart Partitioning of Geo-Distributed Resources to Improve Cloud Network Performance
    Sajjad, Hooman Peiro
    Rahimian, Fatemeh
    Vlassov, Vladimir
    2015 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD NETWORKING (CLOUDNET), 2015, : 112 - 118
  • [50] Network Cost-Aware Geo-Distributed Data Analytics System
    Oh, Kwangsung
    Zhang, Minmin
    Chandra, Abhishek
    Weissman, Jon
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (06) : 1407 - 1420