Communication-Constrained Distributed Learning: TSI-Aided Asynchronous Optimization with Stale Gradient

被引:0
|
作者
Yu, Siyuan [1 ,2 ]
Chen, Wei [1 ,2 ]
Poor, H. Vincent [3 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[2] Beijing Natl Res Ctr Informat Sci & Technol BNRis, Beijing, Peoples R China
[3] Princeton Univ, Dept Elect & Comp Engn, Princeton, NJ 08544 USA
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Asynchronous optimization; stochastic gradient descent; timing side information; gradient staleness; federated learning;
D O I
10.1109/GLOBECOM54140.2023.10437351
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Distributed machine learning including federated learning has attracted considerable attention due to its potential of scaling the computational resources, reducing the training time, and helping protect the user privacy. As one of key enablers of distributed learning, asynchronous optimization allows multiple workers to process data simultaneously without paying a cost of synchronization delay. However, given limited communication bandwidth, asynchronous optimization can be hampered by gradient staleness, which severely hinders the learning process. In this paper, we present a communication-constrained distributed learning scheme, in which asynchronous stochastic gradients generated by parallel workers are transmitted over a shared medium or link. Our aim is to minimize the average training time by striking the optimal tradeoff between the number of parallel workers and their gradient staleness. To this end, a queueing theoretic model is formulated, which allows us to find the optimal number of workers participating in the asynchronous optimization. Furthermore, we also leverage the packet arrival time at the parameter server, also referred to as Timing Side Information (TSI), to compress the staleness information for the staleness-aware Asynchronous Stochastic Gradients Descent (Asyn-SGD). Numerical results demonstrate the substantial reduction of training time owing to both the worker selection and TSI-aided compression of staleness information.
引用
收藏
页码:1495 / 1500
页数:6
相关论文
共 50 条
  • [1] Communication-Constrained Distributed Task Assignment
    Jackson, Justin
    Faied, Mariam
    Kabamba, Pierre
    Girard, Anouck
    2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 570 - 577
  • [2] Communication-constrained distributed radar detection in spiky clutter
    Lombardini, F
    Verrazzani, L
    PROCEEDINGS OF THE 1996 IEEE NATIONAL RADAR CONFERENCE, 1996, : 285 - 290
  • [3] Communication-Constrained Distributed Quantile Regression with Optimal Statistical Guarantees
    Tan, Kean Ming
    Battey, Heather
    Zhou, Wen-Xin
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23 : 1 - 61
  • [4] Communication-Constrained Distributed Quantile Regression with Optimal Statistical Guarantees
    Tan, Kean Ming
    Battey, Heather
    Zhou, Wen-Xin
    Journal of Machine Learning Research, 2022, 23
  • [5] Distributed control scheme for motor networks with communication-constrained channels
    Univ of Wisconsin - Madison, Madison, United States
    Proc IEEE Int Conf Rob Autom, (207-212):
  • [6] A distributed control scheme for motor networks with communication-constrained channels
    Kim, K
    Ferrier, NJ
    ICRA '99: IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-4, PROCEEDINGS, 1999, : 207 - 212
  • [7] Distributed Asynchronous Constrained Stochastic Optimization
    Srivastava, Kunal
    Nedic, Angelia
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (04) : 772 - 790
  • [8] Asynchronous SGD with stale gradient dynamic adjustment for deep learning training
    Tan, Tao
    Xie, Hong
    Xia, Yunni
    Shi, Xiaoyu
    Shang, Mingsheng
    INFORMATION SCIENCES, 2024, 681
  • [9] Asynchronous Distributed Optimization with Minimal Communication
    Zhong, Minyi
    Cassandras, Christos G.
    47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 363 - 368
  • [10] A distributed task allocation method for heterogeneous UAVs in dynamic and communication-constrained environments
    Shaokun Yan
    Yuanqing Xia
    Yan, Shaokun (yanshaokun@foxmail.com), 2025, 81 (01):