Accelerated Federated Learning Over Wireless Fading Channels With Adaptive Stochastic Momentum

被引:0
|
作者
Su, Liqun [1 ]
Lau, Vincent K. N. [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
来源
IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 08期
关键词
Stochastic processes; Training; Heuristic algorithms; Convergence; Servers; Adaptation models; Computational modeling; Accelerated algorithm; federated learning (FL); momentum method; stochastic differential equation (SDE); CONVERGENCE;
D O I
10.1109/JIOT.2023.3340014
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Federated learning, as a well-known framework for collaborative training among distributed local sensors and devices, has been widely used in practical learning applications. To reduce communication resource consumption and training delay, acceleration training algorithms, especially momentum-based methods, are further developed for the training process. However, it is observed that under the influence of transmission noise, existing momentum methods exhibit poor training performance due to the noise accumulation along with the momentum term. This motivates us to propose a novel acceleration algorithm to achieve an efficient tradeoff between the training acceleration and noise smoothing. Specifically, to obtain clearer insights into the model update dynamics, we utilize a stochastic differential equation (SDE) model to mimic the discrete-time (DT) training trajectory. Through high-order drift approximation analysis on a general momentum-based SDE model, we propose a dynamic momentum weight and gradient stepsize design for the update rule, which is adaptive to both the training state and gradient quality. Such adaptation ensures that the training algorithm can seize good update opportunities and avoid noise explosion. The corresponding DT training algorithm is then derived via discretization of the proposed SDE model, which shows a superior training performance compared to state-of-the-art baselines.
引用
收藏
页码:14136 / 14152
页数:17
相关论文
共 50 条
  • [31] Gradient Statistics Aware Power Control for Over-the-Air Federated Learning in Fading Channels
    Zhang, Naifu
    Tao, Meixia
    2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2020,
  • [32] Adaptive Noise Aggregation Based Secure Image Transmission over Wireless Fading Channels
    Huang, Liwei
    Ren, Pinyi
    Xu, Dongyang
    2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 37 - 42
  • [33] Adaptive multicast with power and rate control over fading channels in mobile wireless networks
    Du, Qinghe
    Zhang, Xi
    2007 IEEE WIRELESS COMMUNICATIONS & NETWORKING CONFERENCE, VOLS 1-9, 2007, : 3337 - 3342
  • [34] Adaptive modulation over Nakagami fading channels
    Alouini, Mohamed-Slim
    Goldsmith, Andrea J.
    Wireless Personal Communications, 2000, 13 (01) : 119 - 143
  • [35] Adaptive Modulation over Nakagami Fading Channels
    Mohamed-Slim Alouini
    Andrea J. Goldsmith
    Wireless Personal Communications, 2000, 13 : 119 - 143
  • [36] Adaptive modulation over Nakagami fading channels
    Alouini, MS
    Goldsmith, AJ
    WIRELESS PERSONAL COMMUNICATIONS, 2000, 13 (1-2) : 119 - 143
  • [37] Adaptive communications over fading satellite channels
    Choi, JP
    Chan, VWS
    2001 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-10, CONFERENCE RECORD, 2001, : 2635 - 2639
  • [38] Adaptive Modulation for Wireless Federated Edge Learning
    Xu, Xinyi
    Yu, Guanding
    Liu, Shengli
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2023, 9 (04) : 1096 - 1109
  • [39] Adaptive Network Pruning for Wireless Federated Learning
    Liu, Shengli
    Yu, Guanding
    Yin, Rui
    Yuan, Jiantao
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2021, 10 (07) : 1572 - 1576
  • [40] Online Power Control for Distributed Multitask Learning Over Noisy Fading Wireless Channels
    Zhang, Kai
    Cao, Xuanyu
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2023, 71 : 3679 - 3694