Federated Learning With Nesterov Accelerated Gradient

Cited by: 11
Authors
Yang, Zhengjie [1 ]
Bao, Wei [1 ]
Yuan, Dong [2 ]
Tran, Nguyen H. [1 ]
Zomaya, Albert Y. [1 ]
Affiliations
[1] Univ Sydney, Sch Comp Sci, Sydney, NSW 2006, Australia
[2] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2006, Australia
Funding
Australian Research Council;
Keywords
Convergence; Training; Computational modeling; Collaborative work; Quantization (signal); Servers; Internet of Things; Edge computing; federated learning; Nesterov accelerated gradient;
DOI
10.1109/TPDS.2022.3206480
CLC Number
TP301 [Theory and Methods];
Discipline Code
081202;
Abstract
Federated learning (FL) is a fast-developing technique that allows multiple workers to train a global model based on a distributed dataset. Conventional FL (FedAvg) employs the gradient descent algorithm, which may not be efficient enough. Momentum improves on this by adding a momentum step that accelerates convergence, and it has demonstrated its benefits in both centralized and FL environments. It is well known that Nesterov Accelerated Gradient (NAG) is a more advantageous form of momentum, but so far it has been unclear how to quantify the benefits of NAG in FL. This motivates us to propose FedNAG, which employs NAG in each worker and aggregates both the NAG momentum and the model in the aggregator. We provide a detailed convergence analysis of FedNAG and compare it with FedAvg. Extensive experiments based on real-world datasets and trace-driven simulation demonstrate that FedNAG increases learning accuracy by 3-24% and decreases total training time by 11-70% compared with the benchmarks under a wide range of settings.
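To make the scheme concrete, below is a minimal sketch of one FedNAG-style communication round, assuming the look-ahead (Sutskever) form of NAG and size-weighted averaging of both the model and the momentum buffer. The names fednag_round and grad_fn are illustrative, not taken from the paper.

```python
import numpy as np

def fednag_round(workers, w_global, m_global, lr=0.01, beta=0.9, local_steps=5):
    """One FedNAG-style communication round (illustrative sketch).

    Each worker runs `local_steps` of Nesterov accelerated gradient starting
    from the shared (model, momentum) state; the aggregator then averages
    both the models and the momentum buffers, weighted by dataset size.
    """
    new_ws, new_ms, sizes = [], [], []
    for grad_fn, n_samples in workers:      # grad_fn(w) returns the local gradient at w
        w, m = w_global.copy(), m_global.copy()
        for _ in range(local_steps):
            g = grad_fn(w + beta * m)       # gradient at the "look-ahead" point: NAG's key step
            m = beta * m - lr * g           # momentum update
            w = w + m                       # model update
        new_ws.append(w)
        new_ms.append(m)
        sizes.append(n_samples)
    total = sum(sizes)
    w_agg = sum(s / total * w_i for s, w_i in zip(sizes, new_ws))  # weighted model average
    m_agg = sum(s / total * m_i for s, m_i in zip(sizes, new_ms))  # momentum is aggregated too
    return w_agg, m_agg

# Toy usage: two workers with quadratic objectives pulling toward +1 and -1.
workers = [
    (lambda w: 2.0 * (w - 1.0), 100),   # local minimizer at w = 1
    (lambda w: 2.0 * (w + 1.0), 300),   # local minimizer at w = -1
]
w, m = np.zeros(1), np.zeros(1)
for _ in range(50):
    w, m = fednag_round(workers, w, m)  # settles near the size-weighted optimum, w = -0.5
```

The detail that distinguishes this sketch from a FedAvg-with-local-momentum baseline is that the momentum buffers are sent to and averaged by the aggregator alongside the model, so the accelerated state carries over across communication rounds.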
Pages: 4863 - 4873
Number of pages: 11
Related Papers
50 records in total
  • [1] Adaptive Learning Rate Method Based on Nesterov Accelerated Gradient
    Xu, Zhenxing
    Yang, Ping
    Xu, Bing
    Li, Heping
    [J]. AOPC 2017: OPTICAL SENSING AND IMAGING TECHNOLOGY AND APPLICATIONS, 2017, 10462
  • [2] Iterative Learning Control Based on Nesterov Accelerated Gradient Method
    Gu, Panpan
    Tian, Senping
    Chen, Yangquan
    [J]. IEEE ACCESS, 2019, 7 : 115836 - 115842
  • [3] Accelerated Distributed Nesterov Gradient Descent
    Qu, Guannan
    Li, Na
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (06) : 2566 - 2581
  • [4] Accelerated Distributed Composite Nesterov Gradient Descent Algorithm
    Yue, Xinhui
    Ni, Yuan-Hua
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022: 1967 - 1972
  • [5] Nesterov Accelerated Shuffling Gradient Method for Convex Optimization
    Tran, Trang H.
    Scheinberg, Katya
    Nguyen, Lam M.
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
  • [6] NAGSC: NESTEROV'S ACCELERATED GRADIENT METHODS FOR SPARSE CODING
    Liu, Liang
    Zhang, Ling
    Dai, Xiangguang
    Feng, Yuming
    [J]. ITALIAN JOURNAL OF PURE AND APPLIED MATHEMATICS, 2018, (40): 724 - 735
  • [7] Accelerated Distributed Nesterov Gradient Descent for Convex and Smooth Functions
    Qu, Guannan
    Li, Na
    [J]. 2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017
  • [8] Nesterov’s Accelerated Gradient Descent: The Controlled Contraction Approach
    Gunjal, Revati
    Nayyer, Syed Shadab
    Wagh, Sushama
    Singh, Navdeep M.
    [J]. IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 163 - 168
  • [9] Ultrasound Image Reconstruction Using Nesterov's Accelerated Gradient
    Wang, Hongjian
    Dalkilic, Burak
    Gemmeke, Hartmut
    Hopp, Torsten
    Hesser, Juergen
    [J]. 2018 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE PROCEEDINGS (NSS/MIC), 2018
  • [10] On the Convergence of Nesterov's Accelerated Gradient Method in Stochastic Settings
    Assran, Mahmoud
    Rabbat, Michael
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020