Distributed Adaptive Subgradient Algorithms for Online Learning Over Time-Varying Networks

被引:11
|
作者
Zhang, Mingchuan [1 ]
Hao, Bowei [1 ]
Ge, Quanbo [2 ]
Zhu, Junlong [1 ]
Zheng, Ruijuan [1 ]
Wu, Qingtao [1 ]
机构
[1] Henan Univ Sci & Technol, Sch Informat Engn, Luoyang 471023, Peoples R China
[2] Anhui Univ, Sch Artificial Intelligence, Hefei 230039, Peoples R China
基金
中国国家自然科学基金;
关键词
Optimization; Heuristic algorithms; Training; Linear programming; Convergence; Machine learning algorithms; Deep learning; Adaptive subgradient algorithms; generalization capacity; regret bound; OPTIMIZATION; CONSENSUS; CONVERGENCE;
D O I
10.1109/TSMC.2021.3097714
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Adaptive gradient algorithms have recently become extremely popular because they have been applied successfully in training deep neural networks, such as Adam, AMSGrad, and AdaBound. Despite their success, however, the distributed variant of the adaptive method, which is expected to possess a rapid training speed at the beginning and a good generalization capacity at the end, is rarely studied. To fill the gap, a distributed adaptive subgradient algorithm is presented, called D-AdaBound, where the learning rates are dynamically bounded by clipping the learning rates. Moreover, we obtain the regret bound of D-AdaBound, in which the objective functions are convex. Finally, we confirm the effectiveness of D-AdaBound by simulation experiments on different datasets. The results show the performance improvement of D-AdaBound relative to existing distributed online learning algorithms.
引用
收藏
页码:4518 / 4529
页数:12
相关论文
共 50 条
  • [1] Distributed online adaptive subgradient optimization with dynamic bound of learning rate over time-varying networks
    Fang, Runyue
    Li, Dequan
    Shen, Xiuyu
    [J]. IET CONTROL THEORY AND APPLICATIONS, 2022, 16 (18): : 1834 - 1846
  • [2] Differentially Private Distributed Online Algorithms Over Time-Varying Directed Networks
    Zhu, Junlong
    Xu, Changqiao
    Guan, Jianfeng
    Wu, Dapeng Oliver
    [J]. IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2018, 4 (01): : 4 - 17
  • [3] Distributed adaptive clustering learning over time-varying multitask networks
    Shi, Qing
    Chen, Feng
    Li, Xinyu
    Duan, Shukai
    [J]. INFORMATION SCIENCES, 2021, 567 : 278 - 297
  • [4] Distributed Online Learning Algorithms for Aggregative Games Over Time-Varying Unbalanced Digraphs
    Zuo, Xiaolong
    Deng, Zhenhua
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 2278 - 2283
  • [5] Provable distributed adaptive temporal-difference learning over time-varying networks*
    Zhu, Junlong
    Li, Bing
    Wang, Lin
    Zhang, Mingchuan
    Xing, Ling
    Xi, Jiangtao
    Wu, Qingtao
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 228
  • [6] Distributed Constrained Optimization With Delayed Subgradient Information Over Time-Varying Network Under Adaptive Quantization
    Liu, Jie
    Yu, Zhan
    Ho, Daniel W. C.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 143 - 156
  • [7] Distributed Stochastic Subgradient Projection Algorithms Based on Weight-Balancing over Time-Varying Directed Graphs
    Zhu, Junlong
    Xie, Ping
    Zhang, Mingchuan
    Zheng, Ruijuan
    Xing, Ling
    Wu, Qingtao
    [J]. COMPLEXITY, 2019, 2019
  • [8] Distributed subgradient-push online convex optimization on time-varying directed graphs
    Akbari, Mohammad
    Gharesifard, Bahman
    Linder, Tamas
    [J]. 2014 52ND ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2014, : 264 - 269
  • [9] DISTRIBUTED SUBGRADIENT-FREE STOCHASTIC OPTIMIZATION ALGORITHM FOR NONSMOOTH CONVEX FUNCTIONS OVER TIME-VARYING NETWORKS
    Wang, Yinghui
    Zhao, Wenxiao
    Hong, Yiguang
    Zamani, Mohsen
    [J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2019, 57 (04) : 2821 - 2842
  • [10] DISTRIBUTED NONCONVEX OPTIMIZATION OVER TIME-VARYING NETWORKS
    Di Lorenzo, Paolo
    Scutari, Gesualdo
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 4124 - 4128