Distributed Adaptive Subgradient Algorithms for Online Learning Over Time-Varying Networks

被引:11
|
作者
Zhang, Mingchuan [1 ]
Hao, Bowei [1 ]
Ge, Quanbo [2 ]
Zhu, Junlong [1 ]
Zheng, Ruijuan [1 ]
Wu, Qingtao [1 ]
机构
[1] Henan Univ Sci & Technol, Sch Informat Engn, Luoyang 471023, Peoples R China
[2] Anhui Univ, Sch Artificial Intelligence, Hefei 230039, Peoples R China
基金
中国国家自然科学基金;
关键词
Optimization; Heuristic algorithms; Training; Linear programming; Convergence; Machine learning algorithms; Deep learning; Adaptive subgradient algorithms; generalization capacity; regret bound; OPTIMIZATION; CONSENSUS; CONVERGENCE;
D O I
10.1109/TSMC.2021.3097714
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Adaptive gradient algorithms have recently become extremely popular because they have been applied successfully in training deep neural networks, such as Adam, AMSGrad, and AdaBound. Despite their success, however, the distributed variant of the adaptive method, which is expected to possess a rapid training speed at the beginning and a good generalization capacity at the end, is rarely studied. To fill the gap, a distributed adaptive subgradient algorithm is presented, called D-AdaBound, where the learning rates are dynamically bounded by clipping the learning rates. Moreover, we obtain the regret bound of D-AdaBound, in which the objective functions are convex. Finally, we confirm the effectiveness of D-AdaBound by simulation experiments on different datasets. The results show the performance improvement of D-AdaBound relative to existing distributed online learning algorithms.
引用
收藏
页码:4518 / 4529
页数:12
相关论文
共 50 条
  • [21] Projection-free decentralized online learning for submodular maximization over time-varying networks
    Zhu, Junlong
    Wu, Qingtao
    Zhang, Mingchuan
    Zheng, Ruijuan
    Li, Keqin
    [J]. Journal of Machine Learning Research, 2021, 22
  • [22] Distributed Newton Step Projection Algorithm for Online Convex Optimization Over Time-Varying Unbalanced Networks
    Wu, Jiayi
    Tian, Yu-Ping
    [J]. IEEE ACCESS, 2024, 12 : 1189 - 1200
  • [23] Projection-free Decentralized Online Learning for Submodular Maximization over Time-Varying Networks
    Zhu, Junlong
    Wu, Qingtao
    Zhang, Mingchuan
    Zheng, Ruijuan
    Li, Keqin
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [24] Adaptive cluster synchronization in networks with time-varying and distributed coupling delays
    Li, Kezan
    Zhou, Jin
    Yu, Wenwu
    Small, Michael
    Fu, Xinchu
    [J]. APPLIED MATHEMATICAL MODELLING, 2014, 38 (04) : 1300 - 1314
  • [25] Adaptive synchronization of neural networks with time-varying delay and distributed delay
    Wang, Kai
    Teng, Zhidong
    Jiang, Haijun
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2008, 387 (2-3) : 631 - 642
  • [26] A Privacy-Masking Learning Algorithm for Online Distributed Optimization over Time-Varying Unbalanced Digraphs
    Hu, Rong
    Zhang, Binru
    [J]. JOURNAL OF MATHEMATICS, 2021, 2021
  • [27] Microgrid Distributed Frequency Control Over Time-Varying Communication Networks
    Zholbaryssov, Madi
    Dominguez-Garcia, Alejandro D.
    [J]. 2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 5722 - 5727
  • [28] Distributed dynamic stochastic approximation algorithm over time-varying networks
    Fu K.
    Chen H.-F.
    Zhao W.
    [J]. Autonomous Intelligent Systems, 2021, 1 (01):
  • [29] Adaptive identification algorithms for time-varying parameters
    Hidaka, K
    Ohmori, H
    Sano, A
    [J]. PROCEEDINGS OF THE 40TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2001, : 4308 - 4313
  • [30] Distributed Online Optimization in Time-Varying Unbalanced Networks Without Explicit Subgradients
    Xiong, Yongyang
    Li, Xiang
    You, Keyou
    Wu, Ligang
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2022, 70 : 4047 - 4060