Distributed Adaptive Subgradient Algorithms for Online Learning Over Time-Varying Networks

被引：11

作者：

Zhang, Mingchuan ^{[1
]}

Hao, Bowei ^{[1
]}

Ge, Quanbo ^{[2
]}

Zhu, Junlong ^{[1
]}

Zheng, Ruijuan ^{[1
]}

Wu, Qingtao ^{[1
]}

机构：

[1] Henan Univ Sci & Technol, Sch Informat Engn, Luoyang 471023, Peoples R China

[2] Anhui Univ, Sch Artificial Intelligence, Hefei 230039, Peoples R China

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2022年 / 52卷 / 07期

基金：

中国国家自然科学基金;

关键词：

Optimization; Heuristic algorithms; Training; Linear programming; Convergence; Machine learning algorithms; Deep learning; Adaptive subgradient algorithms; generalization capacity; regret bound; OPTIMIZATION; CONSENSUS; CONVERGENCE;

D O I：

10.1109/TSMC.2021.3097714

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Adaptive gradient algorithms have recently become extremely popular because they have been applied successfully in training deep neural networks, such as Adam, AMSGrad, and AdaBound. Despite their success, however, the distributed variant of the adaptive method, which is expected to possess a rapid training speed at the beginning and a good generalization capacity at the end, is rarely studied. To fill the gap, a distributed adaptive subgradient algorithm is presented, called D-AdaBound, where the learning rates are dynamically bounded by clipping the learning rates. Moreover, we obtain the regret bound of D-AdaBound, in which the objective functions are convex. Finally, we confirm the effectiveness of D-AdaBound by simulation experiments on different datasets. The results show the performance improvement of D-AdaBound relative to existing distributed online learning algorithms.

引用

页码：4518 / 4529

页数：12

共 50 条

[21] Projection-free decentralized online learning for submodular maximization over time-varying networks
Zhu, Junlong
Wu, Qingtao
Zhang, Mingchuan
Zheng, Ruijuan
Li, Keqin
[J]. Journal of Machine Learning Research, 2021, 22
[22] Distributed Newton Step Projection Algorithm for Online Convex Optimization Over Time-Varying Unbalanced Networks
Wu, Jiayi
Tian, Yu-Ping
[J]. IEEE ACCESS, 2024, 12 : 1189 - 1200
[23] Projection-free Decentralized Online Learning for Submodular Maximization over Time-Varying Networks
Zhu, Junlong
Wu, Qingtao
Zhang, Mingchuan
Zheng, Ruijuan
Li, Keqin
[J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
[24] Adaptive cluster synchronization in networks with time-varying and distributed coupling delays
Li, Kezan
Zhou, Jin
Yu, Wenwu
Small, Michael
Fu, Xinchu
[J]. APPLIED MATHEMATICAL MODELLING, 2014, 38 (04) : 1300 - 1314
[25] Adaptive synchronization of neural networks with time-varying delay and distributed delay
Wang, Kai
Teng, Zhidong
Jiang, Haijun
[J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2008, 387 (2-3) : 631 - 642
[26] A Privacy-Masking Learning Algorithm for Online Distributed Optimization over Time-Varying Unbalanced Digraphs
Hu, Rong
Zhang, Binru
[J]. JOURNAL OF MATHEMATICS, 2021, 2021
[27] Microgrid Distributed Frequency Control Over Time-Varying Communication Networks
Zholbaryssov, Madi
Dominguez-Garcia, Alejandro D.
[J]. 2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 5722 - 5727
[28] Distributed dynamic stochastic approximation algorithm over time-varying networks
Fu K.
Chen H.-F.
Zhao W.
[J]. Autonomous Intelligent Systems, 2021, 1 (01):
[29] Adaptive identification algorithms for time-varying parameters
Hidaka, K
Ohmori, H
Sano, A
[J]. PROCEEDINGS OF THE 40TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2001, : 4308 - 4313
[30] Distributed Online Optimization in Time-Varying Unbalanced Networks Without Explicit Subgradients
Xiong, Yongyang
Li, Xiang
You, Keyou
Wu, Ligang
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2022, 70 : 4047 - 4060

← 1 2 3 4 5 →