Communication-efficient federated learning with stagewise training strategy

被引：1

作者：

Cheng, Yifei ^{[1
,2
,3
]}

Shen, Shuheng ^{[4
]}

Liang, Xianfeng ^{[1
,3
,5
]}

Liu, Jingchang ^{[6
]}

Chen, Joya ^{[1
,3
,5
]}

Zhang, Tie ^{[5
]}

Chen, Enhong ^{[1
,2
,3
,5
]}

机构：

[1] Anhui Prov Key Lab Big Data Anal & Applicat, Hefei, Peoples R China

[2] Univ Sci & Technol China, Sch Data Sci, Hefei, Peoples R China

[3] State Key Lab Cognit Intelligence, Hefei, Peoples R China

[4] Ant Financial Serv Grp, Hangzhou, Peoples R China

[5] Univ Sci & Technol China, Sch Comp Sci, Hefei, Peoples R China

[6] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China

来源：

NEURAL NETWORKS | 2023年 / 167卷

基金：

中国国家自然科学基金;

关键词：

Federated learning; Optimization algorithm; Communication complexity; Convergence rate;

D O I：

10.1016/j.neunet.2023.08.033

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The efficiency of communication across workers is a significant factor that affects the performance of federated learning. Though periodic communication strategy is applied to reduce communication rounds in training, the communication cost is still high when the training data distributions are not independently and identically distributed (non-IID) which is common in federated learning. Recently, some works introduce variance reduction to eliminate the effect caused by non-IID data among workers. Nevertheless the provable optimal communication complexity O(log(ST)) and convergence rate O(1/(ST)) cannot be achieved simultaneously, where S denotes the number of sampled workers in each round and T is the number of iterations. To deal with this dilemma, we propose an optimization algorithm SQUARFA that adopts stagewise training framework coupling with variance reduction and uses a quick-start phase in each loop. Theoretical results show that SQUARFA achieves both optimal convergence rate and communication complexity for both strongly convex objectives and non-convex objectives under PL condition, thus fills the gap mentioned above. Then, a variant of SQUARFA yields the optimal theoretical results for general non-convex objectives. We further extend the technique in SQUARFA to the large batch setting and achieve optimal communication complexity. Experimental results demonstrate the superiority of the proposed algorithms.

引用

页码：460 / 472

页数：13

共 50 条

[1] Communication-efficient federated learning
Chen, Mingzhe
Shlezinger, Nir
Poor, H. Vincent
Eldar, Yonina C.
Cui, Shuguang
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2021, 118 (17)
[2] Communication-Efficient Vertical Federated Learning
Khan, Afsana
ten Thij, Marijn
Wilbik, Anna
[J]. ALGORITHMS, 2022, 15 (08)
[3] Communication-Efficient Adaptive Federated Learning
Wang, Yujia
Lin, Lu
Chen, Jinghui
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[4] Communication-Efficient Federated Learning with Heterogeneous Devices
Chen, Zhixiong
Yi, Wenqiang
Liu, Yuanwei
Nallanathan, Arumugam
[J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 3602 - 3607
[5] Communication-Efficient Federated Learning for Decision Trees
Zhao, Shuo
Zhu, Zikun
Li, Xin
Chen, Ying-Chi
[J]. IEEE Transactions on Artificial Intelligence, 2024, 5 (11): : 5478 - 5492
[6] Communication-Efficient Federated Learning with Adaptive Quantization
Mao, Yuzhu
Zhao, Zihao
Yan, Guangfeng
Liu, Yang
Lan, Tian
Song, Linqi
Ding, Wenbo
[J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2022, 13 (04)
[7] Communication-Efficient Secure Aggregation for Federated Learning
Ergun, Irem
Sami, Hasin Us
Guler, Basak
[J]. 2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 3881 - 3886
[8] FedBoost: Communication-Efficient Algorithms for Federated Learning
Hamer, Jenny
Mohri, Mehryar
Suresh, Ananda Theertha
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
[9] Ternary Compression for Communication-Efficient Federated Learning
Xu, Jinjin
Du, Wenli
Jin, Yaochu
He, Wangli
Cheng, Ran
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (03) : 1162 - 1176
[10] Neuron Pruning-Based Federated Learning for Communication-Efficient Distributed Training
Guan, Jianfeng
Wang, Pengcheng
Yao, Su
Zhang, Jing
[J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT IV, 2024, 14490 : 63 - 81

← 1 2 3 4 5 →