Representative Kernels-Based CNN for Faster Transmission in Federated Learning

被引：0

作者：

Li, Wei ^{[1
,2
]}

Shen, Zichen ^{[1
,2
]}

Liu, Xiulong ^{[3
]}

Wang, Mingfeng ^{[4
]}

Ma, Chao ^{[5
]}

Ding, Chuntao ^{[6
]}

Cao, Jiannong ^{[7
]}

机构：

[1] Jiangnan Univ, Res Ctr Intelligent Technol Healthcare, Sch Artificial Intelligence & Comp Sci & Engn, Minist Educ, Wuxi 214126, Jiangsu, Peoples R China

[2] Jiangnan Univ, Jiangsu Key Lab Media Design & Software Technol, Wuxi 214126, Jiangsu, Peoples R China

[3] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300072, Peoples R China

[4] Brunel Univ London, Dept Mech & Aerosp Engn, London UB8 3PH, England

[5] Wuhan Univ, Sch Cyber Sci & Engn, Wuhan 430072, Peoples R China

[6] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China

[7] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China

来源：

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2024年 / 23卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Federated learning; convolution neural network; representative kernels; kernel generation function; parameter reduction; module selection; BANDWIDTH;

D O I：

10.1109/TMC.2024.3423448

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Due to the contradiction between limited bandwidth and huge transmission parameters, federated Learning (FL) has been an ongoing challenge to reduce the model parameters that need to be transmitted to server in clients for fast transmission. Existing works that attempt to reduce the amount of transmitted parameters have limitations: 1) the reduced number of parameters is not significant; 2) the performance of the global model is limited. In this paper, we propose a novel method called Fed-KGF that significantly reduces the amount of model parameters while improving the global model performance. Our goal is to reduce those transmitted parameters by reducing the number of convolution kernels. Specifically, we construct an incomplete model with a few representative convolution kernels, and propose Kernel Generation Function (KGF) to generate other convolution kernels to render the incomplete model to be a complete one. We discard those generated kernels after training local models, and solely transmit those representative kernels during training, thereby significantly reducing the transmitted parameters. Furthermore, there is a client-drift in the traditional FL because of the averaging method, which hurts the global model performance. We innovatively select one or few modules from all client models in a permutation way, and only aggregate the uploaded modules rather than averaging all modules to reduce client-drift, thus improving the global model performance and further reducing the transmitted parameters. Experimental results on both non-Independent and Identically Distributed (non-IID) and IID scenarios for image classification and object detection tasks demonstrate that our Fed-KGF outperforms SOTA FL models.

引用

页码：13062 / 13075

页数：14

共 50 条

[31] Pedestrian Detection based on Faster R-CNN
Liu S.
Cui X.
Li J.
Yang H.
Lukač N.
International Journal of Performability Engineering, 2019, 15 (07) : 1792 - 1801
[32] AN ENVIRONMENTALLY FRIENDLY DEFECT DETECTION METHOD FOR SMALL FITTINGS OF TRANSMISSION LINES BASED ON FASTER R-CNN
Wang, Hongxing
Pan, Zhixin
Chen, Yuquan
Huang, Zheng
Huang, Xiang
Gao, Xiaowei
FRESENIUS ENVIRONMENTAL BULLETIN, 2020, 29 (11): : 9914 - 9923
[33] Federated learning framework integrating REFINED CNN and Deep Regression Forests
Nolte, Daniel
Bazgir, Omid
Ghosh, Souparno
Pal, Ranadip
BIOINFORMATICS ADVANCES, 2023, 3 (01):
[34] Autism Spectrum Disorder detection framework for children based on federated learning integrated CNN-LSTM
Lakhan, Abdullah
Mohammed, Mazin Abed
Abdulkareem, Karrar Hameed
Hamouda, Hassen
Alyahya, Saleh
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 166
[35] Federated Transfer Learning For Diabetic Retinopathy Detection Using CNN Architectures
Nasajpour, Mohammad
Karakaya, Mahmut
Pouriyeh, Seyedamin
Parizi, Reza M.
SOUTHEASTCON 2022, 2022, : 655 - 660
[36] Faster Dynamic Graph CNN: Faster Deep Learning on 3D Point Cloud Data
Hong, Jinseok
Kim, Keeyoung
Lee, Hongchul
IEEE ACCESS, 2020, 8 : 190529 - 190538
[37] Faster R-CNN Based Deep Learning for Seagrass Detection from Underwater Digital Images
Moniruzzaman, Md
Islam, Syed Mohammed Shamsul
Lavery, Paul
Bennamoun, Mohammed
2019 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2019, : 41 - 47
[38] Smart traffic management of vehicles using faster R-CNN based deep learning method
Chaudhuri, Arindam
SCIENTIFIC REPORTS, 2024, 14 (01):
[39] Faster dynamic graph CNN: Faster deep learning on 3d point cloud data
Hong, Jinseok
Kim, Keeyoung
Lee, Hongchul
IEEE Access, 2020, 8 : 190529 - 190538
[40] On Model Transmission Strategies in Federated Learning With Lossy Communications
Su, Xiaoxin
Zhou, Yipeng
Cui, Laizhong
Liu, Jiangchuan
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (04) : 1173 - 1185

← 1 2 3 4 5 →