Representative Kernels-Based CNN for Faster Transmission in Federated Learning

被引:0
|
作者
Li, Wei [1 ,2 ]
Shen, Zichen [1 ,2 ]
Liu, Xiulong [3 ]
Wang, Mingfeng [4 ]
Ma, Chao [5 ]
Ding, Chuntao [6 ]
Cao, Jiannong [7 ]
机构
[1] Jiangnan Univ, Res Ctr Intelligent Technol Healthcare, Sch Artificial Intelligence & Comp Sci & Engn, Minist Educ, Wuxi 214126, Jiangsu, Peoples R China
[2] Jiangnan Univ, Jiangsu Key Lab Media Design & Software Technol, Wuxi 214126, Jiangsu, Peoples R China
[3] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300072, Peoples R China
[4] Brunel Univ London, Dept Mech & Aerosp Engn, London UB8 3PH, England
[5] Wuhan Univ, Sch Cyber Sci & Engn, Wuhan 430072, Peoples R China
[6] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
[7] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Federated learning; convolution neural network; representative kernels; kernel generation function; parameter reduction; module selection; BANDWIDTH;
D O I
10.1109/TMC.2024.3423448
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the contradiction between limited bandwidth and huge transmission parameters, federated Learning (FL) has been an ongoing challenge to reduce the model parameters that need to be transmitted to server in clients for fast transmission. Existing works that attempt to reduce the amount of transmitted parameters have limitations: 1) the reduced number of parameters is not significant; 2) the performance of the global model is limited. In this paper, we propose a novel method called Fed-KGF that significantly reduces the amount of model parameters while improving the global model performance. Our goal is to reduce those transmitted parameters by reducing the number of convolution kernels. Specifically, we construct an incomplete model with a few representative convolution kernels, and propose Kernel Generation Function (KGF) to generate other convolution kernels to render the incomplete model to be a complete one. We discard those generated kernels after training local models, and solely transmit those representative kernels during training, thereby significantly reducing the transmitted parameters. Furthermore, there is a client-drift in the traditional FL because of the averaging method, which hurts the global model performance. We innovatively select one or few modules from all client models in a permutation way, and only aggregate the uploaded modules rather than averaging all modules to reduce client-drift, thus improving the global model performance and further reducing the transmitted parameters. Experimental results on both non-Independent and Identically Distributed (non-IID) and IID scenarios for image classification and object detection tasks demonstrate that our Fed-KGF outperforms SOTA FL models.
引用
收藏
页码:13062 / 13075
页数:14
相关论文
共 50 条
  • [31] Pedestrian Detection based on Faster R-CNN
    Liu S.
    Cui X.
    Li J.
    Yang H.
    Lukač N.
    International Journal of Performability Engineering, 2019, 15 (07) : 1792 - 1801
  • [32] AN ENVIRONMENTALLY FRIENDLY DEFECT DETECTION METHOD FOR SMALL FITTINGS OF TRANSMISSION LINES BASED ON FASTER R-CNN
    Wang, Hongxing
    Pan, Zhixin
    Chen, Yuquan
    Huang, Zheng
    Huang, Xiang
    Gao, Xiaowei
    FRESENIUS ENVIRONMENTAL BULLETIN, 2020, 29 (11): : 9914 - 9923
  • [33] Federated learning framework integrating REFINED CNN and Deep Regression Forests
    Nolte, Daniel
    Bazgir, Omid
    Ghosh, Souparno
    Pal, Ranadip
    BIOINFORMATICS ADVANCES, 2023, 3 (01):
  • [34] Autism Spectrum Disorder detection framework for children based on federated learning integrated CNN-LSTM
    Lakhan, Abdullah
    Mohammed, Mazin Abed
    Abdulkareem, Karrar Hameed
    Hamouda, Hassen
    Alyahya, Saleh
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 166
  • [35] Federated Transfer Learning For Diabetic Retinopathy Detection Using CNN Architectures
    Nasajpour, Mohammad
    Karakaya, Mahmut
    Pouriyeh, Seyedamin
    Parizi, Reza M.
    SOUTHEASTCON 2022, 2022, : 655 - 660
  • [36] Faster Dynamic Graph CNN: Faster Deep Learning on 3D Point Cloud Data
    Hong, Jinseok
    Kim, Keeyoung
    Lee, Hongchul
    IEEE ACCESS, 2020, 8 : 190529 - 190538
  • [37] Faster R-CNN Based Deep Learning for Seagrass Detection from Underwater Digital Images
    Moniruzzaman, Md
    Islam, Syed Mohammed Shamsul
    Lavery, Paul
    Bennamoun, Mohammed
    2019 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2019, : 41 - 47
  • [38] Smart traffic management of vehicles using faster R-CNN based deep learning method
    Chaudhuri, Arindam
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [39] Faster dynamic graph CNN: Faster deep learning on 3d point cloud data
    Hong, Jinseok
    Kim, Keeyoung
    Lee, Hongchul
    IEEE Access, 2020, 8 : 190529 - 190538
  • [40] On Model Transmission Strategies in Federated Learning With Lossy Communications
    Su, Xiaoxin
    Zhou, Yipeng
    Cui, Laizhong
    Liu, Jiangchuan
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (04) : 1173 - 1185