Privacy-preserving clustering federated learning for non-IID data

被引:3
|
作者
Luo, Guixun [1 ]
Chen, Naiyue [1 ,2 ]
He, Jiahuan [1 ]
Jin, Bingwei [1 ]
Zhang, Zhiyuan [3 ]
Li, Yidong [1 ,2 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
[2] Minist Educ, Key Lab Big Data & Artificial Intelligence Transpo, Beijing, Peoples R China
[3] Beijing Jiaotong Univ, Sch Elect & Informat Engn, Key Lab Commun & Informat Syst, Beijing Municipal Commiss Educ, Beijing 100044, Peoples R China
基金
北京市自然科学基金;
关键词
Federated learning; Privacy-preserving; Similarity measurement; Cluster; Differential privacy; RANDOMIZED-RESPONSE; SCHEME;
D O I
10.1016/j.future.2024.01.005
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the increasing number of intelligent devices joining into the Internet of Things (IoT), traditional centralized learning struggles to meet the performance requirements of terminal time-critical systems under heterogeneous data distribution. This challenge arises from the non-independent and non-identically distributed nature of data on terminal devices in real-world scenarios, which impacts the overall model convergence speed and terminal performance. As federated learning provides a privacy-preserving collaborative training framework, this paper focuses on the studying of the time response and performance issues in the context of data heterogeneity. In this paper, we propose a lightweight Randomized Response (RR) differential privacy method to protect the distribution characteristics of clients' data while quantifying their similarity. The paper introduces a community detection algorithm with linear time complexity to divide clients into clusters, which addresses inherent non-IID challenges in federated learning and meeting the rapid response requirements of time-critical systems. We conduct experiments on scenarios with different data distribution scenarios. The results show that the privacy-preserving mechanism has a negligible impact on model accuracy, and our algorithm demonstrates significant performance improvements in personalization compared to baseline methods. Additionally, in the presence of partially disconnected clients during training, compared to solo training, the pp-CFL algorithm enhances the timeliness and accuracy of the personalized local model.
引用
收藏
页码:384 / 395
页数:12
相关论文
共 50 条
  • [1] GFL: Federated Learning on Non-IID data via Privacy-preserving Synthetic data
    Cheng, Yihang
    Zhang, Lan
    Li, Anran
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS, PERCOM, 2023, : 61 - 70
  • [2] Privacy-Preserving Federated Graph Neural Network Learning on Non-IID Graph Data
    Zhang, Kainan
    Cai, Zhipeng
    Seo, Daehee
    [J]. Wireless Communications and Mobile Computing, 2023, 2023
  • [3] Privacy-Preserving Asynchronous Federated Learning Under Non-IID Settings
    Miao, Yinbin
    Kuang, Da
    Li, Xinghua
    Xu, Shujiang
    Li, Hongwei
    Choo, Kim-Kwang Raymond
    Deng, Robert H.
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 5828 - 5841
  • [4] Privacy-preserving Blockchain-based Global Data Sharing for Federated Learning with Non-IID Data
    Lian, Zhuotao
    Zeng, Qingkui
    Su, Chunhua
    [J]. 2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS (ICDCSW), 2022, : 193 - 198
  • [5] Privacy-Preserving Federated Learning Against Label-Flipping Attacks on Non-IID Data
    Shen, Xicong
    Liu, Ying
    Li, Fu
    Li, Chunguang
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (01): : 1241 - 1255
  • [6] Blockchain-Enabled Federated Learning for Privacy-Preserving Non-IID Data Sharing in Industrial Internet
    Wang, Qiuyan
    Dong, Haibing
    Huang, Yongfei
    Liu, Zenglei
    Gou, Yundong
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (02): : 1967 - 1983
  • [7] Dynamic Clustering Federated Learning for Non-IID Data
    Chen, Ming
    Wu, Jinze
    Yin, Yu
    Huang, Zhenya
    Liu, Qi
    Chen, Enhong
    [J]. ARTIFICIAL INTELLIGENCE, CICAI 2022, PT III, 2022, 13606 : 119 - 131
  • [8] Privacy-Enhanced Federated Learning for Non-IID Data
    Tan, Qingjie
    Wu, Shuhui
    Tao, Yuanhong
    [J]. MATHEMATICS, 2023, 11 (19)
  • [9] FedCML: Federated Clustering Mutual Learning with non-IID Data
    Chen, Zekai
    Wang, Fuyi
    Yu, Shengxing
    Liu, Ximeng
    Zheng, Zhiwei
    [J]. EURO-PAR 2023: PARALLEL PROCESSING, 2023, 14100 : 623 - 636
  • [10] Hierarchical Federated Learning with Adaptive Clustering on Non-IID Data
    Tian, Yuqing
    Zhang, Zhaoyang
    Yang, Zhaohui
    Jin, Richeng
    [J]. 2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 627 - 632