Federated Data Quality Assessment Approach: Robust Learning With Mixed Label Noise

被引:1
|
作者
Zeng, Bixiao [1 ]
Yang, Xiaodong [1 ,2 ]
Chen, Yiqiang [1 ,3 ,4 ]
Yu, Hanchao [5 ]
Hu, Chunyu [6 ]
Zhang, Yingwei [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing 100045, Peoples R China
[2] Shandong Acad Intelligent Comp Technol, Inst Comp Technol, Jinan, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 101408, Peoples R China
[4] Beijing Key Lab Mobile Comp & Pervas Device, Beijing 100190, Peoples R China
[5] Chinese Acad Sci, Bur Frontier Sci & Educ, Beijing 100045, Peoples R China
[6] Qilu Univ Technol, Shandong Acad Sci, Jinan 250353, Peoples R China
基金
中国国家自然科学基金;
关键词
Noise measurement; Servers; Task analysis; Adaptation models; Data models; Data integrity; Computers; Data quality assessment; federated learning (FL); noise-robust algorithm;
D O I
10.1109/TNNLS.2023.3306874
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated learning (FL) has been an effective way to train a machine learning model distributedly, holding local data without exchanging them. However, due to the inaccessibility of local data, FL with label noise would be more challenging. Most existing methods assume only open-set or closed-set noise and correspondingly propose filtering or correction solutions, ignoring that label noise can be mixed in real-world scenarios. In this article, we propose a novel FL method to discriminate the type of noise and make the FL mixed noise-robust, named FedMIN. FedMIN employs a composite framework that captures local-global differences in multiparticipant distributions to model generalized noise patterns. By determining adaptive thresholds for identifying mixed label noise in each client and assigning appropriate weights during model aggregation, FedMIN enhances the performance of the global model. Furthermore, FedMIN incorporates a loss alignment mechanism using local and global Gaussian mixture models (GMMs) to mitigate the risk of revealing samplewise loss. Extensive experiments are conducted on several public datasets, which include the simulated FL testbeds, i.e., CIFAR-10, CIFAR-100, and SVHN, and the real-world ones, i.e., Camelyon17 and multiorgan nuclei challenge (MoNuSAC). Compared to FL benchmarks, FedMIN improves model accuracy by up to 9.9% due to its superior noise estimation capabilities.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [1] Robust Federated Opportunistic Learning in the Presence of Label Quality Disparity
    Li, Chengxi
    Li, Gang
    Varshney, Pramod K.
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (13) : 10978 - 10987
  • [2] Federated Learning with Extreme Label Skew: A Data Extension Approach
    Tijani, Saheed A.
    Ma, Xingjun
    Zhang, Ran
    Jiang, Frank
    Doss, Robin
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [3] Robust quantum federated learning with noise
    Chen, Liangjun
    Yan, Lili
    Zhang, Shibin
    [J]. PHYSICA SCRIPTA, 2024, 99 (07)
  • [4] FedNoRo: Towards Noise-Robust Federated Learning by Addressing Class Imbalance and Label Noise Heterogeneity
    Wu, Nannan
    Yu, Li
    Jiang, Xuefeng
    Cheng, Kwang-Ting
    Yan, Zengqiang
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4424 - 4432
  • [5] Client Selection for Federated Learning With Label Noise
    Yang, Miao
    Qian, Hua
    Wang, Ximin
    Zhou, Yong
    Zhu, Honghin
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (02) : 2193 - 2197
  • [6] FedFixer: Mitigating Heterogeneous Label Noise in Federated Learning
    Ji, Xinyuan
    Zhu, Zhaowei
    Xi, Wei
    Gadyatskaya, Olga
    Song, Zilong
    Cai, Yong
    Liu, Yang
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 12830 - 12838
  • [7] Data Quality Detection Mechanism Against Label Flipping Attacks in Federated Learning
    Jiang, Yifeng
    Zhang, Weiwen
    Chen, Yanxi
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 1625 - 1637
  • [8] GANFAT: Robust federated adversarial learning with label distribution skew
    Luo, Yayu
    Zhu, Tongzhijun
    Liu, Zediao
    Mao, Tenglong
    Chen, Ziyi
    Pi, Huan
    Lin, Ying
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 160 : 711 - 723
  • [9] BOBA: Byzantine-Robust Federated Learning with Label Skewness
    Bao, Wenxuan
    Wu, Jun
    He, Jingrui
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [10] Robust TSK Fuzzy System Based on Semisupervised Learning for Label Noise Data
    Zhang, Te
    Deng, Zhaohong
    Ishibuchi, Hisao
    Pang, Lie Meng
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2021, 29 (08) : 2145 - 2157