Towards Unsupervised Sudden Data Drift Detection in Federated Learning with Fuzzy Clustering

被引:0
|
作者
Stallmann, Morris [1 ]
Wilbik, Anna [1 ]
Weiss, Gerhard [1 ]
机构
[1] Maastricht Univ, Dept Adv Comp Sci, Maastricht, Netherlands
关键词
federated learning; fuzzy clustering; unsupervised; drift; drift detection; federated drift detection; federated data drift detection; FCM;
D O I
10.1109/FUZZ-IEEE60900.2024.10611883
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated learning (FL) is a machine learning (ML) discipline that allows to train ML models on distributed data without revealing raw data instances. It promises to enable ML in environments with data sharing constraints, e.g., due to data privacy concerns, or other considerations. Data and concept drift are commonly referred to as unpredictable changes in data distributions over time. It is known to impact a ML model's performances in many real-world scenarios. While drift detection and adaptation has been studied extensively in the non-federated setting, it is still less explored in the FL setting. The private and distributed nature of data in FL makes drift detection much harder in FL since no entity can oversee all data instances to estimate changes in the global data distribution. In this paper, we propose a novel unsupervised federated data drift detection method that is based on federated fuzzy c-means clustering and the federated fuzzy Davies-Bouldin index, a global cluster validation metric. First, using the federated fuzzy c-means clustering algorithm, an initial global data model is learned. Second, the federated fuzzy Davies-Bouldin index . is calculated estimating how well the data fits the learned model. Third, whenever a new batch of data is available at time t, the fit of initial data model and new data is evaluated through the federated fuzzy Davies-Bouldin index Delta(t). Finally Delta and Delta(t) are compared to detect drift. The method is unsupervised as it does not require any labels and detects global data drift while keeping all data private. We evaluate our method carefully in a controlled environment by simulating multiple federated drift scenarios. We observe promising results as it rarely signals false positive alarms and detects drift in multiple scenarios. We also observe short-comings such as sensitivity to parameter choices and low detection rate in case only few data points in a new batch of data are affected by drift.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Parameter Distribution Ensemble Learning for Sudden Concept Drift Detection
    Khanh-Tung Nguyen
    Trung Tran
    Anh-Duc Nguyen
    Xuan-Hieu Phan
    Quang-Thuy Ha
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, PT II, 2022, 13758 : 192 - 203
  • [42] ACCELERATED UNSUPERVISED CLUSTERING IN ACOUSTIC SENSOR NETWORKS USING FEDERATED LEARNING AND A VARIATIONAL AUTOENCODER
    Becker, Luca
    Nelus, Alexandra
    Glitza, Rene
    Martin, Rainer
    2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
  • [43] Towards Boosting Federated Learning Convergence: A Computation Offloading & Clustering Approach
    AbdulRahman, Sawsan
    Bouachir, Ouns
    Otoum, Safa
    Mourad, Azzam
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 106 - 111
  • [44] Unsupervised Data Splitting Scheme for Federated Edge Learning in IoT Networks
    Nour, Boubakr
    Cherkaoui, Soumaya
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022,
  • [45] Embedding Alignment for Unsupervised Federated Learning via Smart Data Exchange
    Wagle, Satyavrat
    Hosseinalipour, Seyyedali
    Khosravan, Naji
    Chiang, Mung
    Brinton, Christopher G.
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 492 - 497
  • [46] Cluster detection and characterisation based on unsupervised fuzzy learning
    Bouroumi, A.
    Benrabh, M.
    Radouane, A.
    Hamdoun, A.
    Advances in Modelling and Analysis A, 2006, 43 (1-2): : 39 - 55
  • [47] Towards Accelerating the Adoption of Federated Learning for Heterogeneous Data
    Ntokos, Christos
    Bakalos, Nikolaos
    Kalogeras, Dimitrios
    PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS, PETRA 2023, 2023, : 617 - 624
  • [48] Towards practical data alignment in production federated learning
    Shi, Yexuan
    Yu, Wei
    Zhang, Yuanyuan
    Xue, Chunbo
    Zeng, Yuxiang
    Zhou, Zimu
    Guo, Manxue
    Xin, Lun
    Nie, Wenjing
    FRONTIERS OF COMPUTER SCIENCE, 2025, 19 (01)
  • [49] Towards Taming the Resource and Data Heterogeneity in Federated Learning
    Chai, Zheng
    Fayyaz, Hannan
    Fayyaz, Zeshan
    Anwar, Ali
    Zhou, Yi
    Baracaldo, Nathalie
    Ludwig, Heiko
    Cheng, Yue
    PROCEEDINGS OF THE 2019 USENIX CONFERENCE ON OPERATIONAL MACHINE LEARNING, 2019, : 19 - 21
  • [50] Unsupervised Change Detection of Remotely Sensed Images using Fuzzy Clustering
    Ghosh, Susmita
    Mishra, Niladri Shekhar
    Ghosh, Ashish
    ICAPR 2009: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, PROCEEDINGS, 2009, : 385 - 388