Feature-Based Dataset Fingerprinting for Clustered Federated Learning on Medical Image Data

被引:0
|
作者
Scheliga, Daniel [1 ]
Maeder, Patrick [1 ,2 ]
Seeland, Marco [1 ]
机构
[1] Tech Univ Ilmenau, Dept Comp Sci & Automat, Data Intens Syst & Visualizat Grp dAI SY, Max Planck Ring 14, D-98693 Ilmenau, Germany
[2] Friedrich Schiller Univ, Fac Biol Sci, Jena, Germany
关键词
Information leakage - Medical imaging;
D O I
10.1080/08839514.2024.2394756
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated Learning (FL) allows multiple clients to train a common model without sharing their private training data. In practice, federated optimization struggles with sub-optimal model utility because data is not independent and identically distributed (non-IID). Recent work has proposed to cluster clients according to dataset fingerprints to improve model utility in such situations. These fingerprints aim to capture the key characteristics of clients' local data distributions. Recently, a mechanism was proposed to calculate dataset fingerprints from raw client data. We find that this fingerprinting mechanism comes with substantial time and memory consumption, limiting its practical use to small datasets. Additionally, shared raw data fingerprints can directly leak sensitive visual information, in certain cases even resembling the original client training data. To alleviate these problems, we propose a Feature-based dataset FingerPrinting mechanism (FFP). We use the MedMNIST database to develop a highly realistic case study for FL on medical image data. Compared to existing methods, our proposed FFP reduces the computational overhead of fingerprint calculation while achieving similar model utility. Furthermore, FFP mitigates the risk of raw data leakage from fingerprints by design.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Feature-Based Fusion of Medical Imaging Data
    Calhoun, Vince D.
    Adali, Tuelay
    IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2009, 13 (05): : 711 - 720
  • [2] Federated Contrastive Learning With Feature-Based Distillation for Human Activity Recognition
    Xiao, Zhiwen
    Tong, Huagang
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2025,
  • [3] Feature-Based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy
    Wang, Feng
    Gursoy, M. Cenk
    Velipasalar, Senem
    IEEE Transactions on Machine Learning in Communications and Networking, 2024, 2 : 823 - 840
  • [4] Clustered Federated Learning Framework with Acceleration Based on Data Similarity
    Gao, ZhiPeng
    Xiong, ZiJian
    Zhao, Chen
    Feng, FuTeng
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT VII, 2024, 14493 : 80 - 92
  • [5] Feature-based interactive visualization of volumetric medical data
    Sun, HQ
    Cheung, HF
    Lam, CF
    Heng, PA
    Baciu, G
    1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 1162 - 1167
  • [6] Medical image segmentation using feature-based GVF snake
    Ng, H. P.
    Foong, K. W. C.
    Ong, S. H.
    Goh, P. S.
    Nowinski, W. L.
    2007 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-16, 2007, : 800 - +
  • [7] Clustered Federated Learning Based on Momentum Gradient Descent for Heterogeneous Data
    Zhao, Xiaoyi
    Xie, Ping
    Xing, Ling
    Zhang, Gaoyuan
    Ma, Huahong
    ELECTRONICS, 2023, 12 (09)
  • [8] SoFL: Clustered Federated Learning Based on Dual Clustering for Heterogeneous Data
    Zhang, Jianfei
    Qiao, Zhiming
    ELECTRONICS, 2024, 13 (18)
  • [9] Feature-based image analysis
    Lillholm, M
    Nielsen, M
    Griffin, LD
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2003, 52 (2-3) : 73 - 95
  • [10] Feature-Based Image Analysis
    Martin Lillholm
    Mads Nielsen
    Lewis D. Griffin
    International Journal of Computer Vision, 2003, 52 : 73 - 95