Feature-Based Dataset Fingerprinting for Clustered Federated Learning on Medical Image Data

Authors
Scheliga, Daniel [1]
Maeder, Patrick [1, 2]
Seeland, Marco [1]
Affiliations
[1] Tech Univ Ilmenau, Dept Comp Sci & Automat, Data Intens Syst & Visualizat Grp dAI SY, Max Planck Ring 14, D-98693 Ilmenau, Germany
[2] Friedrich Schiller Univ, Fac Biol Sci, Jena, Germany
Keywords
Information leakage; Medical imaging
DOI
10.1080/08839514.2024.2394756
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Federated Learning (FL) allows multiple clients to train a common model without sharing their private training data. In practice, federated optimization struggles with sub-optimal model utility because data is not independent and identically distributed (non-IID). Recent work has proposed to cluster clients according to dataset fingerprints to improve model utility in such situations. These fingerprints aim to capture the key characteristics of clients' local data distributions. Recently, a mechanism was proposed to calculate dataset fingerprints from raw client data. We find that this fingerprinting mechanism comes with substantial time and memory consumption, limiting its practical use to small datasets. Additionally, shared raw data fingerprints can directly leak sensitive visual information, in certain cases even resembling the original client training data. To alleviate these problems, we propose a Feature-based dataset FingerPrinting mechanism (FFP). We use the MedMNIST database to develop a highly realistic case study for FL on medical image data. Compared to existing methods, our proposed FFP reduces the computational overhead of fingerprint calculation while achieving similar model utility. Furthermore, FFP mitigates the risk of raw data leakage from fingerprints by design.
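The abstract does not give the details of FFP, so the following is only a minimal sketch of the general idea it describes: each client summarizes its data as a fingerprint computed from feature vectors rather than raw pixels, and clients with similar fingerprints are grouped together. Everything specific here is an assumption made for illustration: the random `projection` standing in for a shared feature extractor, the mean/std statistics, the plain k-means grouping, and the synthetic client data. None of it is taken from the paper.

```python
import numpy as np


def feature_fingerprint(images, projection):
    """Summarize one client's dataset by statistics of its feature vectors.

    `projection` is a hypothetical stand-in for a shared, frozen feature
    extractor; it is an assumption of this sketch, not the paper's model.
    """
    flat = images.reshape(len(images), -1)
    features = np.tanh(flat @ projection)  # illustrative feature map
    # Only aggregate statistics leave the client, never raw images.
    return np.concatenate([features.mean(axis=0), features.std(axis=0)])


def cluster_clients(fingerprints, k, iters=20):
    """Plain k-means over fingerprints; returns one cluster label per client."""
    fingerprints = np.asarray(fingerprints)
    # Deterministic farthest-point initialization of the k centers.
    centers = [fingerprints[0]]
    for _ in range(1, k):
        d = np.min(
            [np.linalg.norm(fingerprints - c, axis=1) for c in centers], axis=0
        )
        centers.append(fingerprints[np.argmax(d)])
    centers = np.array(centers)
    for _ in range(iters):
        dists = np.linalg.norm(
            fingerprints[:, None, :] - centers[None, :, :], axis=2
        )
        labels = dists.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = fingerprints[labels == j].mean(axis=0)
    return labels


rng = np.random.default_rng(0)
projection = rng.normal(size=(28 * 28, 16)) / 28.0

# Four synthetic clients: two hold dark images, two hold bright images.
clients = [
    rng.uniform(low, low + 0.3, size=(50, 28, 28))
    for low in (0.0, 0.0, 0.7, 0.7)
]
fingerprints = [feature_fingerprint(c, projection) for c in clients]
labels = cluster_clients(fingerprints, k=2)
```

In this toy setup the two dark-image clients and the two bright-image clients land in separate clusters. Each fingerprint is a 32-dimensional vector, regardless of how many images a client holds, which hints at why a feature-level summary can be cheaper to compute and share than one built from raw data.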
Pages: 21