Feature-Based Dataset Fingerprinting for Clustered Federated Learning on Medical Image Data

被引:0
|
作者
Scheliga, Daniel [1 ]
Maeder, Patrick [1 ,2 ]
Seeland, Marco [1 ]
机构
[1] Tech Univ Ilmenau, Dept Comp Sci & Automat, Data Intens Syst & Visualizat Grp dAI SY, Max Planck Ring 14, D-98693 Ilmenau, Germany
[2] Friedrich Schiller Univ, Fac Biol Sci, Jena, Germany
关键词
Information leakage - Medical imaging;
D O I
10.1080/08839514.2024.2394756
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated Learning (FL) allows multiple clients to train a common model without sharing their private training data. In practice, federated optimization struggles with sub-optimal model utility because data is not independent and identically distributed (non-IID). Recent work has proposed to cluster clients according to dataset fingerprints to improve model utility in such situations. These fingerprints aim to capture the key characteristics of clients' local data distributions. Recently, a mechanism was proposed to calculate dataset fingerprints from raw client data. We find that this fingerprinting mechanism comes with substantial time and memory consumption, limiting its practical use to small datasets. Additionally, shared raw data fingerprints can directly leak sensitive visual information, in certain cases even resembling the original client training data. To alleviate these problems, we propose a Feature-based dataset FingerPrinting mechanism (FFP). We use the MedMNIST database to develop a highly realistic case study for FL on medical image data. Compared to existing methods, our proposed FFP reduces the computational overhead of fingerprint calculation while achieving similar model utility. Furthermore, FFP mitigates the risk of raw data leakage from fingerprints by design.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case
    Singh, Gurtaj
    Violi, Vincenzo
    Fisichella, Marco
    BIG DATA AND COGNITIVE COMPUTING, 2023, 7 (01)
  • [22] A distribution information sharing federated learning approach for medical image data
    Zhao, Leiyang
    Huang, Jianjun
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (05) : 5625 - 5636
  • [23] A distribution information sharing federated learning approach for medical image data
    Leiyang Zhao
    Jianjun Huang
    Complex & Intelligent Systems, 2023, 9 : 5625 - 5636
  • [24] Medical Image Segmentation Based on Federated Distillation Optimization Learning on Non-IID Data
    Liu, Fangbo
    Yang, Feng
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT III, 2023, 14088 : 347 - 358
  • [25] Feature-based clustered geometry for interpolated Ray-casting
    Garcia, Francisco Gonzalez
    Martin, Ignacio
    Patow, Gustavo
    COMPUTERS & GRAPHICS-UK, 2022, 102 : 175 - 186
  • [26] FEATURE-BASED IMAGE BANDWIDTH COMPRESSION
    SAGHRI, JA
    TESCHER, AG
    OPTICAL ENGINEERING, 1988, 27 (10) : 854 - 860
  • [27] FEATURE-BASED IMAGE SET COMPRESSION
    Shi, Zhongbo
    Sun, Xiaoyan
    Wu, Feng
    2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013), 2013,
  • [28] Feature-Based Panoramic Image Stitching
    Alomran, Murtadha
    Chai, Douglas
    2016 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2016,
  • [29] Image browsing for feature-based products
    Yang, CC
    Kwok, SH
    Yip, M
    ELECTRONIC IMAGING AND MULTIMEDIA TECHNOLOGY III, 2002, 4925 : 350 - 357
  • [30] Multiresolution feature-based image registration
    Hsu, CT
    Beuker, RA
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2000, PTS 1-3, 2000, 4067 : 1490 - 1498