Image Distillation for Safe Data Sharing in Histopathology

被引:0
|
作者
Li, Zhe [1 ]
Kainz, Bernhard [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Image Data Explorat & Anal Lab, Erlangen, Germany
基金
欧洲研究理事会;
关键词
Dataset Distillation; Image Generation; Privacy;
D O I
10.1007/978-3-031-72117-5_43
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Histopathology can help clinicians make accurate diagnoses, determine disease prognosis, and plan appropriate treatment strategies. As deep learning techniques prove successful in the medical domain, the primary challenges become limited data availability and concerns about data sharing and privacy. Federated learning has addressed this challenge by training models locally and updating parameters on a server. However, issues, such as domain shift and bias, persist and impact overall performance. Dataset distillation presents an alternative approach to overcoming these challenges. It involves creating a small synthetic dataset that encapsulates essential information, which can be shared without constraints. At present, this paradigm is not practicable as current distillation approaches only generate non human readable representations and exhibit insufficient performance for downstream learning tasks. We train a latent diffusion model and construct a new distilled synthetic dataset with a small number of human readable synthetic images. Selection of maximally informative synthetic images is done via graph community analysis of the representation space. We compare downstream classification models trained on our synthetic distillation data to models trained on real data and reach performances suitable for practical application. Codes are available at https://github.com/ZheLi2020/InfoDist.
引用
收藏
页码:459 / 469
页数:11
相关论文
共 50 条
  • [1] Quintet Margin Loss for an Improved Knowledge Distillation in Histopathology Image Analysis
    Vuong, Trinh T. L.
    Kwak, Jin Tae
    MEDICAL IMAGING 2023, 2023, 12471
  • [2] Compressed gastric image generation based on soft-label dataset distillation for medical data sharing
    Li, Guang
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 227
  • [3] Compressed gastric image generation based on soft-label dataset distillation for medical data sharing
    Li, Guang
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    Computer Methods and Programs in Biomedicine, 2022, 227
  • [4] Safe Secret Image Sharing with Fault Tolerance Key
    Fang, Wen-Pinn
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2011, 11 (09): : 20 - 23
  • [5] Encouraging Data Sharing for Safe Autonomous Driving
    Kim, Keonhyeong
    Jung, Im Y.
    2020 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS (PERCOM WORKSHOPS), 2020,
  • [6] Scripting Multiple CPUs with Safe Data Sharing
    Skyrme, Alexandre
    Rodriguez, Noemi
    Ierusalimschy, Roberto
    IEEE SOFTWARE, 2014, 31 (05) : 44 - 51
  • [7] Sharing and reusing cell image data
    Zaritsky, Assaf
    MOLECULAR BIOLOGY OF THE CELL, 2018, 29 (11) : 1274 - 1280
  • [8] Realistic Data Enrichment for Robust Image Segmentation in Histopathology
    Cechnicka, Sarah
    Ball, James
    Reynaud, Hadrien
    Arthurs, Callum
    Roufosse, Candice
    Kainz, Bernhard
    DOMAIN ADAPTATION AND REPRESENTATION TRANSFER, DART 2023, 2024, 14293 : 63 - 72
  • [9] Overcoming the challenges of multi-modal medical image sharing: A novel data distillation strategy via contrastive learning
    Du, Taoli
    Li, Wenhui
    Wang, Zeyu
    Yang, Feiyang
    Teng, Peihong
    Yi, Xingcheng
    Chen, Hongyu
    Wang, Zixuan
    Zhang, Ping
    Zhang, Tianyang
    NEUROCOMPUTING, 2025, 617
  • [10] Data Augmentation Based on DiscrimDiff for Histopathology Image Classification
    Guan, Xianchao
    Wang, Yifeng
    Lin, Yiyang
    Zhang, Yongbing
    DATA AUGMENTATION, LABELLING, AND IMPERFECTIONS, DALI 2023, 2024, 14379 : 53 - 62