Image Distillation for Safe Data Sharing in Histopathology

被引:0
|
作者
Li, Zhe [1 ]
Kainz, Bernhard [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Image Data Explorat & Anal Lab, Erlangen, Germany
基金
欧洲研究理事会;
关键词
Dataset Distillation; Image Generation; Privacy;
D O I
10.1007/978-3-031-72117-5_43
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Histopathology can help clinicians make accurate diagnoses, determine disease prognosis, and plan appropriate treatment strategies. As deep learning techniques prove successful in the medical domain, the primary challenges become limited data availability and concerns about data sharing and privacy. Federated learning has addressed this challenge by training models locally and updating parameters on a server. However, issues, such as domain shift and bias, persist and impact overall performance. Dataset distillation presents an alternative approach to overcoming these challenges. It involves creating a small synthetic dataset that encapsulates essential information, which can be shared without constraints. At present, this paradigm is not practicable as current distillation approaches only generate non human readable representations and exhibit insufficient performance for downstream learning tasks. We train a latent diffusion model and construct a new distilled synthetic dataset with a small number of human readable synthetic images. Selection of maximally informative synthetic images is done via graph community analysis of the representation space. We compare downstream classification models trained on our synthetic distillation data to models trained on real data and reach performances suitable for practical application. Codes are available at https://github.com/ZheLi2020/InfoDist.
引用
收藏
页码:459 / 469
页数:11
相关论文
共 50 条
  • [21] Review on Safe Reversible Image Data Hiding
    Naqash, Talha
    Iqbal, Assad
    Shah, Sajjad Hussain
    2019 IEEE 9TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2019, : 929 - 932
  • [22] ImPACT: A networked service architecture for safe sharing of restricted data
    Baldin, Ilya
    Chase, Jeff
    Crabtree, Jonathan
    Nechyba, Thomas
    Christopherson, Laura
    Stealey, Michael
    Kneifel, Charley
    Orlikowski, Victor
    Carter, Rob
    Scott, Erik
    Sone, Akio
    Sizemore, Don
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 129 : 269 - 285
  • [23] A RS image sharing data model based on Data Grid
    Wang, Lianbei
    2008 PROCEEDINGS OF INFORMATION TECHNOLOGY AND ENVIRONMENTAL SYSTEM SCIENCES: ITESS 2008, VOL 1, 2008, : 312 - 316
  • [24] Generative Image Translation for Data Augmentation in Colorectal Histopathology Images
    Wei, Jerry
    Suriawinata, Arief
    Vaickus, Louis
    Ren, Bing
    Liu, Xiaoying
    Wei, Jason
    Hassanpour, Saeed
    MACHINE LEARNING FOR HEALTH WORKSHOP, VOL 116, 2019, 116 : 10 - +
  • [25] Image-to-Lidar Relational Distillation for Autonomous Driving Data
    Mahmoud, Anas
    Harakeh, Ali
    Waslander, Steven
    COMPUTER VISION - ECCV 2024, PT LXII, 2025, 15120 : 459 - 475
  • [26] Solar Distillation Safe water
    不详
    CURRENT SCIENCE, 2017, 113 (04): : 540 - 540
  • [27] Community Image Data portal: sharing licensed Earth observation data
    Eyraud, F.
    Burger, A.
    Strand, P.
    Di Matteo, G.
    INTERNATIONAL JOURNAL OF SPATIAL DATA INFRASTRUCTURES RESEARCH, 2011, 6 : 187 - 205
  • [28] SAFE SHARING SITES
    Austin, Lisa M.
    Lie, David
    NEW YORK UNIVERSITY LAW REVIEW, 2019, 94 (04) : 581 - 623
  • [29] A Sharing Analysis for SAFE
    Pena, Ricardo
    Segura, Clara
    Montenegro, Manuel
    TRENDS IN FUNCTIONAL PROGRAMMING, VOL 7, 2007, 7 : 109 - 128
  • [30] An Online System for Sharing Image Data for Cardiac Modeling
    Nasaruddin, Fariza Hanum
    Zakeryfar, Maryam
    PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON APPLIED COMPUTER SCIENCE (ACS'08): RECENT ADVANCES ON APPLIED COMPUTER SCIENCE, 2008, : 139 - +