Compressed gastric image generation based on soft-label dataset distillation for medical data sharing

被引:17
|
作者
Li, Guang [1 ]
Togo, Ren [2 ]
Ogawa, Takahiro [2 ]
Haseyama, Miki [2 ]
机构
[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, N-14, W-9, Kita Ku, Sapporo 0600814, Japan
[2] Hokkaido Univ, Fac Informat Sci & Technol, N-14, W-9, Kita Ku, Sapporo 0600814, Japan
关键词
Medical image distillation; Medical data sharing; Model compression; Anonymization; HELICOBACTER-PYLORI INFECTION; HEALTH-CARE; PRIVACY; CLOUD; MACHINES; RECORDS; MODEL; RISK;
D O I
10.1016/j.cmpb.2022.107189
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and objective: Sharing of medical data is required to enable the cross-agency flow of health-care information and construct high-accuracy computer-aided diagnosis systems. However, the large sizes of medical datasets, the massive amount of memory of saved deep convolutional neural network (DCNN) models, and patients ' privacy protection are problems that can lead to inefficient medical data sharing. Therefore, this study proposes a novel soft-label dataset distillation method for medical data sharing. Methods: The proposed method distills valid information of medical image data and generates several compressed images with different data distributions for anonymous medical data sharing. Furthermore, our method can extract essential weights of DCNN models to reduce the memory required to save trained models for efficient medical data sharing. Results: The proposed method can compress tens of thousands of images into several soft-label images and reduce the size of a trained model to a few hundredths of its original size. The compressed images obtained after distillation have been visually anonymized; therefore, they do not contain the private in-formation of the patients. Furthermore, we can realize high-detection performance with a small number of compressed images. Conclusions: The experimental results show that the proposed method can improve the efficiency and security of medical data sharing. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:9
相关论文
共 18 条
  • [1] Compressed gastric image generation based on soft-label dataset distillation for medical data sharing
    Li, Guang
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    Computer Methods and Programs in Biomedicine, 2022, 227
  • [2] Soft-Label Dataset Distillation and Text Dataset Distillation
    Sucholutsky, Ilia
    Schonlau, Matthias
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [3] SOFT-LABEL ANONYMOUS GASTRIC X-RAY IMAGE DISTILLATION
    Li, Guang
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 305 - 309
  • [4] Overcoming barriers to data sharing with medical image generation: a comprehensive evaluation
    August DuMont Schütte
    Jürgen Hetzel
    Sergios Gatidis
    Tobias Hepp
    Benedikt Dietz
    Stefan Bauer
    Patrick Schwab
    npj Digital Medicine, 4
  • [5] Overcoming barriers to data sharing with medical image generation: a comprehensive evaluation
    Schuette, August DuMont
    Hetzel, Juergen
    Gatidis, Sergios
    Hepp, Tobias
    Dietz, Benedikt
    Bauer, Stefan
    Schwab, Patrick
    NPJ DIGITAL MEDICINE, 2021, 4 (01)
  • [6] Label Generation System Based on Generative Adversarial Network for Medical Image
    Li, Jiyun
    Hong, Yongliang
    2019 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION (AIPR 2019), 2019, : 78 - 82
  • [7] Cyclic Federated Learning Method Based on Distribution Information Sharing and Knowledge Distillation for Medical Data
    Yu, Liang
    Huang, Jianjun
    ELECTRONICS, 2022, 11 (23)
  • [8] A data sharing method for remote medical system based on federated distillation learning and consortium blockchain
    Li, Ning
    Zhang, Ruijie
    Zhu, Chengyu
    Ou, Wei
    Han, Wenbao
    Zhang, Qionglu
    CONNECTION SCIENCE, 2023, 35 (01)
  • [9] Feature-Based Dataset Fingerprinting for Clustered Federated Learning on Medical Image Data
    Scheliga, Daniel
    Maeder, Patrick
    Seeland, Marco
    APPLIED ARTIFICIAL INTELLIGENCE, 2024, 38 (01)
  • [10] Overcoming the challenges of multi-modal medical image sharing: A novel data distillation strategy via contrastive learning
    Du, Taoli
    Li, Wenhui
    Wang, Zeyu
    Yang, Feiyang
    Teng, Peihong
    Yi, Xingcheng
    Chen, Hongyu
    Wang, Zixuan
    Zhang, Ping
    Zhang, Tianyang
    NEUROCOMPUTING, 2025, 617