Medical report generation based on multimodal federated learning

Authors
Chen, Jieying [1]
Pan, Rong [1]
Affiliation
[1] Sun Yat-sen University, School of Computer Science and Engineering, Guangzhou, People's Republic of China
Funding
National Natural Science Foundation of China;
Keywords
Medical image report generation; Multimodal data; Federated Learning; Privacy protection; Deep learning;
DOI
10.1016/j.compmedimag.2024.102342
Chinese Library Classification number
R318 [Biomedical Engineering];
Discipline classification code
0831;
Abstract
Medical image reports are integral to clinical decision-making and patient management. However, the confidential and private nature of medical data makes sharing and analyzing medical images difficult. This paper addresses these concerns by introducing a methodology for medical image report generation based on multimodal federated learning. The methodology uses distributed computing to co-train models across multiple medical institutions. Under the federated learning framework, each institution trains the model locally, and the updated model parameters are then aggregated to build a high-quality medical image report generation model. First, we propose an architecture for multimodal federated learning comprising model selection, parameter aggregation, and algorithm optimization steps. In the model selection phase, we introduce a deep learning-based strategy that trains on multimodal data to produce medical image reports. In the parameter aggregation phase, the federated averaging (FedAvg) algorithm combines the model parameters trained by each institution into a global model. In addition, we introduce an evidence-based optimization algorithm built upon federated averaging. A series of experiments demonstrates the effectiveness of the proposed architecture and scheme in generating medical image reports. Compared with conventional centralized learning methods, our approach not only strengthens the protection of patient confidentiality but also improves the accuracy and overall quality of the generated reports. This research offers a novel solution to the privacy issues associated with sharing and analyzing medical data. The multimodal federated learning method is expected to play an important role in medical image report generation and other medical applications, supporting more precise, efficient, and privacy-preserving medical services for healthcare professionals and patients.
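The parameter aggregation step named in the abstract can be illustrated with a minimal sketch of standard federated averaging. This is not the authors' implementation: the function name `federated_average`, the dictionary-of-lists parameter layout, and the two toy "hospital" models are assumptions made for illustration, and the evidence-based optimization variant is not shown because the abstract does not specify it. Each institution's parameters are weighted by its local sample count.

```python
from typing import Dict, List, Sequence

# Hypothetical layout: each model is a dict mapping a parameter name
# to a flat list of floats.
Params = Dict[str, List[float]]


def federated_average(client_params: Sequence[Params],
                      client_sizes: Sequence[int]) -> Params:
    """Return the sample-count-weighted average of client parameters."""
    if not client_params or len(client_params) != len(client_sizes):
        raise ValueError("need one sample count per client model")
    total = sum(client_sizes)
    if total <= 0:
        raise ValueError("total sample count must be positive")

    global_params: Params = {}
    for name in client_params[0]:
        length = len(client_params[0][name])
        # Weighted sum over clients for each scalar entry, then normalize.
        global_params[name] = [
            sum(p[name][i] * n for p, n in zip(client_params, client_sizes)) / total
            for i in range(length)
        ]
    return global_params


# Toy example: two institutions with 100 and 300 local samples.
hospital_a = {"w": [1.0, 2.0], "b": [0.0]}
hospital_b = {"w": [3.0, 4.0], "b": [1.0]}
global_model = federated_average([hospital_a, hospital_b], [100, 300])
print(global_model)  # {'w': [2.5, 3.5], 'b': [0.75]}
```

Only parameters and sample counts reach the aggregator in this sketch; the raw images and reports stay at each institution, which is the privacy property the abstract emphasizes.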
Pages: 14
Source
Chen, Jieying; Pan, Rong. Medical report generation based on multimodal federated learning [J]. Computerized Medical Imaging and Graphics, 2024, 113