A cross-modal deep metric learning model for disease diagnosis based on chest x-ray images

Cited by: 7
Authors
Jin, Yufei [1 ,2 ]
Lu, Huijuan [1 ,2 ]
Li, Zhao [3 ]
Wang, Yanbin [4 ]
Affiliations
[1] China JiLiang Univ, Hangzhou 310018, Zhejiang, Peoples R China
[2] Key Lab Electromagnet Wave Informat Technol & Metr, Hangzhou 310018, Zhejiang, Peoples R China
[3] Zhejiang Univ, Hangzhou 310018, Zhejiang, Peoples R China
[4] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310018, Zhejiang, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Chest X-ray; Generalized zero-shot learning; Deep metric learning; Cross-modal; Multi-label classification; CLASSIFICATION;
DOI
10.1007/s11042-023-14790-7
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Unknown diseases often emerge with few or no samples available, which makes zero-shot and few-shot learning promising for medical image analysis. In this paper, we propose a Cross-Modal Deep Metric Learning Generalized Zero-Shot Learning (CM-DML-GZSL) model. The network is a two-stream, multi-modal architecture consisting of a visual feature extractor, a fixed semantic feature extractor, and a deep regression module. In a multi-label setting, each sample carries, on average, a small number of positive labels and a large number of negative labels. This positive-negative imbalance dominates optimization and can prevent the network from learning an effective correspondence between visual features and semantic vectors during training, resulting in low accuracy. To address this, we introduce a novel weighted focused Euclidean distance metric loss. The loss dynamically increases the weight of hard samples and decreases the weight of easy samples, and it strengthens the connection between each sample and the semantic vectors of its positive labels, which helps mitigate bias in predicting unseen classes in the generalized zero-shot learning setting. Experimental results on large publicly available datasets demonstrate that this loss enables zero-shot multi-label learning for chest X-ray diagnosis.
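The abstract does not give the formula for the weighted focused Euclidean distance metric loss, so the following is only a rough sketch of the general idea it describes (focal-style re-weighting of Euclidean distances between visual embeddings and label semantic vectors), not the authors' definition. The function name `focal_euclidean_loss`, the parameters `gamma`, `pos_weight`, and `margin`, and the use of `exp(-distance)` as a similarity score are all assumptions made for illustration.

```python
import numpy as np

def focal_euclidean_loss(visual, semantic, labels,
                         gamma=2.0, pos_weight=4.0, margin=1.0):
    """Illustrative focal-weighted Euclidean metric loss (hypothetical form).

    visual:   (B, D) visual embeddings projected into the semantic space
    semantic: (C, D) one fixed semantic vector per class label
    labels:   (B, C) multi-hot ground truth, 1 = positive label
    """
    # Pairwise Euclidean distances between samples and label vectors: (B, C).
    diff = visual[:, None, :] - semantic[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1) + 1e-12)

    # Assumed similarity in (0, 1]; 1 means the embedding sits on the label vector.
    sim = np.exp(-dist)

    # Positives: pull toward the label vector. Far (hard) positives get
    # a larger focal weight (1 - sim)^gamma; near (easy) ones are down-weighted.
    pos_term = (1.0 - sim) ** gamma * dist ** 2

    # Negatives: push away up to a margin. Close (hard) negatives get
    # a larger focal weight sim^gamma.
    neg_term = sim ** gamma * np.maximum(0.0, margin - dist) ** 2

    # pos_weight up-weights the few positive pairs against the many negatives.
    per_pair = pos_weight * labels * pos_term + (1.0 - labels) * neg_term
    return per_pair.mean()


# Toy usage: three classes with one-hot "semantic vectors", one sample
# whose only positive label is class 0.
semantic = np.eye(3)
labels = np.array([[1.0, 0.0, 0.0]])
loss_near = focal_euclidean_loss(np.array([[1.0, 0.0, 0.0]]), semantic, labels)
loss_far = focal_euclidean_loss(np.array([[0.0, 1.0, 0.0]]), semantic, labels)
```

In this toy setup the embedding that coincides with its positive label vector incurs a much smaller loss than the one that coincides with a negative label vector. At inference time, a metric-learning model of this kind would typically score each label, seen or unseen, by its distance to the sample's embedding.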
Pages: 33421-33442
Page count: 22
Related papers
50 in total
  • [21] Multi-modal fusion of deep transfer learning based COVID-19 diagnosis and classification using chest x-ray images
    A. Siva Krishna Reddy
    K. N. Brahmaji Rao
    Narasimha Reddy Soora
    Kotte Shailaja
    N. C. Santosh Kumar
    Abel Sridharan
    J. Uthayakumar
    Multimedia Tools and Applications, 2023, 82 : 12653 - 12677
  • [22] X-TRA: Improving Chest X-ray Tasks with Cross-Modal Retrieval Augmentation
    van Sonsbeek, Tom
    Worring, Marcel
    INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2023, 2023, 13939 : 471 - 482
  • [23] Category supervised cross-modal hashing retrieval for chest X-ray and radiology reports
    Zhang, Yong
    Ou, Weihua
    Zhang, Jiacheng
    Deng, Jiaxin
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 98
  • [24] Identifying disease-free chest X-ray images with deep transfer learning
    Wong, Ken C. L.
    Moradi, Mehdi
    Wu, Joy
    Syeda-Mahmood, Tanveer
    MEDICAL IMAGING 2019: COMPUTER-AIDED DIAGNOSIS, 2019, 10950
  • [25] Heart Segmentation on PA Chest X-ray Images by Model-Based Deep Learning Approach
    Tumay, Adam
    Hadhazi, Daniel
    Hullam, Gabor
    2024 IEEE INTERNATIONAL SYMPOSIUM ON MEDICAL MEASUREMENTS AND APPLICATIONS, MEMEA 2024, 2024,
  • [26] Study-level cross-modal retrieval of chest x-ray images and reports with adapter-based fine-tuning
    Chen, Yingjie
    Ou, Weihua
    Gao, Zhifan
    Lai, Lingge
    Wu, Yang
    Chen, Qianqian
    PHYSICS IN MEDICINE AND BIOLOGY, 2025, 70 (04):
  • [27] Deep Learning Based Mathematical Model for Feature Extraction to Detect Corona Virus Disease using Chest X-ray Images
    Gupta, Rajeev Kumar
    Sahu, Yatendra
    Kunhare, Nilesh
    Gupta, Abhishek
    Prakash, Deo
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2021, 29 (06) : 921 - 947
  • [28] COVID-19 Diagnosis Through Deep Learning Techniques and Chest X-Ray Images
    Negreiros R.R.B.
    Silva I.H.S.
    Alves A.L.F.
    Valadares D.C.G.
    Perkusich A.
    Baptista C.S.
    SN Computer Science, 4 (5)
  • [29] Cross-modal metric learning and local attention for referring relationships in images
    Zhu, Jian
    Wang, Hanli
    DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 800 - 807
  • [30] COVID Pneumonia Prediction Based on Chest X-Ray Images Using Deep Learning
    Khare, Akshat
    Patel, Pranjal
    Sankaranarayanan, Suresh
    Lorenz, Pascal
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 2580 - 2585