A cross-modal deep metric learning model for disease diagnosis based on chest x-ray images

被引:7
|
作者
Jin, Yufei [1 ,2 ]
Lu, Huijuan [1 ,2 ]
Li, Zhao [3 ]
Wang, Yanbin [4 ]
机构
[1] China JiLiang Univ, Hangzhou 310018, Zhejiang, Peoples R China
[2] Key Lab Electromagnet Wave Informat Technol & Metr, Hangzhou 310018, Zhejiang, Peoples R China
[3] Zhejiang Univ, Hangzhou 310018, Zhejiang, Peoples R China
[4] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310018, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Chest X-ray; Generalized zero-shot learning; Deep metric learning; Cross-modal; Multi-label classification; CLASSIFICATION;
D O I
10.1007/s11042-023-14790-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The emergence of unknown diseases is often with few or no samples available. Zero-shot learning and few-shot learning have promising applications in medical image analysis. In this paper, we propose a Cross-Modal Deep Metric Learning Generalized Zero-Shot Learning (CM-DML-GZSL) model. The proposed network consists of a visual feature extractor, a fixed semantic feature extractor, and a deep regression module. The network belongs to a two-stream network for multiple modalities. In a multi-label setting, each sample contains a small number of positive labels and a large number of negative labels on average. This positive-negative imbalance dominates the optimization procedure and may prevent the establishment of an effective correspondence between visual features and semantic vectors during training, resulting in a low degree of accuracy. A novel weighted focused Euclidean distance metric loss is introduced in this regard. This loss not only can dynamically increase the weight of hard samples and decrease the weight of simple samples, but it can also promote the connection between samples and semantic vectors corresponding to their positive labels, which helps mitigate bias in predicting unseen classes in the generalized zero-shot learning setting. The weighted focused Euclidean distance metric loss function can dynamically adjust sample weights, enabling zero-shot multi-label learning for chest X-ray diagnosis, as experimental results on large publicly available datasets demonstrate.
引用
收藏
页码:33421 / 33442
页数:22
相关论文
共 50 条
  • [1] A cross-modal deep metric learning model for disease diagnosis based on chest x-ray images
    Yufei Jin
    Huijuan Lu
    Zhao Li
    Yanbin Wang
    Multimedia Tools and Applications, 2023, 82 : 33421 - 33442
  • [2] Explainable Multi-Modal Deep Learning With Cross-Modal Attention for Diagnosis of Dyssynergic Defecation Using Abdominal X-Ray Images and Symptom Questionnaire
    Sangnark, Sirapob
    Rattanachaisit, Pakkapon
    Patcharatrakul, Tanisa
    Vateekul, Peerapon
    IEEE ACCESS, 2024, 12 : 78132 - 78147
  • [3] Quantitative evaluation model of variable diagnosis for chest X-ray images using deep learning
    Nakagawa, Shota
    Ono, Naoaki
    Hakamata, Yukichika
    Ishii, Takashi
    Saito, Akira
    Yanagimoto, Shintaro
    Kanaya, Shigehiko
    PLOS DIGITAL HEALTH, 2024, 3 (03):
  • [4] Diagnosis of Pneumonia from Chest X-Ray Images using Deep Learning
    Ayan, Enes
    Unver, Halil Murat
    2019 SCIENTIFIC MEETING ON ELECTRICAL-ELECTRONICS & BIOMEDICAL ENGINEERING AND COMPUTER SCIENCE (EBBT), 2019,
  • [5] Cross-modal retrieval of chest X-ray images and diagnostic reports based on report entity graph and dual attention
    Ou, Weihua
    Chen, Yingjie
    Liang, Linqing
    Gou, Jianping
    Xiong, Jiahao
    Zhang, Jiacheng
    Lai, Lingge
    Zhang, Lei
    MULTIMEDIA SYSTEMS, 2025, 31 (01)
  • [6] DEEP LEARNING CLASSIFICATION OF CHEST X-RAY IMAGES
    Majdi, Mohammad S.
    Salman, Khalil N.
    Morris, Michael F.
    Merchant, Nirav C.
    Rodriguez, Jeffrey J.
    2020 IEEE SOUTHWEST SYMPOSIUM ON IMAGE ANALYSIS AND INTERPRETATION (SSIAI 2020), 2020, : 116 - 119
  • [7] Oracle Character Recognition Based on Cross-Modal Deep Metric Learning
    Zhang Y.-K.
    Zhang H.
    Liu Y.-G.
    Liu C.-L.
    Zidonghua Xuebao/Acta Automatica Sinica, 2021, 47 (04): : 791 - 800
  • [8] Deep adversarial metric learning for cross-modal retrieval
    Xu, Xing
    He, Li
    Lu, Huimin
    Gao, Lianli
    Ji, Yanli
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (02): : 657 - 672
  • [9] Deep Coupled Metric Learning for Cross-Modal Matching
    Liong, Venice Erin
    Lu, Jiwen
    Tan, Yap-Peng
    Zhou, Jie
    IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (06) : 1234 - 1244
  • [10] Deep adversarial metric learning for cross-modal retrieval
    Xing Xu
    Li He
    Huimin Lu
    Lianli Gao
    Yanli Ji
    World Wide Web, 2019, 22 : 657 - 672