Multi-grained Cross-Modal Feature Fusion Network for Diagnosis Prediction

被引:0
|
作者
An, Ying [1 ]
Zhao, Zhenrui [2 ]
Chen, Xianlai [1 ]
机构
[1] Cent South Univ, Big Data Inst, Changsha, Peoples R China
[2] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China
基金
中国国家自然科学基金;
关键词
Electronic Health Records; Multimodal Fusion; Diagnosis Prediction;
D O I
10.1007/978-981-97-5131-0_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Electronic Health Record (EHR) contains a wealth of data from multiple modalities. Utilizing these data to comprehensively reflect changes in patients' conditions and accurately predict their diseases is an important research issue in the medical field. However, most fusion approaches employed in existing multimodal learning studies are excessively simplistic and often neglect the hierarchical nature of intermodal interactions. In this paper, we propose a novel multi-grained cross-modal feature fusion network. In this model, we first use hierarchical encoders to learn multilevel representations of multimodal data and a specially designed attention mechanism to explore hierarchical relationships within a single modality. Afterward, we construct a fine-grained cross-modal clinical semantic relationship graph between code and sentence representations. Then we employ Graph Convolutional Networks (GCN) on this graph to achieve fine-grained feature fusion. Finally, we use attention mechanisms to fully learn the contextual interactions between visit-level multimodal representations, and realize coarsegrained feature fusion. We evaluate our model on two real-world clinical datasets, and the experimental results validate the effectiveness of our model.
引用
收藏
页码:221 / 232
页数:12
相关论文
共 50 条
  • [1] Multi-grained Representation Learning for Cross-modal Retrieval
    Zhao, Shengwei
    Xu, Linhai
    Liu, Yuying
    Du, Shaoyi
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 2194 - 2198
  • [2] Scalable Multi-grained Cross-modal Similarity Query with Interpretability
    Zhu, Mingdong
    Shen, Derong
    Xu, Lixin
    Wang, Xianfang
    DATA SCIENCE AND ENGINEERING, 2021, 6 (03) : 280 - 293
  • [3] Adversarial Multi-Grained Embedding Network for Cross-Modal Text-Video Retrieval
    Han, Ning
    Chen, Jingjing
    Zhang, Hao
    Wang, Huanwen
    Chen, Hao
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (02)
  • [4] Scalable Multi-grained Cross-modal Similarity Query with Interpretability
    Mingdong Zhu
    Derong Shen
    Lixin Xu
    Xianfang Wang
    Data Science and Engineering, 2021, 6 : 280 - 293
  • [5] Complementarity is the king: Multi-modal and multi-grained hierarchical semantic enhancement network for cross-modal retrieval
    Pei, Xinlei
    Liu, Zheng
    Gao, Shanshan
    Su, Yijun
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 216
  • [6] Multi-grained encoding and joint embedding space fusion for video and text cross-modal retrieval
    Xiaotao Cui
    Jing Xiao
    Yang Cao
    Jia Zhu
    Multimedia Tools and Applications, 2022, 81 : 34367 - 34386
  • [7] Multi-grained encoding and joint embedding space fusion for video and text cross-modal retrieval
    Cui, Xiaotao
    Xiao, Jing
    Cao, Yang
    Zhu, Jia
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (24) : 34367 - 34386
  • [8] Multi-Grained Similarity Preserving and Updating for Unsupervised Cross-Modal Hashing
    Wu, Runbing
    Zhu, Xinghui
    Yi, Zeqian
    Zou, Zhuoyang
    Liu, Yi
    Zhu, Lei
    APPLIED SCIENCES-BASEL, 2024, 14 (02):
  • [9] Improving Speech Translation by Cross-Modal Multi-Grained Contrastive Learning
    Zhang, Hao
    Si, Nianwen
    Chen, Yaqi
    Zhang, Wenlin
    Yang, Xukui
    Qu, Dan
    Zhang, Wei-Qiang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1075 - 1086
  • [10] Hybrid Network Based on Cross-Modal Feature Fusion for Diagnosis of Alzheimer's Disease
    Qiu, Zifeng
    Yang, Peng
    Wang, Tianfu
    Lei, Baiying
    ETHICAL AND PHILOSOPHICAL ISSUES IN MEDICAL IMAGING, MULTIMODAL LEARNING AND FUSION ACROSS SCALES FOR CLINICAL DECISION SUPPORT, AND TOPOLOGICAL DATA ANALYSIS FOR BIOMEDICAL IMAGING, EPIMI 2022, ML-CDS 2022, TDA4BIOMEDICALIMAGING, 2022, 13755 : 87 - 99