Transformer-Based Multi-Modal Data Fusion Method for COPD Classification and Physiological and Biochemical Indicators Identification

Cited by: 4
Authors
Xie, Weidong [1]
Fang, Yushan [1]
Yang, Guicheng [1]
Yu, Kun [2]
Li, Wei [1,3]
Affiliations
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110169, Peoples R China
[2] Northeastern Univ, Coll Med & Bioinformat Engn, Shenyang 110169, Peoples R China
[3] Key Lab Intelligent Comp Med Image MIIC, Shenyang 110169, Peoples R China
Keywords
multi-modal fusion; cross-modal transformer; low-rank multi-modal fusion; COPD; PREDICTION; PROGNOSIS;
DOI
10.3390/biom13091391
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
As the number of modalities in biomedical data continues to increase, the significance of multi-modal data becomes evident in capturing complex relationships between biological processes, thereby complementing disease classification. However, the current multi-modal fusion methods for biomedical data require more effective exploitation of intra- and inter-modal interactions, and the application of powerful fusion methods to biomedical data is relatively rare. In this paper, we propose a novel multi-modal data fusion method that addresses these limitations. Our proposed method utilizes a graph neural network and a 3D convolutional network to identify intra-modal relationships. By doing so, we can extract meaningful features from each modality, preserving crucial information. To fuse information from different modalities, we employ the Low-rank Multi-modal Fusion method, which effectively integrates multiple modalities while reducing noise and redundancy. Additionally, our method incorporates the Cross-modal Transformer to automatically learn relationships between different modalities, facilitating enhanced information exchange and representation. We validate the effectiveness of our proposed method using lung CT imaging data and physiological and biochemical data obtained from patients diagnosed with Chronic Obstructive Pulmonary Disease (COPD). Our method demonstrates superior performance compared to various fusion methods and their variants in terms of disease classification accuracy.
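As a rough illustration of the low-rank fusion step named in the abstract, the sketch below follows the general idea of Low-rank Multi-modal Fusion: each modality vector is extended with a constant 1, projected by a set of rank-specific factors, and the projections are multiplied element-wise across modalities and then summed over the rank. This is a minimal NumPy sketch, not the authors' implementation. The function name `low_rank_fusion`, the feature sizes, the rank, and the random factors are all assumptions chosen for illustration, and the learned fusion weights and bias that some implementations add are omitted.

```python
import numpy as np


def low_rank_fusion(features, factors):
    """Fuse per-modality feature vectors using rank-R modality factors.

    features: list of arrays, one per modality, each of shape (batch, d_m)
    factors:  list of arrays, one per modality, each of shape (rank, d_m + 1, d_out)
    returns:  fused array of shape (batch, d_out)
    """
    fused = None
    for z, w in zip(features, factors):
        # Prepend a constant 1 so unimodal terms survive the product.
        ones = np.ones((z.shape[0], 1))
        z1 = np.concatenate([ones, z], axis=1)      # (batch, d_m + 1)
        # Project with every rank-specific factor at once.
        proj = np.einsum("bd,rdo->rbo", z1, w)      # (rank, batch, d_out)
        # Element-wise product across modalities.
        fused = proj if fused is None else fused * proj
    # Sum the rank-1 contributions.
    return fused.sum(axis=0)


# Hypothetical sizes: 4 patients, 8 imaging features, 5 lab-test features.
rng = np.random.default_rng(0)
ct_features = rng.normal(size=(4, 8))
lab_features = rng.normal(size=(4, 5))
ct_factors = rng.normal(size=(3, 9, 6))    # rank 3, output size 6
lab_factors = rng.normal(size=(3, 6, 6))

fused = low_rank_fusion([ct_features, lab_features], [ct_factors, lab_factors])
print(fused.shape)
```

The point of the factorization is that the full weight tensor over the outer product of the modality vectors is never built; only the per-modality factors are stored, so the parameter count grows linearly rather than multiplicatively with the number of modalities.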
Pages: 18