Cross-modal alignment and contrastive learning for enhanced cancer survival prediction

被引:0
|
作者
Li, Tengfei [1 ]
Zhou, Xuezhong [1 ]
Xue, Jingyan [1 ]
Zeng, Lili [1 ]
Zhu, Qiang [1 ]
Wang, Ruiping [1 ]
Yu, Haibin [2 ]
Xia, Jianan [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp Sci & Technol, Beijing 100044, Peoples R China
[2] Henan Univ Chinese Med, Affiliated Hosp 1, Zhengzhou 450000, Henan, Peoples R China
基金
中国国家自然科学基金;
关键词
Survival prediction; Histopathological image; Multi-omics; Multi-modal fusion; MODEL;
D O I
10.1016/j.cmpb.2025.108633
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objective: Integrating multimodal data, such as pathology images and genomics, is crucial for understanding cancer heterogeneity, personalized treatment complexity, and enhancing survival prediction. However, most current prognostic methods are limited to a single domain of histopathology or genomics, inevitably reducing their potential for accurate patient outcome prediction. Despite advancements in the concurrent analysis of pathology and genomic data, existing approaches inadequately address the intricate intermodal relationships. Methods: This paper introduces the CPathomic method for multimodal data-based survival prediction. By leveraging whole slide pathology images to guide local pathological features, the method effectively mitigates significant intermodal differences through a cross-modal representational contrastive learning module. Furthermore, it facilitates interactive learning between different modalities through cross-modal and gated attention modules. Results: The extensive experiments on five public TCGA datasets demonstrate that CPathomic framework effectively bridges modality gaps, consistently outperforming alternative multimodal survival prediction methods. Conclusion: The model we propose, CPathomic, unveils the potential of contrastive learning and cross-modal attention in the representation and fusion of multimodal data, enhancing the performance of patient survival prediction.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Cross-Modal Contrastive Learning for Code Search
    Shi, Zejian
    Xiong, Yun
    Zhang, Xiaolong
    Zhang, Yao
    Li, Shanshan
    Zhu, Yangyong
    2022 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2022), 2022, : 94 - 105
  • [2] Cross-modal Contrastive Learning for Speech Translation
    Ye, Rong
    Wang, Mingxuan
    Li, Lei
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5099 - 5113
  • [3] Research on medical publication recommendation based on cross-modal semantic alignment with contrastive learning
    Ding, Hao
    Xia, Zhonghua
    Zhu, Weiwei
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (04)
  • [4] Cross-Modal Translation and Alignment for Survival Analysis
    Zhou, Fengtao
    Chen, Hao
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21428 - 21437
  • [5] Cross-modal contrastive learning for multimodal sentiment recognition
    Yang, Shanliang
    Cui, Lichao
    Wang, Lei
    Wang, Tao
    APPLIED INTELLIGENCE, 2024, 54 (05) : 4260 - 4276
  • [6] Cross-Modal Graph Contrastive Learning with Cellular Images
    Zheng, Shuangjia
    Rao, Jiahua
    Zhang, Jixian
    Zhou, Lianyu
    Xie, Jiancong
    Cohen, Ethan
    Lu, Wei
    Li, Chengtao
    Yang, Yuedong
    ADVANCED SCIENCE, 2024, 11 (32)
  • [7] Cross-modal contrastive learning for multimodal sentiment recognition
    Shanliang Yang
    Lichao Cui
    Lei Wang
    Tao Wang
    Applied Intelligence, 2024, 54 : 4260 - 4276
  • [8] TRAJCROSS: Trajecotry Cross-Modal Retrieval with Contrastive Learning
    Jing, Quanliang
    Yao, Di
    Gong, Chang
    Fan, Xinxin
    Wang, Baoli
    Tan, Haining
    Bi, Jingping
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 344 - 349
  • [9] Recalibrated cross-modal alignment network for radiology report generation with weakly supervised contrastive learning
    Hou, Xiaodi
    Li, Xiaobo
    Liu, Zhi
    Sang, Shengtian
    Lu, Mingyu
    Zhang, Yijia
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 269
  • [10] Unimodal and cross-modal prediction is enhanced in musicians
    Eliana Vassena
    Katty Kochman
    Julie Latomme
    Tom Verguts
    Scientific Reports, 6