PMFN-SSL: Self-supervised learning-based progressive multimodal fusion network for cancer diagnosis and prognosis

被引:3
|
作者
Li, Le [1 ,2 ]
Pan, Hudan [3 ]
Liang, Yong [1 ,4 ]
Shao, Mingwen [5 ]
Xie, Shengli [6 ]
Lu, Shanghui [2 ]
Liao, Shuilin [2 ]
机构
[1] Peng Cheng Lab, Shenzhen, Peoples R China
[2] Macau Univ Sci & Technol, Sch Fac Innovat Engn, Macau, Peoples R China
[3] Guangzhou Univ Chinese Med, Affiliated Hosp 2, State Key Lab Tradit Chinese Med Syndrome, Guangzhou, Peoples R China
[4] Pazhou Lab Huangpu, Guangzhou, Peoples R China
[5] China Univ Petr, Coll Comp Sci & Technol, Qingdao, Peoples R China
[6] Guangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
基金
美国国家科学基金会;
关键词
Multimodal learning; Self-supervised learning; Survival analysis; Grade prediction; ARTIFICIAL-INTELLIGENCE; SURVIVAL PREDICTION; CLASSIFICATION; IMAGES;
D O I
10.1016/j.knosys.2024.111502
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The integration of digital pathology images and genetic data is a developing field in cancer research, presenting potential opportunities for predicting survival and classifying grades through multiple source data. However, obtaining comprehensive annotations proves challenging in practical medical settings, and the extraction of features from high -resolution pathology images is hindered by inter-domain disparities. Current data fusion methods ignore the spatio-temporal incongruity among multimodal data. To address the above challenges, we propose a novel self-supervised transformer-based pathology feature extraction strategy, and construct an interpretable Progressive Multimodal Fusion Network (PMFN-SSL) for cancer diagnosis and prognosis. Our contributions are mainly divided into three aspects. Firstly, we propose a joint patch sampling strategy based on the information entropy and HSV components of an image, which reduces the demand for sample annotations and avoid image quality degradation caused by manual contamination. Secondly, a self-supervised transformerbased feature extraction module for pathology images is proposed and innovatively leverages partially weakly supervised labeling to align the extracted features with downstream medical tasks. Further, we improve the existing multimodal feature fusion model with an progressive fusion strategy to reduce the inconsistency between multimodal data due to differences in collection of temporal and spatial. Abundant ablation and comparison experiments demonstrate that the proposed data preprocessing method and multimodal fusion paradigm strengthen the quality of feature extraction and improve the prediction based on real cancer grading and prognosis. Code and trained models are made available at: https://github.com/Mercuriiio/PMFN-SSL.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Progressive Video Summarization via Multimodal Self-supervised Learning
    Li, Haopeng
    Ke, Qiuhong
    Gong, Mingming
    Drummond, Tom
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5573 - 5582
  • [2] Supervised and self-supervised learning-based cascade spatiotemporal fusion framework and its application
    Sun, Weixuan
    Li, Jie
    Jiang, Menghui
    Yuan, Qiangqiang
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 203 : 19 - 36
  • [3] Efficient deep learning-based automated diagnosis from echocardiography with contrastive self-supervised learning
    Holste, Gregory
    Oikonomou, Evangelos K.
    Mortazavi, Bobak J.
    Wang, Zhangyang
    Khera, Rohan
    COMMUNICATIONS MEDICINE, 2024, 4 (01):
  • [4] A layer-wise fusion network incorporating self-supervised learning for multimodal MR image synthesis
    Zhou, Qian
    Zou, Hua
    FRONTIERS IN GENETICS, 2022, 13
  • [5] ON THE IMPACT OF SELF-SUPERVISED LEARNING IN SKIN CANCER DIAGNOSIS
    Verdelho, Maria Rita
    Barata, Catarina
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
  • [6] Self-Supervised Learning Model for Skin Cancer Diagnosis
    Masood, Ammara
    Al-Jumaily, Adel
    Anam, Khairul
    2015 7TH INTERNATIONAL IEEE/EMBS CONFERENCE ON NEURAL ENGINEERING (NER), 2015, : 1012 - 1015
  • [7] Isotropic Self-Supervised Learning for Driver Drowsiness Detection With Attention-Based Multimodal Fusion
    Mou, Luntian
    Zhou, Chao
    Xie, Pengtao
    Zhao, Pengfei
    Jain, Ramesh
    Gao, Wen
    Yin, Baocai
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 529 - 542
  • [8] SSL-Net: Point-Cloud Generation Network With Self-Supervised Learning
    Sun, Ran
    Gao, Yongbin
    Fang, Zhijun
    Wang, Anjie
    Zhong, Cengsi
    IEEE ACCESS, 2019, 7 : 82206 - 82217
  • [9] Optical Flow Estimation through Fusion Network based on Self-supervised Deep Learning
    Liu, Cong
    Shi, Dianxi
    Li, Ruihao
    Xu, Huachi
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [10] BADGR: An Autonomous Self-Supervised Learning-Based Navigation System
    Kahn, Gregory
    Abbeel, Pieter
    Levine, Sergey
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 1312 - 1319