A Contrastive-Learning-Based Deep Neural Network for Cancer Subtyping by Integrating Multi-Omics Data

被引:0
|
作者
Chai, Hua [1 ]
Deng, Weizhen [1 ]
Wei, Junyu [1 ]
Guan, Ting [1 ]
He, Minfan [1 ]
Liang, Yong [3 ]
Li, Le [2 ,3 ]
机构
[1] Foshan Univ, Sch Math & Big Data, Foshan 528000, Peoples R China
[2] Macau Univ Sci & Technol, Fac Innovat Engn, Macau 999078, Peoples R China
[3] Peng Cheng Lab, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Cancer subtype identification; Multi-omics data; Contrastive learning; Bioinformatics; EXPRESSION; POLYMORPHISMS;
D O I
10.1007/s12539-024-00641-y
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Accurate identification of cancer subtypes is crucial for disease prognosis evaluation and personalized patient management. Recent advances in computational methods have demonstrated that multi-omics data provides valuable insights into tumor molecular subtyping. However, the high dimensionality and small sample size of the data may result in ambiguous and overlapping cancer subtypes during clustering. In this study, we propose a novel contrastive-learning-based approach to address this issue. The proposed end-to-end deep learning method can extract crucial information from the multi-omics features by self-supervised learning for patient clustering. Results By applying our method to nine public cancer datasets, we have demonstrated superior performance compared to existing methods in separating patients with different survival outcomes (p < 0.05). To further evaluate the impact of various omics data on cancer survival, we developed an XGBoost classification model and found that mRNA had the highest importance score, followed by DNA methylation and miRNA. In the presented case study, our method successfully clustered subtypes and identified 14 cancer-related genes, of which 12 (85.7%) were validated through literature review. Conclusions Our findings demonstrate that our method is capable of identifying cancer subtypes that are both statistically and biologically significant. The code about COLCS is given at: https://github.com/Mercuriiio/COLCS.
引用
收藏
页码:966 / 975
页数:10
相关论文
共 50 条
  • [31] Classifying Breast Cancer Subtypes Using Deep Neural Networks Based on Multi-Omics Data
    Lin, Yuqi
    Zhang, Wen
    Cao, Huanshen
    Li, Gaoyang
    Du, Wei
    GENES, 2020, 11 (08) : 1 - 18
  • [32] Multi-omics data integration for hepatocellular carcinoma subtyping with multi-kernel learning
    Wang, Jiaying
    Miao, Yuting
    Li, Lingmei
    Wu, Yongqing
    Ren, Yan
    Cui, Yuehua
    Cao, Hongyan
    FRONTIERS IN GENETICS, 2022, 13
  • [33] PCLSurv: a prototypical contrastive learning-based multi-omics data integration model for cancer survival prediction
    Li, Zhimin
    Chen, Wenlan
    Zhong, Hai
    Liang, Cheng
    BRIEFINGS IN BIOINFORMATICS, 2025, 26 (02)
  • [34] A deep contrastive multi-modal encoder for multi-omics data integration and analysis
    Yinghua, Ma
    Khan, Ahmad
    Heng, Yang
    Khan, Fiaz Gul
    Ali, Farman
    Al-Otaibi, Yasser D.
    Bashir, Ali Kashif
    INFORMATION SCIENCES, 2025, 700
  • [35] Supervised graph contrastive learning for cancer subtype identification through multi-omics data integration
    Chen, Fangxu
    Peng, Wei
    Dai, Wei
    Wei, Shoulin
    Fu, Xiaodong
    Liu, Li
    Liu, Lijun
    HEALTH INFORMATION SCIENCE AND SYSTEMS, 2024, 12 (01)
  • [36] A benchmark study of deep learning-based multi-omics data fusion methods for cancer
    Dongjin Leng
    Linyi Zheng
    Yuqi Wen
    Yunhao Zhang
    Lianlian Wu
    Jing Wang
    Meihong Wang
    Zhongnan Zhang
    Song He
    Xiaochen Bo
    Genome Biology, 23
  • [37] A benchmark study of deep learning-based multi-omics data fusion methods for cancer
    Leng, Dongjin
    Zheng, Linyi
    Wen, Yuqi
    Zhang, Yunhao
    Wu, Lianlian
    Wang, Jing
    Wang, Meihong
    Zhang, Zhongnan
    He, Song
    Bo, Xiaochen
    GENOME BIOLOGY, 2022, 23 (01)
  • [38] Deep learning-based ovarian cancer subtypes identification using multi-omics data
    Long-Yi Guo
    Ai-Hua Wu
    Yong-xia Wang
    Li-ping Zhang
    Hua Chai
    Xue-Fang Liang
    BioData Mining, 13
  • [39] Deep learning-based ovarian cancer subtypes identification using multi-omics data
    Guo, Long-Yi
    Wu, Ai-Hua
    Wang, Yong-xia
    Zhang, Li-ping
    Chai, Hua
    Liang, Xue-Fang
    BIODATA MINING, 2020, 13 (01)
  • [40] Enhancing Lung Cancer Classification and Prediction With Deep Learning and Multi-Omics Data
    Mohamed, Tehnan I. A.
    Ezugwu, Absalom El-Shamir
    IEEE ACCESS, 2024, 12 : 59880 - 59892