Integration of pan-cancer multi-omics data for novel mixed subgroup identification using machine learning methods

Cited by: 4
Authors
Khadirnaikar, Seema [1]
Shukla, Sudhanshu [2]
Prasanna, S. R. M. [1]
Affiliations
[1] Indian Inst Technol Dharwad, Dept Elect Engn, Dharwad, Karnataka, India
[2] Indian Inst Technol Dharwad, Dept Biosci & Bioengn, Dharwad, Karnataka, India
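Source
PLOS ONE | 2023 / Vol. 18 / Issue 10
Keywords
MOLECULAR CLASSIFICATION; HETEROGENEITY
DOI
10.1371/journal.pone.0287176
Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09
Abstract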
Cancer is a heterogeneous disease, and patients with tumors from different organs can share similar epigenetic and genetic alterations. It is therefore crucial to identify novel subgroups of patients with similar molecular characteristics. A better treatment strategy can be proposed when patient heterogeneity is accounted for during subgroup identification, irrespective of the tissue of origin. This work proposes a machine learning (ML)-based pipeline for pan-cancer subgroup identification. mRNA, miRNA, DNA methylation, and protein expression features from pan-cancer samples were concatenated and non-linearly projected to a lower dimension using an ML algorithm. The projected data were then clustered to identify novel multi-omics-based subgroups. Clinical characterization of these ML subgroups indicated significant differences in overall survival (OS) and disease-free survival (DFS) (p-value < 0.0001). Subgroups formed by patients with different tumors shared similar molecular alterations in terms of immune microenvironment, mutation profile, and enriched pathways. Further, decision-level and feature-level fused classification models were built to assign unseen samples to the novel subgroups. These classification models were then used to obtain class labels for the validation samples, and the molecular characteristics were verified. In summary, this work identified novel ML subgroups using multi-omics data and showed that patients with different tumor types can be molecularly similar. We also proposed and validated classification models for subgroup identification. The proposed models can be used to identify the novel multi-omics subgroups, and the molecular characteristics of each subgroup can be used to design appropriate treatment regimens.
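The workflow described in the abstract (concatenate omics features, project non-linearly to a lower dimension, cluster, then classify unseen samples with feature-level or decision-level fusion) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not name the projection, clustering, or classification algorithms, so kernel PCA, k-means, and logistic regression are stand-ins chosen here. The data are synthetic, and every function name, block dimension, and parameter value is hypothetical.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import KernelPCA
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import StandardScaler


def make_synthetic_omics(n_samples=120, seed=0):
    """Return four synthetic omics blocks (illustrative stand-in, NOT real data)."""
    rng = np.random.default_rng(seed)
    hidden = rng.integers(0, 3, size=n_samples)  # hidden group structure
    dims = {"mrna": 50, "mirna": 20, "methylation": 40, "protein": 15}
    blocks = {}
    for name, d in dims.items():
        centers = rng.normal(0.0, 3.0, size=(3, d))
        blocks[name] = centers[hidden] + rng.normal(size=(n_samples, d))
    return blocks


def discover_subgroups(blocks, n_components=5, n_clusters=3, seed=0):
    """Concatenate blocks, project non-linearly, and cluster the projection."""
    # Scale each block so that no single omics type dominates the concatenation.
    scaled = [StandardScaler().fit_transform(b) for b in blocks.values()]
    X = np.hstack(scaled)
    # Assumption: RBF kernel PCA stands in for the unnamed non-linear projection.
    Z = KernelPCA(n_components=n_components, kernel="rbf").fit_transform(X)
    # Assumption: k-means stands in for the unnamed clustering step.
    labels = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed).fit_predict(Z)
    return X, Z, labels


def feature_level_predict(train_blocks, y_train, test_blocks):
    """Feature-level fusion: one classifier on the concatenated features."""
    clf = LogisticRegression(max_iter=1000)
    clf.fit(np.hstack(list(train_blocks.values())), y_train)
    return clf.predict(np.hstack(list(test_blocks.values())))


def decision_level_predict(train_blocks, y_train, test_blocks):
    """Decision-level fusion: one classifier per block, averaged probabilities."""
    classes = np.unique(y_train)  # matches the column order of predict_proba
    probs = []
    for name in train_blocks:
        clf = LogisticRegression(max_iter=1000).fit(train_blocks[name], y_train)
        probs.append(clf.predict_proba(test_blocks[name]))
    return classes[np.mean(probs, axis=0).argmax(axis=1)]
```

In this sketch the cluster labels from `discover_subgroups` serve as training targets for both fusion strategies, mirroring the abstract's idea of assigning unseen samples to subgroups found by clustering. Feature-level fusion lets a single model see cross-omics interactions, while decision-level fusion keeps one model per omics type and combines their outputs.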
Pages: 21