Multi-Omic Data Improve Prediction of Personalized Tumor Suppressors and Oncogenes

被引:0
|
作者
Sudhakar, Malvika [1 ,2 ,3 ]
Rengaswamy, Raghunathan [1 ,2 ,4 ]
Raman, Karthik [1 ,2 ,3 ]
机构
[1] Indian Inst Technol IIT Madras, Ctr Integrat Biol & Syst Med IBSE, Chennai, India
[2] IIT Madras, Robert Bosch Ctr Data Sci & Artificial Intelligenc, Chennai, India
[3] IIT Madras, Bhupat & Jyoti Mehta Sch Biosci, Dept Biotechnol, Chennai, India
[4] IIT Madras, Dept Chem Engn, Chennai, India
关键词
machine learning; driver genes; personalized driver genes; cancer genomics; PIVOT; DIFFERENTIAL EXPRESSION ANALYSIS; BREAST-CANCER; MUTATIONAL PROCESSES; DRIVER GENES; HETEROGENEITY; SIGNATURES; EVOLUTION; RECEPTOR; SOX9;
D O I
10.3389/fgene.2022.854190
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The progression of tumorigenesis starts with a few mutational and structural driver events in the cell. Various cohort-based computational tools exist to identify driver genes but require multiple samples to identify less frequently mutated driver genes. Many studies use different methods to identify driver mutations/genes from mutations that have no impact on tumor progression; however, a small fraction of patients show no mutational events in any known driver genes. Current unsupervised methods map somatic and expression data onto a network to identify personalized driver genes based on changes in expression. Our method is the first machine learning model to classify genes as tumor suppressor gene (TSG), oncogene (OG), or neutral, thus assigning the functional impact of the gene in the patient. In this study, we develop a multi-omic approach, PIVOT (Personalized Identification of driVer OGs and TSGs), to train on experimentally or computationally validated mutational and structural driver events. Given the lack of any gold standards for the identification of personalized driver genes, we label the data using four strategies and, based on classification metrics, show gene-based labeling strategies perform best. We build different models using SNV, RNA, and multi-omic features to be used based on the data available. Our models trained on multi-omic data improved predictions compared with mutation and expression data, achieving an accuracy & GE; 0.99 for BRCA, LUAD, and COAD datasets. We show network and expression-based features contribute the most to PIVOT. Our predictions on BRCA, COAD, and LUAD cancer types reveal commonly altered genes such as TP53 and PIK3CA, which are predicted drivers for multiple cancer types. Along with known driver genes, our models also identify new driver genes such as PRKCA, SOX9, and PSMD4. Our multi-omic model labels both CNV and mutations with a more considerable contribution by CNV alterations. While predicting labels for genes mutated in multiple samples, we also label rare driver events occurring in as few as one sample. We also identify genes with dual roles within the same cancer type. Overall, PIVOT labels personalized driver genes as TSGs and OGs and also identified rare driver genes.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Multi-Omic Analysis: A Possible Platform Toward Personalized and Adaptable Cancer Treatment
    Benesch, Matthew G. K.
    Cherkassky, Leonid
    Nurkin, Steven J.
    [J]. ANNALS OF SURGICAL ONCOLOGY, 2024, 31 (08) : 4831 - 4833
  • [32] A Gene Signature of Survival Prediction for Kidney Renal Cell Carcinoma by Multi-Omic Data Analysis
    Hu, Fuyan
    Zeng, Wenying
    Liu, Xiaoping
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (22)
  • [33] 'Multi-omic' data analysis using O-miner
    Sangaralingam, Ajanthah
    Ullah, Abu Z. Dayem
    Marzec, Jacek
    Gadaleta, Emanuela
    Nagano, Ai
    Ross-Adams, Helen
    Wang, Jun
    Lemoine, Nicholas R.
    Chelala, Claude
    [J]. BRIEFINGS IN BIOINFORMATICS, 2019, 20 (01) : 130 - 143
  • [34] Deep Learning and Networks for Integrative Analysis of Multi-Omic Data
    Zhang, Aidong
    [J]. 2018 IEEE 8TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL ADVANCES IN BIO AND MEDICAL SCIENCES (ICCABS), 2018,
  • [35] Multi-omic approaches for host-microbiome data integration
    Chetty, Ashwin
    Blekhman, Ran
    [J]. GUT MICROBES, 2024, 16 (01)
  • [36] HMPDACC: a Human Microbiome Project Multi-omic data resource
    Creasy, Heather Huot
    Felix, Victor
    Aluvathingal, Jain
    Crabtree, Jonathan
    Ifeonu, Olukemi
    Matsumura, James
    McCracken, Carrie
    Nickel, Lance
    Orvis, Joshua
    Schor, Mike
    Giglio, Michelle
    Mahurkar, Anup
    White, Owen
    [J]. NUCLEIC ACIDS RESEARCH, 2021, 49 (D1) : D734 - D742
  • [37] Integrative Analysis of Multi-Omic Data for the Characteristics of Endometrial Cancer
    Li, Tong
    Ruan, Zhijun
    Song, Chunli
    Yin, Feng
    Zhang, Tuanjie
    Shi, Liyun
    Zuo, Min
    Lu, Linlin
    An, Yuhao
    Wang, Rui
    Ye, Xiyang
    [J]. ACS OMEGA, 2024, 9 (12): : 14489 - 14499
  • [38] Multi-Omic Approaches in Colorectal Cancer beyond Genomic Data
    Sardo, Emilia
    Napolitano, Stefania
    Della Corte, Carminia Maria
    Ciardiello, Davide
    Raucci, Antonio
    Arrichiello, Gianluca
    Troiani, Teresa
    Ciardiello, Fortunato
    Martinelli, Erika
    Martini, Giulia
    [J]. JOURNAL OF PERSONALIZED MEDICINE, 2022, 12 (02):
  • [39] NEMO: cancer subtyping by integration of partial multi-omic data
    Rappoport, Nimrod
    Shamir, Ron
    [J]. BIOINFORMATICS, 2019, 35 (18) : 3348 - 3356
  • [40] A Framework for Multi-Omic Prediction of Treatment Response to Biologic Therapy for Psoriasis
    Foulkes, Amy C.
    Watson, David S.
    Carr, Daniel F.
    Kenny, John G.
    Slidel, Timothy
    Parslew, Richard
    Pirmohamed, Munir
    Anders, Simon
    Reynolds, Nick J.
    Griffiths, Christopher E. M.
    Warren, Richard B.
    Barnes, Michael R.
    [J]. JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2019, 139 (01) : 100 - 107