Prediction of disease-free survival for precision medicine using cooperative learning on multi-omic data

被引:1
|
作者
Hahn, Georg [1 ]
Prokopenko, Dmitry [2 ]
Hecker, Julian [3 ]
Lutz, Sharon M. [1 ]
Mullin, Kristina [2 ]
Sejour, Leinal
Hide, Winston [4 ]
Vlachos, Ioannis [4 ]
DeSantis, Stacia [5 ]
Tanzi, Rudolph E. [2 ]
Lange, Christoph [1 ]
机构
[1] Harvard TH Chan Sch Publ Hlth, Dept Biostat, 677 Huntington Ave, Boston, MA 02115 USA
[2] Massachusetts Gen Hosp MGH, McCance Ctr Brain Hlth, Dept Neurol, Genet & Aging Res Unit, Boston, MA 02114 USA
[3] Brigham & Womens Hosp, Harvard Med Sch, Dept Med, Cardiovasc Div, 75 Francis St, Boston, MA 02115 USA
[4] Beth Israel Deaconess Med Ctr, Dept Pathol, 330 Brookline Ave, Boston, MA 02215 USA
[5] Univ Texas Hlth Sci Ctr Houston, 1200 Pressler St,Houston Campus, Houston, TX 77030 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
Alzheimer; cooperative learning; Cox proportional hazard; lasso; penalized regression; precision medicine; survival; INSIGHTS; RISK;
D O I
10.1093/bib/bbae267
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In precision medicine, both predicting the disease susceptibility of an individual and forecasting its disease-free survival are areas of key research. Besides the classical epidemiological predictor variables, data from multiple (omic) platforms are increasingly available. To integrate this wealth of information, we propose new methodology to combine both cooperative learning, a recent approach to leverage the predictive power of several datasets, and polygenic hazard score models. Polygenic hazard score models provide a practitioner with a more differentiated view of the predicted disease-free survival than the one given by merely a point estimate, for instance computed with a polygenic risk score. Our aim is to leverage the advantages of cooperative learning for the computation of polygenic hazard score models via Cox's proportional hazard model, thereby improving the prediction of the disease-free survival. In our experimental study, we apply our methodology to forecast the disease-free survival for Alzheimer's disease (AD) using three layers of data. One layer contains epidemiological variables such as sex, APOE (apolipoprotein E, a genetic risk factor for AD) status and 10 leading principal components. Another layer contains selected genomic loci, and the last layer contains methylation data for selected CpG sites. We demonstrate that the survival curves computed via cooperative learning yield an AUC of around $0.7$, above the state-of-the-art performance of its competitors. Importantly, the proposed methodology returns (1) a linear score that can be easily interpreted (in contrast to machine learning approaches), and (2) a weighting of the predictive power of the involved data layers, allowing for an assessment of the importance of each omic (or other) platform. Similarly to polygenic hazard score models, our methodology also allows one to compute individual survival curves for each patient.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Improvement in the prediction power of an astrocyte genome-scale metabolic model using multi-omic data
    Angarita-Rodriguez, Andrea
    Mendoza-Mejia, Nicolas
    Gonzalez, Janneth
    Papin, Jason
    Aristizabal, Andres Felipe
    Pinzon, Andres
    FRONTIERS IN SYSTEMS BIOLOGY, 2025, 4
  • [22] Impact of hemolysis on multi-OMIC pancreatic biomarker discovery to derisk biomarker development in precision medicine studies
    Searfoss, Richard
    Shah, Punit
    Ofori-Mensa, Kennedy
    Bussberg, Valerie
    Tolstikov, Vladimir
    Greenwood, Bennett
    Li, Hongyan
    Richardson, Kris
    Miller, Gregory M.
    DeCicco, Corinne
    Granger, Elder
    Rodrigues, Leonardo O.
    Grund, Eric M.
    Moser, A. James
    Sarangarajan, Rangaprasad
    Narain, Niven R.
    Kiebish, Michael A.
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [23] Overall and Disease-Free Survival Prediction of Postoperative Breast Cancer Patients using Machine Learning Techniques
    Shouket, Tahreem
    Mahmood, Sajid
    Hassan, Malik Tahir
    Iftikhar, Afnan
    2019 22ND IEEE INTERNATIONAL MULTI TOPIC CONFERENCE (INMIC), 2019, : 238 - 243
  • [24] IDENTIFYING MOLECULAR FEATURES CONTRIBUTING TO DISEASE HETEROGENEITY IN CROHN'S DISEASE USING MULTI-OMIC DATA INTEGRATION
    Sudhakar, Padhmanand
    Verstockt, Bram
    Creyns, Brecht
    Cremer, Jonathan
    Van Assche, Gert A.
    Korcsmaros, Tamas
    Ferrante, Marc
    Vermeire, Severine
    GASTROENTEROLOGY, 2019, 156 (06) : S1117 - S1117
  • [25] Prediction of Disease-Free Survival in Breast Cancer using Deep Learning with Ultrasound and Mammography: A Multicenter Study
    Han, Junqi
    Hua, Hui
    Fei, Jie
    Liu, Jingjing
    Guo, Yijun
    Ma, Wenjuan
    Chen, Jingjing
    CLINICAL BREAST CANCER, 2024, 24 (03) : 215 - 226
  • [26] Impact of hemolysis on multi-OMIC pancreatic biomarker discovery to derisk biomarker development in precision medicine studies
    Richard Searfoss
    Punit Shah
    Kennedy Ofori-Mensa
    Valerie Bussberg
    Vladimir Tolstikov
    Bennett Greenwood
    Hongyan Li
    Kris Richardson
    Gregory M. Miller
    Corinne DeCicco
    Elder Granger
    Leonardo O. Rodrigues
    Eric M. Grund
    A. James Moser
    Rangaprasad Sarangarajan
    Niven R. Narain
    Michael A. Kiebish
    Scientific Reports, 12
  • [27] Multi-omic integration of microbiome data for identifying disease-associated modules
    Efrat Muller
    Itamar Shiryan
    Elhanan Borenstein
    Nature Communications, 15
  • [28] Integrating Multi-Omic Data With Deep Subspace Fusion Clustering for Cancer Subtype Prediction
    Yang, Bo
    Zhang, Yupei
    Pang, Shanmin
    Shang, Xuequn
    Zhao, Xueqing
    Han, Minghui
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (01) : 216 - 226
  • [29] Multi-omic integration of microbiome data for identifying disease-associated modules
    Muller, Efrat
    Shiryan, Itamar
    Borenstein, Elhanan
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [30] DiscoverSL: an R package for multi-omic data driven prediction of synthetic lethality in cancers
    Das, Shaoli
    Deng, Xiang
    Camphausen, Kevin
    Shankavaram, Uma
    BIOINFORMATICS, 2019, 35 (04) : 701 - 702