Prediction of disease-free survival for precision medicine using cooperative learning on multi-omic data

被引:1
|
作者
Hahn, Georg [1 ]
Prokopenko, Dmitry [2 ]
Hecker, Julian [3 ]
Lutz, Sharon M. [1 ]
Mullin, Kristina [2 ]
Sejour, Leinal
Hide, Winston [4 ]
Vlachos, Ioannis [4 ]
DeSantis, Stacia [5 ]
Tanzi, Rudolph E. [2 ]
Lange, Christoph [1 ]
机构
[1] Harvard TH Chan Sch Publ Hlth, Dept Biostat, 677 Huntington Ave, Boston, MA 02115 USA
[2] Massachusetts Gen Hosp MGH, McCance Ctr Brain Hlth, Dept Neurol, Genet & Aging Res Unit, Boston, MA 02114 USA
[3] Brigham & Womens Hosp, Harvard Med Sch, Dept Med, Cardiovasc Div, 75 Francis St, Boston, MA 02115 USA
[4] Beth Israel Deaconess Med Ctr, Dept Pathol, 330 Brookline Ave, Boston, MA 02215 USA
[5] Univ Texas Hlth Sci Ctr Houston, 1200 Pressler St,Houston Campus, Houston, TX 77030 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
Alzheimer; cooperative learning; Cox proportional hazard; lasso; penalized regression; precision medicine; survival; INSIGHTS; RISK;
D O I
10.1093/bib/bbae267
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In precision medicine, both predicting the disease susceptibility of an individual and forecasting its disease-free survival are areas of key research. Besides the classical epidemiological predictor variables, data from multiple (omic) platforms are increasingly available. To integrate this wealth of information, we propose new methodology to combine both cooperative learning, a recent approach to leverage the predictive power of several datasets, and polygenic hazard score models. Polygenic hazard score models provide a practitioner with a more differentiated view of the predicted disease-free survival than the one given by merely a point estimate, for instance computed with a polygenic risk score. Our aim is to leverage the advantages of cooperative learning for the computation of polygenic hazard score models via Cox's proportional hazard model, thereby improving the prediction of the disease-free survival. In our experimental study, we apply our methodology to forecast the disease-free survival for Alzheimer's disease (AD) using three layers of data. One layer contains epidemiological variables such as sex, APOE (apolipoprotein E, a genetic risk factor for AD) status and 10 leading principal components. Another layer contains selected genomic loci, and the last layer contains methylation data for selected CpG sites. We demonstrate that the survival curves computed via cooperative learning yield an AUC of around $0.7$, above the state-of-the-art performance of its competitors. Importantly, the proposed methodology returns (1) a linear score that can be easily interpreted (in contrast to machine learning approaches), and (2) a weighting of the predictive power of the involved data layers, allowing for an assessment of the importance of each omic (or other) platform. Similarly to polygenic hazard score models, our methodology also allows one to compute individual survival curves for each patient.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Supervised Graph Clustering for Cancer Subtyping Based on Survival Analysis and Integration of Multi-Omic Tumor Data
    Liu, Cheng
    Cao, Wenming
    Wu, Si
    Shen, Wenjun
    Jiang, Dazhi
    Yu, Zhiwen
    Wong, Hau-San
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (02) : 1193 - 1202
  • [42] Applying Artificial Intelligence to Multi-Omic Data: New Functional Variants in Parkinson's Disease
    Welton, Thomas
    Tan, Eng-King
    MOVEMENT DISORDERS, 2021, 36 (02) : 347 - 347
  • [43] Improved Statistical Methods and Bioinformatic Tools to Integrate Multi-Omic Data in Disease Association Studies
    Gonzalez, Juan R.
    HUMAN HEREDITY, 2016, 81 (04) : 223 - 223
  • [44] Synthetic augmentation of cancer cell line multi-omic datasets using unsupervised deep learning
    Cai, Zhaoxiang
    Apolinario, Sofia
    Baiao, Ana R.
    Pacini, Clare
    Sousa, Miguel D.
    Vinga, Susana
    Reddel, Roger R.
    Robinson, Phillip J.
    Garnett, Mathew J.
    Zhong, Qing
    Goncalves, Emanuel
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [45] Streamlining multi-omic and artificial intelligence analysis through interrogative biology and baicis for translational precision medicine applications in clinical oncology
    Rodrigues, L.
    Kiebish, M. A.
    Miller, G.
    Zhang, L.
    Vemulapalli, V.
    Vishnudas, V. K.
    Sarangarajan, R.
    Narain, N. R.
    Akmaev, V. R.
    ANNALS OF ONCOLOGY, 2018, 29
  • [46] Risk prediction by Raman spectroscopy for disease-free survival in oral cancers
    Bhattacharjee, Atanu
    Hole, Arti
    Malik, Akshat
    Sahu, Aditi
    Singh, S. P.
    Deshmukh, Atul
    Nair, Sudhir
    Chaturvedi, Pankaj
    Murali Krishna, C.
    LASERS IN MEDICAL SCIENCE, 2021, 36 (08) : 1691 - 1700
  • [47] Multi-omic data integration assisted identification of molecular features contributing to disease heterogeneity in Crohn's disease
    Sudhakar, P.
    Verstockt, B.
    Creyns, B.
    Cremer, J.
    van Assche, G.
    Korcsmaros, T.
    Ferrante, M.
    Vermeire, S.
    JOURNAL OF CROHNS & COLITIS, 2019, 13 : S86 - S86
  • [48] Prediction of Disease-free Survival in Hepatocellular Carcinoma by Gene Expression Profiling
    Lim, Ho-Yeong
    Sohn, Insuk
    Deng, Shibing
    Lee, Jeeyun
    Jung, Sin Ho
    Mao, Mao
    Xu, Jiangchun
    Wang, Kai
    Shi, Stephanie
    Joh, Jae Won
    Choi, Yoon La
    Park, Cheol-Keun
    ANNALS OF SURGICAL ONCOLOGY, 2013, 20 (12) : 3747 - 3753
  • [49] Risk prediction by Raman spectroscopy for disease-free survival in oral cancers
    Atanu Bhattacharjee
    Arti Hole
    Akshat Malik
    Aditi Sahu
    S. P. Singh
    Atul Deshmukh
    Sudhir Nair
    Pankaj Chaturvedi
    C. Murali Krishna
    Lasers in Medical Science, 2021, 36 : 1691 - 1700
  • [50] Data processing, multi-omic pathway mapping, and metabolite activity analysis using XCMS Online
    Forsberg, Erica M.
    Huan, Tao
    Rinehart, Duane
    Benton, H. Paul
    Warth, Benedikt
    Hilmers, Brian
    Siuzdak, Gary
    NATURE PROTOCOLS, 2018, 13 (04) : 633 - 651