A Bayesian Nonparametric Model for Integrative Clustering of Omics Data

被引:0
|
作者
Peneva, Iliana [1 ]
Savage, Richard S. [2 ]
机构
[1] Univ Warwick, Warwick, England
[2] Univ Warwick, Dept Stat, Warwick, England
关键词
Bayesian nonparametrics; Data integration; Glioblastoma; Mixture models; Non-local priors; LATENT VARIABLE MODEL; BREAST; GLIOBLASTOMA;
D O I
10.1007/978-3-030-30611-3_11
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Cancer is a complex disease, driven by a range of genetic and environmental factors. Many integrative clustering methods aim to provide insight into the mechanisms underlying cancer but fewof them are computationally efficient and able to estimate the number of subtypes. We have developed a Bayesian nonparametric model for combined data integration and clustering called BayesCluster, which aims to identify cancer subtypes and addresses many of the issues faced by the existing integrative methods. The proposed method can integrate and use the information from multiple different datasets, and offers better cluster interpretability by using nonlocal priors. We incorporate feature learning because of the large number of predictors, and use a Dirichlet process mixture model approach to produce the patient subgroups. We ensure tractable inference with simulated annealing. We apply the model to datasets from the Cancer Genome Atlas project of glioblastoma multiforme, which contains clinical and biological data about cancer patients with extremely poor prognosis of survival. By combining all available information we are able to be better identify clinically meaningful subtypes of glioblastoma.
引用
收藏
页码:105 / 114
页数:10
相关论文
共 50 条
  • [1] LUCID: An Integrative Clustering Model for Multi Omics Data
    Zhao, Yinqi
    Conti, David V.
    GENETIC EPIDEMIOLOGY, 2022, 46 (07) : 550 - 550
  • [2] A fully Bayesian latent variable model for integrative clustering analysis of multi-type omics data
    Mo, Qianxing
    Shen, Ronglai
    Guo, Cui
    Vannucci, Marina
    Chan, Keith S.
    Hilsenbeck, Susan G.
    BIOSTATISTICS, 2018, 19 (01) : 71 - 86
  • [3] Bayesian integrative model for multi-omics data with missingness
    Fang, Zhou
    Ma, Tianzhou
    Tang, Gong
    Zhu, Li
    Yan, Qi
    Wang, Ting
    Celedon, Juan C.
    Chen, Wei
    Tseng, George C.
    BIOINFORMATICS, 2018, 34 (22) : 3801 - 3808
  • [4] Generalized Bayesian Factor Analysis for Integrative Clustering with Applications to Multi-Omics Data
    Min, Eun Jeong
    Chang, Changgee
    Long, Qi
    2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, : 109 - 119
  • [5] Integrative Clustering Analysis for Omics Data with Missingness
    Zhao, Yinqi
    Darst, Burcu
    Conti, David V.
    GENETIC EPIDEMIOLOGY, 2021, 45 (07) : 806 - 806
  • [6] Bayesian nonparametric clustering for large data sets
    Daiane Aparecida Zuanetti
    Peter Müller
    Yitan Zhu
    Shengjie Yang
    Yuan Ji
    Statistics and Computing, 2019, 29 : 203 - 215
  • [7] Bayesian nonparametric clustering for large data sets
    Zuanetti, Daiane Aparecida
    Mueller, Peter
    Zhu, Yitan
    Yang, Shengjie
    Ji, Yuan
    STATISTICS AND COMPUTING, 2019, 29 (02) : 203 - 215
  • [8] Integrative clustering methods for multi-omics data
    Zhang, Xiaoyu
    Zhou, Zhenwei
    Xu, Hanfei
    Liu, Ching-Ti
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2022, 14 (03)
  • [9] SPARSE INTEGRATIVE CLUSTERING OF MULTIPLE OMICS DATA SETS
    Shen, Ronglai
    Wang, Sijian
    Mo, Qianxing
    ANNALS OF APPLIED STATISTICS, 2013, 7 (01): : 269 - 294
  • [10] Principal Subspace Updation for Integrative Clustering of Multimodal Omics Data
    Khan, Aparajita
    Maji, Pradipta
    2017 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND NETWORKS (CINE), 2017, : 99 - 104