A Bayesian Nonparametric Model for Integrative Clustering of Omics Data

Authors
Peneva, Iliana [1]
Savage, Richard S. [2]
Affiliations
[1] Univ Warwick, Warwick, England
[2] Univ Warwick, Dept Stat, Warwick, England
Keywords
Bayesian nonparametrics; Data integration; Glioblastoma; Mixture models; Non-local priors; LATENT VARIABLE MODEL; BREAST; GLIOBLASTOMA;
DOI
10.1007/978-3-030-30611-3_11
Chinese Library Classification
O29 [Applied Mathematics];
Subject classification code
070104;
Abstract
Cancer is a complex disease, driven by a range of genetic and environmental factors. Many integrative clustering methods aim to provide insight into the mechanisms underlying cancer, but few of them are computationally efficient and able to estimate the number of subtypes. We have developed a Bayesian nonparametric model for combined data integration and clustering called BayesCluster, which aims to identify cancer subtypes and addresses many of the issues faced by existing integrative methods. The proposed method can integrate and use the information from multiple different datasets, and offers better cluster interpretability by using nonlocal priors. We incorporate feature learning because of the large number of predictors, and use a Dirichlet process mixture model approach to produce the patient subgroups. We ensure tractable inference with simulated annealing. We apply the model to datasets from the Cancer Genome Atlas project of glioblastoma multiforme, which contain clinical and biological data about cancer patients with extremely poor prognosis of survival. By combining all available information we are better able to identify clinically meaningful subtypes of glioblastoma.
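As an illustration of why a Dirichlet process mixture does not require the number of subtypes to be fixed in advance, the following minimal Python sketch draws a partition from a Chinese restaurant process, the clustering prior implied by a Dirichlet process. It is not the BayesCluster implementation: the function name `sample_crp_partition` and the parameters `alpha` and `seed` are assumptions made for this example, and the sketch covers only the prior over partitions, not data integration, nonlocal priors, feature learning, or simulated annealing.

```python
import random


def sample_crp_partition(n_items, alpha, seed=0):
    """Draw cluster labels for n_items from a Chinese restaurant process.

    alpha is the (hypothetical, illustrative) concentration parameter:
    larger values make opening a new cluster more likely.
    """
    rng = random.Random(seed)
    labels = []
    counts = []  # counts[k] = number of items currently assigned to cluster k
    for _ in range(n_items):
        # An existing cluster k is chosen with weight counts[k];
        # a brand-new cluster is chosen with weight alpha.
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)  # open a new cluster
        else:
            counts[k] += 1    # join an existing cluster
        labels.append(k)
    return labels, counts


if __name__ == "__main__":
    labels, counts = sample_crp_partition(n_items=50, alpha=1.0, seed=1)
    print("number of clusters:", len(counts))
    print("cluster sizes:", counts)
```

In this sketch the number of clusters is an outcome of the sampling rather than an input, which is the property the abstract relies on when it says the model can estimate the number of subtypes.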
Pages: 105-114
Number of pages: 10