A Bayesian Nonparametric Model for Integrative Clustering of Omics Data

被引:0
|
作者
Peneva, Iliana [1 ]
Savage, Richard S. [2 ]
机构
[1] Univ Warwick, Warwick, England
[2] Univ Warwick, Dept Stat, Warwick, England
关键词
Bayesian nonparametrics; Data integration; Glioblastoma; Mixture models; Non-local priors; LATENT VARIABLE MODEL; BREAST; GLIOBLASTOMA;
D O I
10.1007/978-3-030-30611-3_11
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Cancer is a complex disease, driven by a range of genetic and environmental factors. Many integrative clustering methods aim to provide insight into the mechanisms underlying cancer but fewof them are computationally efficient and able to estimate the number of subtypes. We have developed a Bayesian nonparametric model for combined data integration and clustering called BayesCluster, which aims to identify cancer subtypes and addresses many of the issues faced by the existing integrative methods. The proposed method can integrate and use the information from multiple different datasets, and offers better cluster interpretability by using nonlocal priors. We incorporate feature learning because of the large number of predictors, and use a Dirichlet process mixture model approach to produce the patient subgroups. We ensure tractable inference with simulated annealing. We apply the model to datasets from the Cancer Genome Atlas project of glioblastoma multiforme, which contains clinical and biological data about cancer patients with extremely poor prognosis of survival. By combining all available information we are able to be better identify clinically meaningful subtypes of glioblastoma.
引用
收藏
页码:105 / 114
页数:10
相关论文
共 50 条
  • [21] Bayesian nonparametric latent class model for longitudinal data
    Koo, Wonmo
    Kim, Heeyoung
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2020, 29 (11) : 3381 - 3395
  • [22] A Nonparametric Bayesian Poisson Gamma Model for Count Data
    Gupta, Sunil Kumar
    Dinh Phung
    Venkatesh, Svetha
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 1815 - 1818
  • [23] Integrative analysis of omics data
    不详
    METHODS, 2017, 124 : 1 - 2
  • [24] Integrative biclustering of heterogeneous datasets using a Bayesian nonparametric model with application to chemogenomics
    Li, Dazhuo
    Rouchka, Eric C.
    BMC BIOINFORMATICS, 2011, 12
  • [25] Integrative biclustering of heterogeneous datasets using a Bayesian nonparametric model with application to chemogenomics
    Dazhuo Li
    Eric C Rouchka
    BMC Bioinformatics, 12
  • [26] A Differentially Private Big Data Nonparametric Bayesian Clustering Algorithm in Smart Grid
    Guan, Zhitao
    Lv, Zefang
    Sun, Xianwen
    Wu, Longfei
    Wu, Jun
    Du, Xiaojiang
    Guizani, Mohsen
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (04): : 2631 - 2641
  • [27] Integrative phenotyping framework (iPF): integrative clustering of multiple omics data identifies novel lung disease subphenotypes
    Kim, SungHwan
    Herazo-Maya, Jose D.
    Kang, Dongwan D.
    Juan-Guardela, Brenda M.
    Tedrow, John
    Martinez, Fernando J.
    Sciurba, Frank C.
    Tseng, George C.
    Kaminski, Naftali
    BMC GENOMICS, 2015, 16
  • [28] Nonparametric Bayesian Bi-Clustering for Next Generation Sequencing Count Data
    Xu, Yanxun
    Lee, Juhee
    Yuan, Yuan
    Mitra, Riten
    Liang, Shoudan
    Mueller, Peter
    Ji, Yuan
    BAYESIAN ANALYSIS, 2013, 8 (04): : 759 - 780
  • [29] Integrative phenotyping framework (iPF): integrative clustering of multiple omics data identifies novel lung disease subphenotypes
    SungHwan Kim
    Jose D. Herazo-Maya
    Dongwan D. Kang
    Brenda M. Juan-Guardela
    John Tedrow
    Fernando J. Martinez
    Frank C. Sciurba
    George C. Tseng
    Naftali Kaminski
    BMC Genomics, 16
  • [30] A spatio-temporal nonparametric Bayesian variable selection model of fMRI data for clustering correlated time courses
    Zhang, Linlin
    Guindani, Michele
    Versace, Francesco
    Vannucci, Marina
    NEUROIMAGE, 2014, 95 : 162 - 175