Bayesian variable selection in clustering high-dimensional data with substructure

被引:0
|
作者
Michael D. Swartz
Qianxing Mo
Mary E. Murphy
Joanne R. Lupton
Nancy D. Turner
Mee Young Hong
Marina Vannucci
机构
[1] M.D. Anderson Cancer Center,Department of Epidemiology
[2] Memorial Sloan-Kettering Cancer Center,Department of Epidemiology and Biostatistics
[3] Texas A&M University,Nutrition and Food Science Department
[4] Texas A&M University,School of Exercise and Nutritional Sciences
[5] San Diego State University,Department of Statistics
[6] Rice University,undefined
关键词
Bayesian inference; Designed experiments; Microarray analysis;
D O I
暂无
中图分类号
学科分类号
摘要
In this article we focus on clustering techniques recently proposed for high-dimensional data that incorporate variable selection and extend them to the modeling of data with a known substructure, such as the structure imposed by an experimental design. Our method essentially approximates the within-group covariance by facilitating clustering without disrupting the groups defined by the experimenter. The method we adopt simultaneously determines which expression patterns are important, and which genes contribute to such patterns. We evaluate performance on simulated data and on microarray data from a colon carcinogenesis study. Selected genes are biologically consistent with current research and provide strong biological validation of the cluster configuration identified by the method.
引用
收藏
页码:407 / 423
页数:16
相关论文
共 50 条
  • [1] Bayesian Variable Selection in Clustering High-Dimensional Data With Substructure
    Swartz, Michael D.
    Mo, Qianxing
    Murphy, Mary E.
    Lupton, Joanne R.
    Turner, Nancy D.
    Hong, Mee Young
    Vannucci, Marina
    [J]. JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2008, 13 (04) : 407 - 423
  • [2] Bayesian variable selection in clustering high-dimensional data
    Tadesse, MG
    Sha, N
    Vannucci, M
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (470) : 602 - 617
  • [3] Bayesian variable selection for high-dimensional rank data
    Cui, Can
    Singh, Susheela P.
    Staicu, Ana-Maria
    Reich, Brian J.
    [J]. ENVIRONMETRICS, 2021, 32 (07)
  • [4] Bayesian variable selection in clustering high-dimensional data via a mixture of finite mixtures
    Doo, Woojin
    Kim, Heeyoung
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2021, 91 (12) : 2551 - 2568
  • [5] High-Dimensional Bayesian Clustering with Variable Selection: The R Package bclus
    Nia, Vahid Partovi
    Davison, Anthony C.
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2012, 47 (05): : 1 - 22
  • [6] Scalable Bayesian variable selection for structured high-dimensional data
    Chang, Changgee
    Kundu, Suprateek
    Long, Qi
    [J]. BIOMETRICS, 2018, 74 (04) : 1372 - 1382
  • [7] Sparse Bayesian variable selection for classifying high-dimensional data
    Yang, Aijun
    Lian, Heng
    Jiang, Xuejun
    Liu, Pengfei
    [J]. STATISTICS AND ITS INTERFACE, 2018, 11 (02) : 385 - 395
  • [8] ON THE COMPUTATIONAL COMPLEXITY OF HIGH-DIMENSIONAL BAYESIAN VARIABLE SELECTION
    Yang, Yun
    Wainwright, Martin J.
    Jordan, Michael I.
    [J]. ANNALS OF STATISTICS, 2016, 44 (06): : 2497 - 2532
  • [9] Bayesian variable selection with sparse and correlation priors for high-dimensional data analysis
    Yang, Aijun
    Jiang, Xuejun
    Shu, Lianjie
    Lin, Jinguan
    [J]. COMPUTATIONAL STATISTICS, 2017, 32 (01) : 127 - 143
  • [10] Bayesian variable selection with sparse and correlation priors for high-dimensional data analysis
    Aijun Yang
    Xuejun Jiang
    Lianjie Shu
    Jinguan Lin
    [J]. Computational Statistics, 2017, 32 : 127 - 143