Bayesian Variable Selection in Clustering High-Dimensional Data With Substructure

被引:5
|
作者
Swartz, Michael D. [2 ]
Mo, Qianxing [3 ]
Murphy, Mary E.
Lupton, Joanne R. [4 ]
Turner, Nancy D. [4 ]
Hong, Mee Young [5 ]
Vannucci, Marina [1 ]
机构
[1] Rice Univ, Dept Stat, Houston, TX 77251 USA
[2] Univ Texas MD Anderson Canc Ctr, Dept Epidemiol, Houston, TX 77030 USA
[3] Mem Sloan Kettering Canc Ctr, Dept Epidemiol & Biostat, New York, NY 10021 USA
[4] Texas A&M Univ, Dept Nutr & Food Sci, College Stn, TX 77843 USA
[5] San Diego State Univ, Sch Exercise & Nutr Sci, San Diego, CA 92182 USA
关键词
Bayesian inference; Designed experiments; Microarray analysis;
D O I
10.1198/108571108X378317
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this article we focus on clustering techniques recently proposed for high-dimensional data that incorporate variable selection and extend them to the modeling of data with a known substructure, such as the structure imposed by an experimental design. Our method essentially approximates the within-group covariance by facilitating Clustering without disrupting the groups defined by the experimenter The method we adopt simultaneously determines which expression Patterns are important, and which genes contribute to Such patterns. We evaluate performance on simulated data and on microarray data From a colon carcinogenesis Study. Selected genes are biologically consistent with Current research and provide strong biological validation of the Cluster configuration identified by the method.
引用
收藏
页码:407 / 423
页数:17
相关论文
共 50 条
  • [1] Bayesian variable selection in clustering high-dimensional data with substructure
    Michael D. Swartz
    Qianxing Mo
    Mary E. Murphy
    Joanne R. Lupton
    Nancy D. Turner
    Mee Young Hong
    Marina Vannucci
    [J]. Journal of Agricultural, Biological, and Environmental Statistics, 2008, 13 : 407 - 423
  • [2] Bayesian variable selection in clustering high-dimensional data
    Tadesse, MG
    Sha, N
    Vannucci, M
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (470) : 602 - 617
  • [3] Bayesian variable selection for high-dimensional rank data
    Cui, Can
    Singh, Susheela P.
    Staicu, Ana-Maria
    Reich, Brian J.
    [J]. ENVIRONMETRICS, 2021, 32 (07)
  • [4] Bayesian variable selection in clustering high-dimensional data via a mixture of finite mixtures
    Doo, Woojin
    Kim, Heeyoung
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2021, 91 (12) : 2551 - 2568
  • [5] High-Dimensional Bayesian Clustering with Variable Selection: The R Package bclus
    Nia, Vahid Partovi
    Davison, Anthony C.
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2012, 47 (05): : 1 - 22
  • [6] Scalable Bayesian variable selection for structured high-dimensional data
    Chang, Changgee
    Kundu, Suprateek
    Long, Qi
    [J]. BIOMETRICS, 2018, 74 (04) : 1372 - 1382
  • [7] Sparse Bayesian variable selection for classifying high-dimensional data
    Yang, Aijun
    Lian, Heng
    Jiang, Xuejun
    Liu, Pengfei
    [J]. STATISTICS AND ITS INTERFACE, 2018, 11 (02) : 385 - 395
  • [8] ON THE COMPUTATIONAL COMPLEXITY OF HIGH-DIMENSIONAL BAYESIAN VARIABLE SELECTION
    Yang, Yun
    Wainwright, Martin J.
    Jordan, Michael I.
    [J]. ANNALS OF STATISTICS, 2016, 44 (06): : 2497 - 2532
  • [9] Bayesian variable selection with sparse and correlation priors for high-dimensional data analysis
    Aijun Yang
    Xuejun Jiang
    Lianjie Shu
    Jinguan Lin
    [J]. Computational Statistics, 2017, 32 : 127 - 143
  • [10] Bayesian variable selection with sparse and correlation priors for high-dimensional data analysis
    Yang, Aijun
    Jiang, Xuejun
    Shu, Lianjie
    Lin, Jinguan
    [J]. COMPUTATIONAL STATISTICS, 2017, 32 (01) : 127 - 143