ON THE COMPUTATIONAL COMPLEXITY OF HIGH-DIMENSIONAL BAYESIAN VARIABLE SELECTION

被引:77
|
作者
Yang, Yun [1 ]
Wainwright, Martin J. [2 ]
Jordan, Michael I. [2 ]
机构
[1] Univ Calif Berkeley, Dept EECS, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept EECS & Stat, Berkeley, CA 94720 USA
来源
ANNALS OF STATISTICS | 2016年 / 44卷 / 06期
基金
美国国家科学基金会;
关键词
High-dimensional inference; Bayesian variable selection; Markov chain; spectral gap; rapid mixing; SPARSITY RECOVERY; REGRESSION; PRIORS; LIKELIHOOD;
D O I
10.1214/15-AOS1417
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We study the computational complexity of Markov chain Monte Carlo (MCMC) methods for high-dimensional Bayesian linear regression under sparsity constraints. We first show that a Bayesian approach can achieve variable-selection consistency under relatively mild conditions on the design matrix. We then demonstrate that the statistical criterion of posterior concentration need not imply the computational desideratum of rapid mixing of the MCMC algorithm. By introducing a truncated sparsity prior for variable selection, we provide a set of conditions that guarantee both variable-selection consistency and rapid mixing of a particular Metropolis Hastings algorithm. The mixing time is linear in the number of covariates up to a logarithmic factor. Our proof controls the spectral gap of the Markov chain by constructing a canonical path ensemble that is inspired by the steps taken by greedy algorithms for variable selection.
引用
收藏
页码:2497 / 2532
页数:36
相关论文
共 50 条
  • [1] Bayesian variable selection in clustering high-dimensional data
    Tadesse, MG
    Sha, N
    Vannucci, M
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (470) : 602 - 617
  • [2] Bayesian variable selection for high-dimensional rank data
    Cui, Can
    Singh, Susheela P.
    Staicu, Ana-Maria
    Reich, Brian J.
    [J]. ENVIRONMETRICS, 2021, 32 (07)
  • [3] Bayesian Variable Selection in Clustering High-Dimensional Data With Substructure
    Swartz, Michael D.
    Mo, Qianxing
    Murphy, Mary E.
    Lupton, Joanne R.
    Turner, Nancy D.
    Hong, Mee Young
    Vannucci, Marina
    [J]. JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2008, 13 (04) : 407 - 423
  • [4] Sparse Bayesian variable selection for classifying high-dimensional data
    Yang, Aijun
    Lian, Heng
    Jiang, Xuejun
    Liu, Pengfei
    [J]. STATISTICS AND ITS INTERFACE, 2018, 11 (02) : 385 - 395
  • [5] Bayesian Regression Trees for High-Dimensional Prediction and Variable Selection
    Linero, Antonio R.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (522) : 626 - 636
  • [6] Bayesian variable selection in clustering high-dimensional data with substructure
    Michael D. Swartz
    Qianxing Mo
    Mary E. Murphy
    Joanne R. Lupton
    Nancy D. Turner
    Mee Young Hong
    Marina Vannucci
    [J]. Journal of Agricultural, Biological, and Environmental Statistics, 2008, 13 : 407 - 423
  • [7] Scalable Bayesian variable selection for structured high-dimensional data
    Chang, Changgee
    Kundu, Suprateek
    Long, Qi
    [J]. BIOMETRICS, 2018, 74 (04) : 1372 - 1382
  • [8] Variance Prior Forms for High-Dimensional Bayesian Variable Selection
    Moran, Gemma E.
    Rockova, Veronika
    George, Edward I.
    [J]. BAYESIAN ANALYSIS, 2019, 14 (04): : 1091 - 1119
  • [9] HIGH-DIMENSIONAL VARIABLE SELECTION
    Wasserman, Larry
    Roeder, Kathryn
    [J]. ANNALS OF STATISTICS, 2009, 37 (5A): : 2178 - 2201
  • [10] Dimension-free mixing for high-dimensional Bayesian variable selection
    Zhou, Quan
    Yang, Jun
    Vats, Dootika
    Roberts, Gareth O.
    Rosenthal, Jeffrey S.
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2022, 84 (05) : 1751 - 1784