Empirical Priors for Prediction in Sparse High-dimensional Linear Regression

被引:0
|
作者
Martin, Ryan [1 ]
Tang, Yiqi [1 ]
机构
[1] North Carolina State Univ, Dept Stat, 2311 Stinson Dr, Raleigh, NC 27695 USA
基金
美国国家科学基金会;
关键词
Bayesian inference; data-dependent prior; model averaging; predictive distribution; uncertainty quantification; POSTERIOR CONCENTRATION; VARIABLE SELECTION; HORSESHOE ESTIMATOR; CONVERGENCE-RATES; MODEL SELECTION; LIKELIHOOD; INFERENCE; LASSO;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we adopt the familiar sparse, high-dimensional linear regression model and focus on the important but often overlooked task of prediction. In particular, we consider a new empirical Bayes framework that incorporates data in the prior in two ways: one is to center the prior for the non-zero regression coefficients and the other is to provide some additional regularization. We show that, in certain settings, the asymptotic concentration of the proposed empirical Bayes posterior predictive distribution is very fast, and we establish a Bernstein-von Mises theorem which ensures that the derived empirical Bayes prediction intervals achieve the targeted frequentist coverage probability. The empirical prior has a convenient conjugate form, so posterior computations are relatively simple and fast. Finally, our numerical results demonstrate the proposed method's strong finite-sample performance in terms of prediction accuracy, uncertainty quantification, and computation time compared to existing Bayesian methods.
引用
收藏
页数:30
相关论文
共 50 条
  • [21] Empirical Bayes posterior concentration in sparse high-dimensional linear models
    Martin, Ryan
    Mess, Raymond
    Walker, Stephen G.
    BERNOULLI, 2017, 23 (03) : 1822 - 1847
  • [22] High-Dimensional Sparse Linear Bandits
    Hao, Botao
    Lattimore, Tor
    Wang, Mengdi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [23] High-Dimensional Classification by Sparse Logistic Regression
    Abramovich, Felix
    Grinshtein, Vadim
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2019, 65 (05) : 3068 - 3079
  • [24] High-Dimensional Sparse Additive Hazards Regression
    Lin, Wei
    Lv, Jinchi
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (501) : 247 - 264
  • [25] An Additive Sparse Penalty for Variable Selection in High-Dimensional Linear Regression Model
    Lee, Sangin
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2015, 22 (02) : 147 - 157
  • [26] Estimation of linear projections of non-sparse coefficients in high-dimensional regression
    Azriel, David
    Schwartzman, Armin
    ELECTRONIC JOURNAL OF STATISTICS, 2020, 14 (01): : 174 - 206
  • [27] Prediction intervals, factor analysis models, and high-dimensional empirical linear prediction
    Ding, AA
    Hwang, JTG
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (446) : 446 - 455
  • [28] BAYESIAN LINEAR REGRESSION WITH SPARSE PRIORS
    Castillo, Ismael
    Schmidt-Hieber, Johannes
    Van der Vaart, Aad
    ANNALS OF STATISTICS, 2015, 43 (05): : 1986 - 2018
  • [29] A STEPWISE REGRESSION METHOD AND CONSISTENT MODEL SELECTION FOR HIGH-DIMENSIONAL SPARSE LINEAR MODELS
    Ing, Ching-Kang
    Lai, Tze Leung
    STATISTICA SINICA, 2011, 21 (04) : 1473 - 1513
  • [30] RELATIVE COST BASED MODEL SELECTION FOR SPARSE HIGH-DIMENSIONAL LINEAR REGRESSION MODELS
    Gohain, Prakash B.
    Jansson, Magnus
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 5515 - 5519