Empirical Priors for Prediction in Sparse High-dimensional Linear Regression

被引：0

作者：

Martin, Ryan ^{[1
]}

Tang, Yiqi ^{[1
]}

机构：

[1] North Carolina State Univ, Dept Stat, 2311 Stinson Dr, Raleigh, NC 27695 USA

来源：

JOURNAL OF MACHINE LEARNING RESEARCH | 2020年 / 21卷

基金：

美国国家科学基金会;

关键词：

Bayesian inference; data-dependent prior; model averaging; predictive distribution; uncertainty quantification; POSTERIOR CONCENTRATION; VARIABLE SELECTION; HORSESHOE ESTIMATOR; CONVERGENCE-RATES; MODEL SELECTION; LIKELIHOOD; INFERENCE; LASSO;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper we adopt the familiar sparse, high-dimensional linear regression model and focus on the important but often overlooked task of prediction. In particular, we consider a new empirical Bayes framework that incorporates data in the prior in two ways: one is to center the prior for the non-zero regression coefficients and the other is to provide some additional regularization. We show that, in certain settings, the asymptotic concentration of the proposed empirical Bayes posterior predictive distribution is very fast, and we establish a Bernstein-von Mises theorem which ensures that the derived empirical Bayes prediction intervals achieve the targeted frequentist coverage probability. The empirical prior has a convenient conjugate form, so posterior computations are relatively simple and fast. Finally, our numerical results demonstrate the proposed method's strong finite-sample performance in terms of prediction accuracy, uncertainty quantification, and computation time compared to existing Bayesian methods.

引用

页数：30

共 50 条

[21] Empirical Bayes posterior concentration in sparse high-dimensional linear models
Martin, Ryan
Mess, Raymond
Walker, Stephen G.
BERNOULLI, 2017, 23 (03) : 1822 - 1847
[22] High-Dimensional Sparse Linear Bandits
Hao, Botao
Lattimore, Tor
Wang, Mengdi
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[23] High-Dimensional Classification by Sparse Logistic Regression
Abramovich, Felix
Grinshtein, Vadim
IEEE TRANSACTIONS ON INFORMATION THEORY, 2019, 65 (05) : 3068 - 3079
[24] High-Dimensional Sparse Additive Hazards Regression
Lin, Wei
Lv, Jinchi
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (501) : 247 - 264
[25] An Additive Sparse Penalty for Variable Selection in High-Dimensional Linear Regression Model
Lee, Sangin
COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2015, 22 (02) : 147 - 157
[26] Estimation of linear projections of non-sparse coefficients in high-dimensional regression
Azriel, David
Schwartzman, Armin
ELECTRONIC JOURNAL OF STATISTICS, 2020, 14 (01): : 174 - 206
[27] Prediction intervals, factor analysis models, and high-dimensional empirical linear prediction
Ding, AA
Hwang, JTG
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (446) : 446 - 455
[28] BAYESIAN LINEAR REGRESSION WITH SPARSE PRIORS
Castillo, Ismael
Schmidt-Hieber, Johannes
Van der Vaart, Aad
ANNALS OF STATISTICS, 2015, 43 (05): : 1986 - 2018
[29] A STEPWISE REGRESSION METHOD AND CONSISTENT MODEL SELECTION FOR HIGH-DIMENSIONAL SPARSE LINEAR MODELS
Ing, Ching-Kang
Lai, Tze Leung
STATISTICA SINICA, 2011, 21 (04) : 1473 - 1513
[30] RELATIVE COST BASED MODEL SELECTION FOR SPARSE HIGH-DIMENSIONAL LINEAR REGRESSION MODELS
Gohain, Prakash B.
Jansson, Magnus
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 5515 - 5519

← 1 2 3 4 5 →