Bayesian regression based on principal components for high-dimensional data

被引:7
|
作者
Lee, Jaeyong [1 ]
Oh, Hee-Seok [1 ]
机构
[1] Seoul Natl Univ, Seoul 151, South Korea
基金
新加坡国家研究基金会;
关键词
D O I
10.1016/j.jmva.2013.02.002
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The Gaussian sequence model can be obtained from the high-dimensional regression model through principal component analysis. It is shown that the Gaussian sequence model is equivalent to the original high-dimensional regression model in terms of prediction. Under a sparsity condition, we investigate the posterior consistency and convergence rates of the Gaussian sequence model. In particular, we examine two different modeling strategies: Bayesian inference with and without covariate selection. For Bayesian inferences without covariate selection, we obtain the consistency results of the estimators and posteriors with normal priors with constant and decreasing variances, and the James Stein estimator; for Bayesian inference with covariate selection, we obtain convergence rates of Bayesian model averaging (BMA) and median probability model (MPM) estimators, and the posterior with variable selection prior. Based on these results, we conclude that variable selection is essential in high-dimensional Bayesian regression. A simulation study also confirms the conclusion. The methodologies are applied to a climate prediction problem. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:175 / 192
页数:18
相关论文
共 50 条