Bayesian regression based on principal components for high-dimensional data

被引：7

作者：

Lee, Jaeyong ^{[1
]}

Oh, Hee-Seok ^{[1
]}

机构：

[1] Seoul Natl Univ, Seoul 151, South Korea

来源：

JOURNAL OF MULTIVARIATE ANALYSIS | 2013年 / 117卷

基金：

新加坡国家研究基金会;

关键词：

D O I：

10.1016/j.jmva.2013.02.002

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

The Gaussian sequence model can be obtained from the high-dimensional regression model through principal component analysis. It is shown that the Gaussian sequence model is equivalent to the original high-dimensional regression model in terms of prediction. Under a sparsity condition, we investigate the posterior consistency and convergence rates of the Gaussian sequence model. In particular, we examine two different modeling strategies: Bayesian inference with and without covariate selection. For Bayesian inferences without covariate selection, we obtain the consistency results of the estimators and posteriors with normal priors with constant and decreasing variances, and the James Stein estimator; for Bayesian inference with covariate selection, we obtain convergence rates of Bayesian model averaging (BMA) and median probability model (MPM) estimators, and the posterior with variable selection prior. Based on these results, we conclude that variable selection is essential in high-dimensional Bayesian regression. A simulation study also confirms the conclusion. The methodologies are applied to a climate prediction problem. (C) 2013 Elsevier Inc. All rights reserved.

引用

页码：175 / 192

页数：18

共 50 条

[1] Using principal components for estimating logistic regression with high-dimensional multicollinear data
Aguilera, AM
Escabias, M
Valderrama, MJ
[J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 50 (08) : 1905 - 1924
[2] Adaptive Bayesian density regression for high-dimensional data
Shen, Weining
Ghosal, Subhashis
[J]. BERNOULLI, 2016, 22 (01) : 396 - 420
[3] Using principal components to test normality of high-dimensional data
Mansoor, Rashid
[J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2017, 46 (05) : 3396 - 3405
[4] Bayesian Function-on-Scalars Regression for High-Dimensional Data
Kowal, Daniel R.
Bourgeois, Daniel C.
[J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2020, 29 (03) : 629 - 638
[5] A ridge penalized principal-components approach based on heritability for high-dimensional data
Wang, Yuanjia
Fang, Yixin
Jin, Man
[J]. HUMAN HEREDITY, 2007, 64 (03) : 182 - 191
[6] Bayesian Dynamic Feature Partitioning in High-Dimensional Regression With Big Data
Gutierrez, Rene
Guhaniyogi, Rajarshi
[J]. TECHNOMETRICS, 2022, 64 (02) : 224 - 240
[7] A nonparametric Bayesian technique for high-dimensional regression
Guha, Subharup
Baladandayuthapani, Veerabhadran
[J]. ELECTRONIC JOURNAL OF STATISTICS, 2016, 10 (02): : 3374 - 3424
[8] Obtaining insights from high-dimensional data: sparse principal covariates regression
Katrijn Van Deun
Elise A. V. Crompvoets
Eva Ceulemans
[J]. BMC Bioinformatics, 19
[9] Obtaining insights from high-dimensional data: sparse principal covariates regression
Van Deun, Katrijn
Crompvoets, Elise A. V.
Ceulemans, Eva
[J]. BMC BIOINFORMATICS, 2018, 19
[10] New approach to Bayesian high-dimensional linear regression
Jalali, Shirin
Maleki, Arian
[J]. INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2018, 7 (04) : 605 - 655

← 1 2 3 4 5 →