A stepwise regression algorithm for high-dimensional variable selection

被引:21
|
作者
Hwang, Jing-Shiang [1 ]
Hu, Tsuey-Hwa [1 ]
机构
[1] Acad Sinica, Inst Stat Sci, Taipei 11529, Taiwan
关键词
lasso; forward regression; minimum p-value; genome-wide association study; high-dimensional data;
D O I
10.1080/00949655.2014.902460
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We propose a new stepwise regression algorithm with a simple stopping rule for the identification of influential predictors and interactions among a huge number of variables in various statistical models. Like conventional stepwise regression, at each forward selection step, a variable is included in the current model if the test statistic of the enlarged model with the predictor against the current model has the minimum [GRAPHICS] -value among all the candidates and is smaller than a predetermined threshold. Instead of using conventional information types of criteria, the threshold is determined by a lower percentile of the beta distribution. We conducted extensive simulation studies to evaluate the performance of the proposed algorithm for various statistical models and found it to be very competitive and robust compared with several popular high-dimensional variable selection methods.
引用
收藏
页码:1793 / 1806
页数:14
相关论文
共 50 条