Delete or merge regressors for linear model selection

被引:5
|
作者
Maj-Kanska, Aleksandra [1 ]
Pokarowski, Piotr [2 ]
Prochenka, Agnieszka [1 ]
机构
[1] Polish Acad Sci, Inst Comp Sci, PL-01248 Warsaw, Poland
[2] Univ Warsaw, Fac Math Informat & Mech, PL-02097 Warsaw, Poland
来源
ELECTRONIC JOURNAL OF STATISTICS | 2015年 / 9卷 / 02期
关键词
ANOVA; consistency; BIC; merging levels; t-statistic; variable selection; INFORMATION CRITERIA; REGULARIZATION;
D O I
10.1214/15-EJS1050
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider a problem of linear model selection in the presence of both continuous and categorical predictors. Feasible models consist of subsets of numerical variables and partitions of levels of factors. A new algorithm called delete or merge regressors (DMR) is presented which is a stepwise backward procedure involving ranking the predictors according to squared t-statistics and choosing the final model minimizing BIC. We prove consistency of DMR when the number of predictors tends to infinity with the sample size and describe a simulation study using a pertaining R package. The results indicate significant advantage in time complexity and selection accuracy of our algorithm over Lasso-based methods described in the literature. Moreover, a version of DMR for generalized linear models is proposed.
引用
收藏
页码:1749 / 1778
页数:30
相关论文
共 50 条