Delete or merge regressors for linear model selection

被引:5
|
作者
Maj-Kanska, Aleksandra [1 ]
Pokarowski, Piotr [2 ]
Prochenka, Agnieszka [1 ]
机构
[1] Polish Acad Sci, Inst Comp Sci, PL-01248 Warsaw, Poland
[2] Univ Warsaw, Fac Math Informat & Mech, PL-02097 Warsaw, Poland
来源
ELECTRONIC JOURNAL OF STATISTICS | 2015年 / 9卷 / 02期
关键词
ANOVA; consistency; BIC; merging levels; t-statistic; variable selection; INFORMATION CRITERIA; REGULARIZATION;
D O I
10.1214/15-EJS1050
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider a problem of linear model selection in the presence of both continuous and categorical predictors. Feasible models consist of subsets of numerical variables and partitions of levels of factors. A new algorithm called delete or merge regressors (DMR) is presented which is a stepwise backward procedure involving ranking the predictors according to squared t-statistics and choosing the final model minimizing BIC. We prove consistency of DMR when the number of predictors tends to infinity with the sample size and describe a simulation study using a pertaining R package. The results indicate significant advantage in time complexity and selection accuracy of our algorithm over Lasso-based methods described in the literature. Moreover, a version of DMR for generalized linear models is proposed.
引用
收藏
页码:1749 / 1778
页数:30
相关论文
共 50 条
  • [21] Prediction in linear index models with endogenous regressors
    Skeels, Christopher L.
    Taylor, Larry W.
    STATA JOURNAL, 2015, 15 (03): : 627 - 644
  • [23] Measuring the information content of regressors in the linear model using PROC REG* and SAS IML*
    Retzer, J
    Pflughoeft, K
    PROCEEDINGS OF THE TWENTY-SECOND ANNUAL SAS USERS GROUP INTERNATIONAL CONFERENCE, 1997, : 1347 - 1352
  • [24] Adaptive estimation in the linear random coefficients model when regressors have limited variation
    Gaillac, Christophe
    Gautier, Eric
    BERNOULLI, 2022, 28 (01) : 504 - 524
  • [25] PMSE performance of the biased estimators in a linear regression model when relevant regressors are omitted
    Namba, A
    ECONOMETRIC THEORY, 2002, 18 (05) : 1086 - 1098
  • [26] Criteria for Selection of Stochastic Linear Model Selection
    Mahaboob, B.
    Praveen, J. Peter
    Donthi, Ranadheer
    Prasad, S. Vijay
    Venkateswarlu, B.
    RECENT TRENDS IN PURE AND APPLIED MATHEMATICS, 2019, 2177
  • [27] liureg: A Comprehensive R Package for the Liu Estimation of Linear Regression Model with Collinear Regressors
    Imdadullah, Muhammad
    Aslam, Muhammad
    Altaf, Saima
    R JOURNAL, 2017, 9 (02): : 232 - 247
  • [28] Model Selection by Linear Programming
    Wang, Joseph
    Bolukbasi, Tolga
    Trapeznikov, Kirill
    Saligrama, Venkatesh
    COMPUTER VISION - ECCV 2014, PT II, 2014, 8690 : 647 - 662
  • [29] Consistent linear model selection
    Zhao, M
    Kulasekera, KB
    STATISTICS & PROBABILITY LETTERS, 2006, 76 (05) : 520 - 530
  • [30] Model selection in linear regression
    Kundu, D
    Murali, G
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1996, 22 (05) : 461 - 469