Delete or merge regressors for linear model selection

被引:5
|
作者
Maj-Kanska, Aleksandra [1 ]
Pokarowski, Piotr [2 ]
Prochenka, Agnieszka [1 ]
机构
[1] Polish Acad Sci, Inst Comp Sci, PL-01248 Warsaw, Poland
[2] Univ Warsaw, Fac Math Informat & Mech, PL-02097 Warsaw, Poland
来源
ELECTRONIC JOURNAL OF STATISTICS | 2015年 / 9卷 / 02期
关键词
ANOVA; consistency; BIC; merging levels; t-statistic; variable selection; INFORMATION CRITERIA; REGULARIZATION;
D O I
10.1214/15-EJS1050
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider a problem of linear model selection in the presence of both continuous and categorical predictors. Feasible models consist of subsets of numerical variables and partitions of levels of factors. A new algorithm called delete or merge regressors (DMR) is presented which is a stepwise backward procedure involving ranking the predictors according to squared t-statistics and choosing the final model minimizing BIC. We prove consistency of DMR when the number of predictors tends to infinity with the sample size and describe a simulation study using a pertaining R package. The results indicate significant advantage in time complexity and selection accuracy of our algorithm over Lasso-based methods described in the literature. Moreover, a version of DMR for generalized linear models is proposed.
引用
收藏
页码:1749 / 1778
页数:30
相关论文
共 50 条
  • [41] Anytime Model Selection in Linear Bandits
    Kassraie, Parnian
    Emmenegger, Nicolas
    Krause, Andreas
    Pacchiano, Aldo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [42] No arbitrage and a linear portfolio selection model
    Bruni, Renato
    Cesarone, Francesco
    Scozzari, Andrea
    Tardella, Fabio
    ECONOMICS BULLETIN, 2013, 33 (02): : 1247 - 1258
  • [43] Nonnested linear model selection revisited
    Watnik, M
    Johnson, W
    Bedrick, EJ
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2001, 30 (01) : 1 - 20
  • [44] On Stochastic Linear Regression Model Selection
    Praveen, J. Peter
    Mahaboob, B.
    Donthi, Ranadheer
    Prasad, S. Vijay
    Venkateswarlu, B.
    RECENT TRENDS IN PURE AND APPLIED MATHEMATICS, 2019, 2177
  • [45] An asymptotic theory for linear model selection
    Shao, J
    STATISTICA SINICA, 1997, 7 (02) : 221 - 242
  • [46] THE LINEAR UTILITY MODEL FOR OPTIMAL SELECTION
    MELLENBERGH, GJ
    VANDERLINDEN, WJ
    PSYCHOMETRIKA, 1981, 46 (03) : 283 - 293
  • [47] PARTIALLY LINEAR MODEL SELECTION BY THE BOOTSTRAP
    Mueller, Samuel
    Vial, Celine
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2009, 51 (02) : 183 - 200
  • [48] Model Selection in Linear Mixed Models
    Mueller, Samuel
    Scealy, J. L.
    Welsh, A. H.
    STATISTICAL SCIENCE, 2013, 28 (02) : 135 - 167
  • [49] Nonprobabilistic approach for linear model selection
    Wang, Shuning
    Dai, Jianshe
    International Journal of Modelling and Simulation, 1999, 19 (02): : 113 - 117
  • [50] ESTIMATING LINEAR-MODELS WITH ORDINAL QUALITATIVE REGRESSORS
    TERZA, JV
    JOURNAL OF ECONOMETRICS, 1987, 34 (03) : 275 - 291