Bootstrap model selection

被引:164
|
作者
Shao, J
机构
关键词
autoregressive time series; bootstrap sample size; generalized linear model; nonlinear regression; prediction error;
D O I
10.2307/2291661
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In a regression problem, typically there are p explanatory variables possibly related to a response variable, and we wish to select a subset of the p explanatory variables to fit a model between these variables and the response. A bootstrap variable/model selection procedure is to select the subset of variables by minimizing bootstrap estimates of the prediction error, where the bootstrap estimates are constructed based on a data set of size n. Although the bootstrap estimates have good properties, this bootstrap selection procedure is inconsistent in the sense that the probability of selecting the optimal subset of variables does not converge to 1 as n --> infinity. This inconsistency can be rectified by modifying the sampling method used in drawing bootstrap observations. For bootstrapping pairs (response, explanatory variable), it is found that instead of drawing n bootstrap observations (a customary bootstrap sampling plan), much less bootstrap observations should be sampled: The bootstrap selection procedure becomes consistent if we draw m bootstrap observations with m --> infinity and m/n --> 0. For bootstrapping residuals, we modify the bootstrap sampling procedure by increasing the variability among the bootstrap observations. The consistency of the modified bootstrap selection procedures is established in various situations, including linear models, nonlinear models, generalized linear models, and autoregressive time series. The choice of the bootstrap sample size m and some computational issues are also discussed. Some empirical results are presented.
引用
收藏
页码:655 / 665
页数:11
相关论文
共 50 条
  • [41] Bootstrap variants of the Akaike information criterion for mixed model selection
    Shang, Junfeng
    Cavanaugh, Joseph E.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2008, 52 (04) : 2004 - 2021
  • [42] Assessing Model Selection Uncertainty Using a Bootstrap Approach: An Update
    Lubke, Gitta H.
    Campbell, Ian
    McArtor, Dan
    Miller, Patrick
    Luningham, Justin
    van den Berg, Stephanie M.
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2017, 24 (02) : 230 - 245
  • [43] Bootstrap-based model selection criteria for beta regressions
    Bayer, Fabio M.
    Cribari-Neto, Francisco
    TEST, 2015, 24 (04) : 776 - 795
  • [44] Parametric bootstrap model selection criterion with in linear model compared to other criteria
    Ucal, Meltem
    WMSCI 2005: 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Vol 8, 2005, : 354 - 357
  • [45] BOOTSTRAP SELECTION PROCEDURES
    SWANEPOEL, JWH
    SOUTH AFRICAN STATISTICAL JOURNAL, 1982, 16 (02) : 162 - 162
  • [46] THE BOOTSTRAP-BASED SELECTION CRITERIA: AN OPTIMAL CHOICE FOR MODEL SELECTION IN LINEAR REGRESSION
    Shang, Junfeng
    ADVANCES AND APPLICATIONS IN STATISTICS, 2010, 14 (02) : 173 - 189
  • [47] A bootstrap model selection criterion based on Kullback's symmetric divergence
    Seghouane, AK
    De Lathawer, L
    PROCEEDINGS OF THE 2003 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING, 2003, : 494 - 497
  • [48] Robust bootstrap control charts for percentiles based on model selection approaches
    Chiang, Jyun-You
    Lio, Y. L.
    Ng, H. K. T.
    Tsai, Tzong-Ru
    Li, Ting
    COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 123 : 119 - 133
  • [49] Bootstrap model order selection of Zernike polynomial expansion for classification of rice
    Wee, CY
    Paramesran, R
    Takeda, F
    TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING, 2004, : A203 - A206
  • [50] Optimal selection of model order for a class of nonlinear systems using the bootstrap
    Zoubir, AM
    Ralston, JC
    Iskander, DR
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 3945 - 3948