Bootstrap model selection

被引:164
|
作者
Shao, J
机构
关键词
autoregressive time series; bootstrap sample size; generalized linear model; nonlinear regression; prediction error;
D O I
10.2307/2291661
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In a regression problem, typically there are p explanatory variables possibly related to a response variable, and we wish to select a subset of the p explanatory variables to fit a model between these variables and the response. A bootstrap variable/model selection procedure is to select the subset of variables by minimizing bootstrap estimates of the prediction error, where the bootstrap estimates are constructed based on a data set of size n. Although the bootstrap estimates have good properties, this bootstrap selection procedure is inconsistent in the sense that the probability of selecting the optimal subset of variables does not converge to 1 as n --> infinity. This inconsistency can be rectified by modifying the sampling method used in drawing bootstrap observations. For bootstrapping pairs (response, explanatory variable), it is found that instead of drawing n bootstrap observations (a customary bootstrap sampling plan), much less bootstrap observations should be sampled: The bootstrap selection procedure becomes consistent if we draw m bootstrap observations with m --> infinity and m/n --> 0. For bootstrapping residuals, we modify the bootstrap sampling procedure by increasing the variability among the bootstrap observations. The consistency of the modified bootstrap selection procedures is established in various situations, including linear models, nonlinear models, generalized linear models, and autoregressive time series. The choice of the bootstrap sample size m and some computational issues are also discussed. Some empirical results are presented.
引用
收藏
页码:655 / 665
页数:11
相关论文
共 50 条
  • [1] MODEL SELECTION AND THE BOOTSTRAP
    EFRON, B
    MATHEMATICAL SOCIAL SCIENCES, 1983, 5 (02) : 236 - 236
  • [2] Model selection: A bootstrap approach
    Zoubir, A.M.
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 3 : 1377 - 1380
  • [3] Bootstrap methods for model selection
    Zoubir, AM
    AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 1999, 53 (06) : 386 - 392
  • [4] Model selection with bootstrap validation
    Savvides, Rafael
    Makela, Jarmo
    Puolamaki, Kai
    STATISTICAL ANALYSIS AND DATA MINING, 2023, 16 (02) : 162 - 186
  • [5] Bootstrap for neural model selection
    Kallel, R
    Cottrell, M
    Vigneron, V
    NEUROCOMPUTING, 2002, 48 : 175 - 183
  • [6] Model selection: A bootstrap approach
    Zoubir, AM
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 1377 - 1380
  • [7] Weighted bootstrap for neural model selection
    Chuang, Shun-Chin
    Hung, Wen-Liang
    Fu, Hsin-Chia
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2008, 39 (05) : 557 - 562
  • [8] A note on bootstrap model selection criterion
    Chung, HY
    Lee, KW
    Koo, JY
    STATISTICS & PROBABILITY LETTERS, 1996, 26 (01) : 35 - 41
  • [9] Model selection by bootstrap penalization for classification
    Magalie Fromont
    Machine Learning, 2007, 66 : 165 - 207
  • [10] Testing Model Fit by Bootstrap Selection
    Gronneberg, Steffen
    Foldnes, Njal
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2019, 26 (02) : 182 - 190