UPPER BOUNDS ON THE MINIMUM COVERAGE PROBABILITY OF CONFIDENCE INTERVALS IN REGRESSION AFTER MODEL SELECTION

Cited: 12
Authors
Kabaila, Paul [1 ]
Giri, Khageswor [1 ]
Affiliations
[1] La Trobe Univ, Dept Math & Stat, Bundoora, Vic 3086, Australia
Keywords
Adjusted R²-statistic; AIC; 'best subset' regression; BIC; Mallows' criterion; t-tests; VARIABLE SELECTION; INFERENCE; REJECTION; PRETEST; ERROR;
DOI
10.1111/j.1467-842X.2009.00544.x
Chinese Library Classification
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Subject classification codes
020208 ; 070103 ; 0714 ;
Abstract
We consider a linear regression model, with the parameter of interest a specified linear combination of the components of the regression parameter vector. We suppose that, as a first step, a data-based model selection (e.g. by preliminary hypothesis tests or minimizing the Akaike information criterion - AIC) is used to select a model. It is common statistical practice to then construct a confidence interval for the parameter of interest, based on the assumption that the selected model had been given to us a priori. This assumption is false, and it can lead to a confidence interval with poor coverage properties. We provide an easily computed finite-sample upper bound (calculated by repeated numerical evaluation of a double integral) to the minimum coverage probability of this confidence interval. This bound applies for model selection by any of the following methods: minimum AIC, minimum Bayesian information criterion (BIC), maximum adjusted R², minimum Mallows' Cp and t-tests. The importance of this upper bound is that it delineates general categories of design matrices and model selection procedures for which this confidence interval has poor coverage properties. This upper bound is shown to be a finite-sample analogue of an earlier large-sample upper bound due to Kabaila and Leeb.
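The coverage failure the abstract describes can be seen in a small Monte Carlo sketch (illustrative only, not the paper's double-integral bound): two correlated regressors, a preliminary t-test used to decide whether to drop the second one, and a naive 95% interval for the first coefficient built as if the selected model were fixed in advance. All design choices below (n, the correlation, the value of β₂ near the pretest threshold, known error variance) are our own assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n, reps, sigma, rho = 30, 20000, 1.0, 0.9  # illustrative choices

# Fixed design with strongly correlated regressors
x1 = rng.standard_normal(n)
x2 = rho * x1 + np.sqrt(1 - rho**2) * rng.standard_normal(n)
X = np.column_stack([x1, x2])
XtX_inv = np.linalg.inv(X.T @ X)

se_full = sigma * np.sqrt(XtX_inv[0, 0])  # se of beta1-hat, full model
se2 = sigma * np.sqrt(XtX_inv[1, 1])      # se of beta2-hat (used in the pretest)
se_red = sigma / np.sqrt(x1 @ x1)         # se of beta1-hat, reduced model

beta1, beta2 = 1.0, 1.5 * se2  # beta2 deliberately near the pretest threshold
z = 1.96                       # nominal 95% interval (sigma treated as known)

covered = 0
for _ in range(reps):
    y = beta1 * x1 + beta2 * x2 + sigma * rng.standard_normal(n)
    b_full = XtX_inv @ (X.T @ y)
    if abs(b_full[1] / se2) > z:
        # Pretest retains x2: interval from the full model
        lo, hi = b_full[0] - z * se_full, b_full[0] + z * se_full
    else:
        # Pretest drops x2: interval from the reduced model, ignoring selection
        b_red = (x1 @ y) / (x1 @ x1)
        lo, hi = b_red - z * se_red, b_red + z * se_red
    covered += (lo <= beta1 <= hi)

coverage = covered / reps
print(f"naive post-selection coverage: {coverage:.3f}  (nominal 0.95)")
```

Because the reduced-model estimator is biased by the omitted β₂ whenever the pretest (wrongly) drops x₂, the realized coverage falls well below the nominal 0.95, which is the kind of coverage deficiency the paper's upper bound quantifies.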
Pages: 271-287
Page count: 17