Bootstrap methods for developing predictive models

被引:485
|
作者
Austin, PC
Tu, JV
机构
[1] Inst Clin Evaluat Sci, Toronto, ON M4N 3M5, Canada
[2] Univ Toronto, Dept Publ Hlth Sci, Toronto, ON, Canada
[3] Univ Toronto, Dept Hlth Policy Management & Evaluat, Toronto, ON, Canada
[4] Inst Clin Evaluat Sci, Toronto, ON, Canada
[5] Sunnybrook & Womens Coll, Hlth Sci Ctr, Div Gen Internal Med, Toronto, ON, Canada
来源
AMERICAN STATISTICIAN | 2004年 / 58卷 / 02期
基金
加拿大健康研究院;
关键词
acute myocardial infarction; epidemiological research; mortality; multivariate analysis; regression models; variable selection;
D O I
10.1198/0003130043277
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Researchers frequently use automated model selection methods such as backwards elimination to identify variables that are independent predictors of an outcome under consideration. We propose using bootstrap resampling in conjunction with automated variable selection methods to develop parsimonious prediction models. Using data on patients admitted to hospital with a heart attack, we demonstrate that selecting those variables that were identified as independent predictors of mortality in at least 60% of the bootstrap samples resulted in a parsimonious model with excellent predictive ability.
引用
收藏
页码:131 / 137
页数:7
相关论文
共 50 条