variable selection methods;
model selection methods;
regression models;
Monte Carlo simulations;
backwards variable elimination;
D O I:
10.1080/02664760802382434
中图分类号:
O21 [概率论与数理统计];
C8 [统计学];
学科分类号:
020208 ;
070103 ;
0714 ;
摘要:
Prior studies have shown that automated variable selection results in models with substantially inflated estimates of the model R-2, and that a large proportion of selected variables are truly noise variables. These earlier studies used simulated data sets whose sample sizes were at most 100. We used Monte Carlo Simulations to examine the large-sample performance of backwards variable elimination. We found that in large samples, backwards variable elimination resulted in estimates of R-2 that were at most marginally biased. However, even in large samples, backwards elimination tended to identify the correct regression model in a minority of the simulated data sets.
机构:
Univ Liechtenstein, Inst Financial Serv, Furst Franz Josef Str, FL-9490 Vaduz, LiechtensteinUniv Liechtenstein, Inst Financial Serv, Furst Franz Josef Str, FL-9490 Vaduz, Liechtenstein
Hanke, Michael
Penev, Spiridon
论文数: 0引用数: 0
h-index: 0
机构:
Univ New South Wales, Sch Math & Stat, Sydney, NSW 2052, AustraliaUniv Liechtenstein, Inst Financial Serv, Furst Franz Josef Str, FL-9490 Vaduz, Liechtenstein