The Sense and Non-Sense of Holdout Sample Validation in the Presence of Endogeneity
被引:65
|
作者:
Ebbes, Peter
论文数: 0引用数: 0
h-index: 0
机构:
Ohio State Univ, Fisher Coll Business, Columbus, OH 43210 USAOhio State Univ, Fisher Coll Business, Columbus, OH 43210 USA
Ebbes, Peter
[1
]
Papies, Dominik
论文数: 0引用数: 0
h-index: 0
机构:
Univ Hamburg, Inst Mkt & Media, D-20354 Hamburg, GermanyOhio State Univ, Fisher Coll Business, Columbus, OH 43210 USA
Papies, Dominik
[2
]
van Heerde, Harald J.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Waikato, Hamilton 3240, New Zealand
Tilburg Univ, CentER, NL-5000 LE Tilburg, NetherlandsOhio State Univ, Fisher Coll Business, Columbus, OH 43210 USA
van Heerde, Harald J.
[3
,4
]
机构:
[1] Ohio State Univ, Fisher Coll Business, Columbus, OH 43210 USA
model validation;
instrumental variables;
IV estimation;
endogeneity;
exogeneity;
predictive model;
descriptive model;
holdout sample;
prediction;
CROSS-VALIDATION;
MARKETING MODELS;
HETEROGENEITY;
BEHAVIOR;
D O I:
10.1287/mksc.1110.0666
中图分类号:
F [经济];
学科分类号:
02 ;
摘要:
Market response models based on field-generated data need to address potential endogeneity in the regressors to obtain consistent parameter estimates. Another requirement is that market response models predict well in a holdout sample. With both requirements combined, it may seem reasonable to subject an endogeneity-corrected model to a holdout prediction task, and this is quite common in the academic marketing literature. One may be inclined to expect that the consistent parameter estimates obtained via instrumental variables (IV) estimation predict better than the biased ordinary least squares (OLS) estimates. This paper shows that this expectation is incorrect. That is, if the holdout sample is similar to the estimation sample so that the regressors are endogenous in both samples, holdout sample validation favors regression estimates that are not corrected for endogeneity (i.e., OLS) over estimates that are corrected for endogeneity (i.e., IV estimation). We also discuss ways in which holdout samples may be used sensibly in the presence of endogeneity. A key takeaway is that if consistent parameter estimates are the primary model objective, the model should be validated with an exogenous (rather than endogenous) holdout sample.
机构:
Aligarh Muslim Univ, Interdisciplinary Biotechnol Unit, Aligarh, Uttar Pradesh, IndiaAligarh Muslim Univ, Interdisciplinary Biotechnol Unit, Aligarh, Uttar Pradesh, India