An unbiased model comparison test using cross-validation

被引:2
|
作者
Desmarais, Bruce A. [1 ]
Harden, Jeffrey J. [2 ]
机构
[1] Univ Massachusetts, Dept Polit Sci, Amherst, MA 01003 USA
[2] Univ Colorado, Dept Polit Sci, Boulder, CO 80309 USA
关键词
Model selection; Cross-validation; Kullback-Leibler Divergence; Vuong test; Clarke test; POLICY; INFORMATION; SELECTION; POLITICS; TIME;
D O I
10.1007/s11135-013-9884-7
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
Social scientists often consider multiple empirical models of the same process. When these models are parametric and non-nested, the null hypothesis that two models fit the data equally well is commonly tested using methods introduced by Vuong (Econometrica 57(2):307-333, 1989) and Clarke (Am J Political Sci 45(3):724-744, 2001; J Confl Resolut 47(1):72-93, 2003; Political Anal 15(3):347-363, 2007). The objective of each is to compare the Kullback-Leibler Divergence (KLD) of the two models from the true model that generated the data. Here we show that both of these tests are based upon a biased estimator of the KLD, the individual log-likelihood contributions, and that the Clarke test is not proven to be consistent for the difference in KLDs. As a solution, we derive a test based upon cross-validated log-likelihood contributions, which represent an unbiased KLD estimate. We demonstrate the CVDM test's superior performance via simulation, then apply it to two empirical examples from political science. We find that the test's selection can diverge from those of the Vuong and Clarke tests and that this can ultimately lead to differences in substantive conclusions.
引用
收藏
页码:2155 / 2173
页数:19
相关论文
共 50 条
  • [21] On Cross-Validation for MLP Model Evaluation
    Karkkainen, Tommi
    [J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2014, 8621 : 291 - 300
  • [22] Assessing model fit by cross-validation
    Hawkins, DM
    Basak, SC
    Mills, D
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (02): : 579 - 586
  • [23] Test, revision, and cross-validation of the physical activity self-definition model
    Kendzierski, D
    [J]. JOURNAL OF SPORT & EXERCISE PSYCHOLOGY, 2004, 26 : S103 - S103
  • [24] A comparison of material flow strength models using Bayesian cross-validation
    Bernstein, Jason
    Schmidt, Kathleen
    Rivera, David
    Barton, Nathan
    Florando, Jeffrey
    Kupresanin, Ana
    [J]. COMPUTATIONAL MATERIALS SCIENCE, 2019, 169
  • [25] Test, Revision, and Cross-Validation of the Physical Activity Self-Definition Model
    Kendzierski, Deborah
    Morganstein, Mara S.
    [J]. JOURNAL OF SPORT & EXERCISE PSYCHOLOGY, 2009, 31 (04): : 484 - 504
  • [26] Wavelet shrinkage using cross-validation
    Nason, GP
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1996, 58 (02): : 463 - 479
  • [27] CROSS-VALIDATION USING THE T STATISTIC
    KLEIJNEN, JPC
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1983, 13 (02) : 133 - 141
  • [28] Unbiased cross-validation kernel density estimation for wind and PV probabilistic modelling
    Wahbah, Maisam
    Mohandes, Baraa
    EL-Fouly, Tarek H. M.
    El Moursi, Mohamed Shawky
    [J]. ENERGY CONVERSION AND MANAGEMENT, 2022, 266
  • [29] External cross-validation for unbiased evaluation of protein family detectors:: Application to allergens
    Soeria-Atmadja, D
    Wallman, M
    Björklund, ÅK
    Isaksson, A
    Hammerling, U
    Gustafsson, MG
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 61 (04) : 918 - 925
  • [30] Comparison of Cross-Validation and Test Sets Approaches to Evaluation of Classifiers in Authorship Attribution Domain
    Baron, Grzegorz
    [J]. COMPUTER AND INFORMATION SCIENCES, ISCIS 2016, 2016, 659 : 81 - 89