Assessing model fit by cross-validation

被引:623
|
作者
Hawkins, DM [1 ]
Basak, SC
Mills, D
机构
[1] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
[2] Univ Minnesota, Nat Resources Res Inst, Duluth, MN 55811 USA
关键词
D O I
10.1021/ci025626i
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
When QSAR models are fitted, it is important to validate any fitted model-to check that it is plausible that its predictions will carry over to fresh data not used in the model fitting exercise. There are two standard ways of doing this-using a separate hold-out test sample and the computationally much more burdensome leave-one-out cross-validation in which the entire pool of available compounds is used both to fit the model and to assess its validity. We show by theoretical argument and empiric study of a large QSAR data set that when the available sample size is small-in the dozens or scores rather than the hundreds, holding a portion of it back for testing is wasteful, and that it is much better to use cross-validation, but ensure that this is done properly.
引用
收藏
页码:579 / 586
页数:8
相关论文
共 50 条
  • [21] Cross-Validation Model Averaging for Generalized Functional Linear Model
    Zhang, Haili
    Zou, Guohua
    [J]. ECONOMETRICS, 2020, 8 (01)
  • [22] Fast Cross-Validation
    Liu, Yong
    Lin, Hailun
    Ding, Lizhong
    Wang, Weiping
    Liao, Shizhong
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2497 - 2503
  • [23] Cross-Validation With Confidence
    Lei, Jing
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (532) : 1978 - 1997
  • [24] SMOOTHED CROSS-VALIDATION
    HALL, P
    MARRON, JS
    PARK, BU
    [J]. PROBABILITY THEORY AND RELATED FIELDS, 1992, 92 (01) : 1 - 20
  • [25] PARAMETERS OF CROSS-VALIDATION
    HERZBERG, PA
    [J]. PSYCHOMETRIKA, 1969, 34 (2P2) : 1 - &
  • [26] Cross-validation methods
    Browne, MW
    [J]. JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2000, 44 (01) : 108 - 132
  • [27] Targeted cross-validation
    Zhang, Jiawei
    Ding, Jie
    Yang, Yuhong
    [J]. BERNOULLI, 2023, 29 (01) : 377 - 402
  • [28] CROSS-VALIDATION FOR PREDICTION
    COOIL, B
    WINER, RS
    RADOS, DL
    [J]. JOURNAL OF MARKETING RESEARCH, 1987, 24 (03) : 271 - 279
  • [29] Cross-validation Revisited
    Dutta, Santanu
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2016, 45 (02) : 472 - 490
  • [30] Purposeful cross-validation: a novel cross-validation strategy for improved surrogate optimizability
    Correia, Daniel
    Wilke, Daniel N.
    [J]. ENGINEERING OPTIMIZATION, 2021, 53 (09) : 1558 - 1573