Monte Carlo cross-validation for selecting a model and estimating the prediction error in multivariate calibration

被引:192
|
作者
Xu, QS
Liang, YZ [1 ]
Du, YP
机构
[1] Cent S Univ, Coll Chem & Chem Engn, Changsha 410083, Peoples R China
[2] Hunan Univ, Coll Math & Econometr, Changsha, Peoples R China
[3] Hunan Univ, Inst Chemometr & Chem Sensing Technol, Changsha, Peoples R China
关键词
model selection; prediction error; cross-validation;
D O I
10.1002/cem.858
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A new simple and effective method named Monte Carlo cross validation (MCCV) has been introduced and evaluated for selecting a model and estimating the prediction ability of the model selected. Unlike the leave-one-out procedure widely used in chemometrics for cross-validation (CV), the Monte Carlo cross-validation developed in this paper is an asymptotically consistent method of model selection. It can avoid an unnecessarily large model and therefore decreases the risk of overfitting of the model. The results obtained from a simulation study showed that MCCV has an obviously larger probability than leave-one-out CV (LOO-CV) of selecting the model with best prediction ability and that a corrected MCCV (CMCCV) could give a more accurate estimation of prediction ability than LOO-CV or MCCV. The results obtained with real data sets demonstrated that MCCV could successfully select an appropriate model and that CMCCV could assess the prediction ability of the selected model with satisfactory accuracy. Copyright (C) 2004 John Wiley Sons, Ltd.
引用
收藏
页码:112 / 120
页数:9
相关论文
共 50 条
  • [31] Cross-validation for selecting the penalty factor in least squares model averaging
    Fang, Fang
    Yang, Qiwei
    Tian, Wenling
    ECONOMICS LETTERS, 2022, 217
  • [32] Estimation of prediction error by using K-fold cross-validation
    Tadayoshi Fushiki
    Statistics and Computing, 2011, 21 : 137 - 146
  • [33] Estimation of prediction error by using K-fold cross-validation
    Fushiki, Tadayoshi
    STATISTICS AND COMPUTING, 2011, 21 (02) : 137 - 146
  • [34] CROSS-VALIDATION AND MULTINOMIAL PREDICTION
    STONE, M
    BIOMETRIKA, 1974, 61 (03) : 509 - 515
  • [35] Monte Carlo cross validation
    Xu, QS
    Liang, YZ
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2001, 56 (01) : 1 - 11
  • [36] Pathway analysis based on Monte Carlo Cross-Validation in polyarticular juvenile idiopathic arthritis
    Lin, Shunhua
    Wang, Yuanji
    Mu, Shunmei
    Zhang, Junxi
    Yuan, Fangchang
    Sun, Kang
    PATHOLOGY RESEARCH AND PRACTICE, 2017, 213 (01) : 7 - 12
  • [37] Validation of machine learning ridge regression models using Monte Carlo, bootstrap, and variations in cross-validation
    Nakatsu, Robbie T.
    JOURNAL OF INTELLIGENT SYSTEMS, 2023, 32 (01)
  • [38] Statistical confidence for variable selection in QSAR models via Monte Carlo cross-validation
    Konovalov, Dmitry A.
    Sim, Nigel
    Deconinck, Eric
    Heyden, Yvan Vander
    Coomans, Danny
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2008, 48 (02) : 370 - 383
  • [39] Cross-Validation Without Doing Cross-Validation in Genome-Enabled Prediction
    Gianola, Daniel
    Schoen, Chris-Carolin
    G3-GENES GENOMES GENETICS, 2016, 6 (10): : 3107 - 3128
  • [40] Revealing pathway cross-talk related to diabetes mellitus by Monte Carlo Cross-Validation analysis
    Cai, Han-Qing
    Lv, Shi-Hong
    Shi, Chun-Jing
    OPEN LIFE SCIENCES, 2017, 12 (01): : 473 - 480