Monte Carlo cross-validation for selecting a model and estimating the prediction error in multivariate calibration

Cited by: 192
Authors
Xu, QS
Liang, YZ [1]
Du, YP
Affiliations
[1] Cent S Univ, Coll Chem & Chem Engn, Changsha 410083, Peoples R China
[2] Hunan Univ, Coll Math & Econometr, Changsha, Peoples R China
[3] Hunan Univ, Inst Chemometr & Chem Sensing Technol, Changsha, Peoples R China
Keywords
model selection; prediction error; cross-validation
DOI
10.1002/cem.858
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
A new, simple and effective method named Monte Carlo cross-validation (MCCV) has been introduced and evaluated for selecting a model and estimating the prediction ability of the model selected. Unlike the leave-one-out procedure widely used in chemometrics for cross-validation (CV), the Monte Carlo cross-validation developed in this paper is an asymptotically consistent method of model selection. It can avoid an unnecessarily large model and therefore decreases the risk of overfitting. The results obtained from a simulation study showed that MCCV has a clearly larger probability than leave-one-out CV (LOO-CV) of selecting the model with the best prediction ability and that a corrected MCCV (CMCCV) could give a more accurate estimation of prediction ability than LOO-CV or MCCV. The results obtained with real data sets demonstrated that MCCV could successfully select an appropriate model and that CMCCV could assess the prediction ability of the selected model with satisfactory accuracy. Copyright © 2004 John Wiley & Sons, Ltd.
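The general idea of Monte Carlo cross-validation described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the ordinary least-squares model family, the nested "first k columns" candidate models, the validation-set size `n_valid`, the number of random splits `n_splits`, and the helper names `mccv_mse` and `select_model_size` are all assumptions made for this example. The corrected variant (CMCCV) is not shown, since its formula is not given here.

```python
import numpy as np

def mccv_mse(X, y, n_valid, n_splits=200, seed=0):
    """Average squared prediction error over random train/validation splits."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    errors = []
    for _ in range(n_splits):
        # Draw a fresh random partition: n_valid samples are held out.
        perm = rng.permutation(n)
        valid, train = perm[:n_valid], perm[n_valid:]
        # Fit by least squares on the training part only.
        coef, *_ = np.linalg.lstsq(X[train], y[train], rcond=None)
        resid = y[valid] - X[valid] @ coef
        errors.append(np.mean(resid ** 2))
    return float(np.mean(errors))

def select_model_size(X, y, max_size, n_valid, n_splits=200, seed=0):
    """Choose how many leading columns of X to keep by minimizing the MCCV error.

    The same seed is reused for every candidate, so all candidates are
    scored on identical random splits.
    """
    scores = {
        k: mccv_mse(X[:, :k], y, n_valid, n_splits, seed)
        for k in range(1, max_size + 1)
    }
    best = min(scores, key=scores.get)
    return best, scores

# Synthetic data (assumption): only the first two predictors carry signal.
rng = np.random.default_rng(42)
X = rng.normal(size=(60, 6))
y = X[:, :2] @ np.array([2.0, -1.5]) + 0.1 * rng.normal(size=60)

best_k, scores = select_model_size(X, y, max_size=6, n_valid=30)
print("selected number of predictors:", best_k)
```

Holding out a large fraction of the samples in each split (half of them in this sketch) is the feature that distinguishes this scheme from leave-one-out, where only a single sample is held out at a time.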
Pages: 112-120
Page count: 9
Related papers
50 in total
  • [21] Estimating misclassification error with small samples via bootstrap cross-validation
    Fu, WJJ
    Carroll, RJ
    Wang, SJ
    BIOINFORMATICS, 2005, 21 (09) : 1979 - 1986
  • [22] Ascertainment of the number of samples in the validation set in Monte Carlo cross validation and the selection of model dimension with Monte Carlo cross validation
    Du, Yi Ping
    Kasemsumran, Sumaporn
    Maruo, Katsuhiko
    Nakagawa, Takehiro
    Ozaki, Yukihiro
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2006, 82 (1-2) : 83 - 89
  • [23] Monte Carlo cross-validation for a study with binary outcome and limited sample size
    Shan, Guogen
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)
  • [24] Monte Carlo cross-validation for a study with binary outcome and limited sample size
    Guogen Shan
    BMC Medical Informatics and Decision Making, 22
  • [25] Corrected versions of cross-validation criteria for selecting multivariate regression and growth curve models
    Yasunori Fujikoshi
    Takafumi Noguchi
    Megu Ohtaki
    Hirokazu Yanagihara
    Annals of the Institute of Statistical Mathematics, 2003, 55 : 537 - 553
  • [26] Corrected versions of cross-validation criteria for selecting multivariate regression and growth curve models
    Fujikoshi, Y
    Noguchi, T
    Ohtaki, M
    Yanagihara, H
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2003, 55 (03) : 537 - 553
  • [27] Calibration and validation of a Monte Carlo model for PGNAA of chlorine in soil
    Howell, SL
    Sigg, RA
    Moore, FS
    DeVol, TA
    JOURNAL OF RADIOANALYTICAL AND NUCLEAR CHEMISTRY, 2000, 244 (01) : 173 - 178
  • [28] Calibration and Validation of a Monte Carlo Model for PGNAA of Chlorine in Soil
    S. L. Howell
    R. A. Sigg
    F. S. Moore
    T. A. DeVol
    Journal of Radioanalytical and Nuclear Chemistry, 2000, 244 : 173 - 178
  • [29] Application of Monte Carlo cross-validation to identify pathway cross-talk in neonatal sepsis
    Zhang, Yuxia
    Liu, Cui
    Wang, Jingna
    Li, Xingxia
    EXPERIMENTAL BIOLOGY AND MEDICINE, 2018, 243 (05) : 444 - 450
  • [30] Cross-validation of prediction equations for estimating body composition in ballet dancers
    Araujo Leal, Leilane Lilian
    Lopes Barbosa, Giovanna Stefanne
    Urbano Ferreira, Rannapaula Lawrynhuk
    Avelino, Erikarla Baracho
    Bezerra, Adriana Nunes
    de Lima Vale, Sancha Helena
    Lima Maciel, Bruna Leal
    PLOS ONE, 2019, 14 (07)