Monte Carlo cross-validation for selecting a model and estimating the prediction error in multivariate calibration

被引:192
|
作者
Xu, QS
Liang, YZ [1 ]
Du, YP
机构
[1] Cent S Univ, Coll Chem & Chem Engn, Changsha 410083, Peoples R China
[2] Hunan Univ, Coll Math & Econometr, Changsha, Peoples R China
[3] Hunan Univ, Inst Chemometr & Chem Sensing Technol, Changsha, Peoples R China
关键词
model selection; prediction error; cross-validation;
D O I
10.1002/cem.858
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A new simple and effective method named Monte Carlo cross validation (MCCV) has been introduced and evaluated for selecting a model and estimating the prediction ability of the model selected. Unlike the leave-one-out procedure widely used in chemometrics for cross-validation (CV), the Monte Carlo cross-validation developed in this paper is an asymptotically consistent method of model selection. It can avoid an unnecessarily large model and therefore decreases the risk of overfitting of the model. The results obtained from a simulation study showed that MCCV has an obviously larger probability than leave-one-out CV (LOO-CV) of selecting the model with best prediction ability and that a corrected MCCV (CMCCV) could give a more accurate estimation of prediction ability than LOO-CV or MCCV. The results obtained with real data sets demonstrated that MCCV could successfully select an appropriate model and that CMCCV could assess the prediction ability of the selected model with satisfactory accuracy. Copyright (C) 2004 John Wiley Sons, Ltd.
引用
收藏
页码:112 / 120
页数:9
相关论文
共 50 条
  • [1] Estimating Prediction Error: Cross-Validation vs. Accumulated Prediction Error
    Haggstrom, Jenny
    De Luna, Xavier
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2010, 39 (05) : 880 - 898
  • [3] SUPERCOMPUTERS FOR MONTE-CARLO SIMULATION - CROSS-VALIDATION VERSUS RAOS TEST IN MULTIVARIATE REGRESSION
    KLEIJNEN, JPC
    LECTURE NOTES IN ECONOMICS AND MATHEMATICAL SYSTEMS, 1992, 376 : 233 - 245
  • [4] Cross-validation for selecting a model selection procedure
    Zhang, Yongli
    Yang, Yuhong
    JOURNAL OF ECONOMETRICS, 2015, 187 (01) : 95 - 112
  • [5] On the use of cross-validation to assess performance in multivariate prediction
    Jonathan, P
    Krzanowski, WJ
    McCarthy, WV
    STATISTICS AND COMPUTING, 2000, 10 (03) : 209 - 229
  • [6] On the use of cross-validation to assess performance in multivariate prediction
    P. Jonathan
    W. J. Krzanowski
    W. V. McCarthy
    Statistics and Computing, 2000, 10 : 209 - 229
  • [7] Bayesian cross-validation by parallel Markov chain Monte Carlo
    Cooper, Alex
    Vehtari, Aki
    Forbes, Catherine
    Simpson, Dan
    Kennedy, Lauren
    STATISTICS AND COMPUTING, 2024, 34 (04)
  • [8] On Estimating Model in Feature Selection With Cross-Validation
    Qi, Chunxia
    Diao, Jiandong
    Qiu, Like
    IEEE ACCESS, 2019, 7 : 33454 - 33463
  • [9] Detecting influential observations by cluster analysis and Monte Carlo cross-validation
    Bian, Xihui
    Cai, Wensheng
    Shao, Xueguang
    Chen, Da
    Grant, Edward R.
    ANALYST, 2010, 135 (11) : 2841 - 2847
  • [10] CROSS-VALIDATION OF MULTIVARIATE DENSITIES
    SAIN, SR
    BAGGERLY, KA
    SCOTT, DW
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1994, 89 (427) : 807 - 817