Monte Carlo cross-validation for selecting a model and estimating the prediction error in multivariate calibration

被引：192

作者：

Xu, QS

Liang, YZ ^{[1
]}

Du, YP

机构：

[1] Cent S Univ, Coll Chem & Chem Engn, Changsha 410083, Peoples R China

[2] Hunan Univ, Coll Math & Econometr, Changsha, Peoples R China

[3] Hunan Univ, Inst Chemometr & Chem Sensing Technol, Changsha, Peoples R China

来源：

JOURNAL OF CHEMOMETRICS | 2004年 / 18卷 / 02期

关键词：

model selection; prediction error; cross-validation;

D O I：

10.1002/cem.858

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

A new simple and effective method named Monte Carlo cross validation (MCCV) has been introduced and evaluated for selecting a model and estimating the prediction ability of the model selected. Unlike the leave-one-out procedure widely used in chemometrics for cross-validation (CV), the Monte Carlo cross-validation developed in this paper is an asymptotically consistent method of model selection. It can avoid an unnecessarily large model and therefore decreases the risk of overfitting of the model. The results obtained from a simulation study showed that MCCV has an obviously larger probability than leave-one-out CV (LOO-CV) of selecting the model with best prediction ability and that a corrected MCCV (CMCCV) could give a more accurate estimation of prediction ability than LOO-CV or MCCV. The results obtained with real data sets demonstrated that MCCV could successfully select an appropriate model and that CMCCV could assess the prediction ability of the selected model with satisfactory accuracy. Copyright (C) 2004 John Wiley Sons, Ltd.

引用

页码：112 / 120

页数：9

共 50 条

[31] Cross-validation for selecting the penalty factor in least squares model averaging
Fang, Fang
Yang, Qiwei
Tian, Wenling
ECONOMICS LETTERS, 2022, 217
[32] Estimation of prediction error by using K-fold cross-validation
Tadayoshi Fushiki
Statistics and Computing, 2011, 21 : 137 - 146
[33] Estimation of prediction error by using K-fold cross-validation
Fushiki, Tadayoshi
STATISTICS AND COMPUTING, 2011, 21 (02) : 137 - 146
[34] CROSS-VALIDATION AND MULTINOMIAL PREDICTION
STONE, M
BIOMETRIKA, 1974, 61 (03) : 509 - 515
[35] Monte Carlo cross validation
Xu, QS
Liang, YZ
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2001, 56 (01) : 1 - 11
[36] Pathway analysis based on Monte Carlo Cross-Validation in polyarticular juvenile idiopathic arthritis
Lin, Shunhua
Wang, Yuanji
Mu, Shunmei
Zhang, Junxi
Yuan, Fangchang
Sun, Kang
PATHOLOGY RESEARCH AND PRACTICE, 2017, 213 (01) : 7 - 12
[37] Validation of machine learning ridge regression models using Monte Carlo, bootstrap, and variations in cross-validation
Nakatsu, Robbie T.
JOURNAL OF INTELLIGENT SYSTEMS, 2023, 32 (01)
[38] Statistical confidence for variable selection in QSAR models via Monte Carlo cross-validation
Konovalov, Dmitry A.
Sim, Nigel
Deconinck, Eric
Heyden, Yvan Vander
Coomans, Danny
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2008, 48 (02) : 370 - 383
[39] Cross-Validation Without Doing Cross-Validation in Genome-Enabled Prediction
Gianola, Daniel
Schoen, Chris-Carolin
G3-GENES GENOMES GENETICS, 2016, 6 (10): : 3107 - 3128
[40] Revealing pathway cross-talk related to diabetes mellitus by Monte Carlo Cross-Validation analysis
Cai, Han-Qing
Lv, Shi-Hong
Shi, Chun-Jing
OPEN LIFE SCIENCES, 2017, 12 (01): : 473 - 480

← 1 2 3 4 5 →