Validation of chemometric models - A tutorial

被引:233
|
作者
Westad, Frank [1 ]
Marini, Federico [2 ]
机构
[1] CAMO Software AS, N-0158 Oslo, Norway
[2] Univ Roma La Sapienza, Dept Chem, I-00185 Rome, Italy
关键词
Validation; Chemometrics; Resampling; Test set; Cross-validation; ARTIFICIAL NEURAL-NETWORKS; VARIABLE SELECTION; MULTIVARIATE CALIBRATION; SPECTROSCOPY; PREDICTION; REGRESSION; CLASSIFICATION; ALGORITHM; DESIGN; ERRORS;
D O I
10.1016/j.aca.2015.06.056
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In this tutorial, we focus on validation both from a numerical and conceptual point of view. The often applied reported procedure in the literature of (repeatedly) dividing a dataset randomly into a calibration and test set must be applied with care. It can only be justified when there is no systematic stratification of the objects that will affect the validated estimates or figures of merits such as RMSE or R-2. The various levels of validation may, typically, be repeatability, reproducibility, and instrument and raw material variation. Examples of how one data set can be validated across this background information illustrate that it will affect the figures of merits as well as the dimensionality of the models. Even more important is the robustness of the models for predicting future samples. Another aspect that is brought to attention is validation in terms of the overall conclusions when observing a specific system. One example is to apply several methods for finding the significant variables and see if there is a consensus subset that also matches what is reported in the literature or based on the underlying chemistry. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:14 / 24
页数:11
相关论文
共 50 条
  • [1] A TUTORIAL ON THE OPERATIONAL VALIDATION OF SIMULATION MODELS
    Sargent, Robert G.
    Goldsman, David M.
    Yaacoub, Tony
    [J]. 2016 WINTER SIMULATION CONFERENCE (WSC), 2016, : 163 - 177
  • [2] AN INTRODUCTORY TUTORIAL ON VERIFICATION AND VALIDATION OF SIMULATION MODELS
    Sargent, Robert G.
    [J]. 2015 WINTER SIMULATION CONFERENCE (WSC), 2015, : 1729 - 1740
  • [3] VERIFICATION AND VALIDATION OF SIMULATION MODELS: AN ADVANCED TUTORIAL
    Sargent, Robert G.
    [J]. 2020 WINTER SIMULATION CONFERENCE (WSC), 2020, : 16 - 29
  • [4] IF validation environment tutorial
    Bozga, M
    Graf, S
    Mounier, L
    Ober, I
    [J]. MODEL CHECKING SOFTWARE, 2004, 2989 : 306 - 307
  • [5] IF validation environment tutorial
    Bozga, Marius
    Graf, Susanne
    Mounier, Laurent
    Ober, Lulian
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2004, 2989 : 306 - 307
  • [6] Validation of chemometric models for the determination of deoxynivalenol on maize by mid-infrared spectroscopy
    G. Kos
    H. Lohninger
    R. Krska
    [J]. Mycotoxin Research, 2003, 19 (2) : 149 - 153
  • [7] Variable selection in the chemometric treatment of food data: A tutorial review
    Gomes, Adriano de Araujo
    Azcarate, Silvana M.
    Diniz, Paulo Henrique Gonsalves Dias
    Fernandes, David Douglas de Sousa
    Veras, Germano
    [J]. FOOD CHEMISTRY, 2022, 370
  • [8] Application of chemometric methods to XRF-data - A tutorial review
    Panchuk, Vitaly
    Yaroshenko, Irina
    Legin, Andrey
    Semenov, Valentin
    Kirsanov, Dmitry
    [J]. ANALYTICA CHIMICA ACTA, 2018, 1040 : 19 - 32
  • [9] Chemometric calibration of infrared spectrometers: selection and validation of variables by non-linear models
    Benoudjit, N
    Cools, E
    Meurens, M
    Verleysen, M
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2004, 70 (01) : 47 - 53
  • [10] DESIGN AND VALIDATION OF PROTOCOLS - A TUTORIAL
    HOLZMANN, GJ
    [J]. COMPUTER NETWORKS AND ISDN SYSTEMS, 1993, 25 (09): : 981 - 1017