Propagation of measurement errors for the validation of predictions obtained by principal component regression and partial least squares

Authors
Faber, K [1]
Kowalski, BR [1]
Affiliations
[1] UNIV WASHINGTON,CTR PROC ANALYT CHEM,SEATTLE,WA 98195
Keywords
error propagation; multivariate calibration; regression model; EIV model; OLS; PCR; PLS; covariance matrix of regression vector; prediction interval; limit of detection; wavelength selection; sample selection; local modeling; bias; stopping rule
DOI
10.1002/(SICI)1099-128X(199705)11:3<181::AID-CEM459>3.0.CO;2-7
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Multivariate calibration aims to model the relation between a dependent variable, e.g. analyte concentration, and the measured independent variables, e.g. spectra, for complex mixtures. The model parameters are obtained in the form of a regression vector from calibration data by regression methods such as principal component regression (PCR) or partial least squares (PLS). Subsequently, this regression vector is used to predict the dependent variable for unknown mixtures. The validation of the obtained predictions is a crucial part of the procedure, i.e. together with the point estimate an interval estimate is desired. The associated prediction intervals can be constructed from the covariance matrix of the estimated regression vector. However, currently known expressions for PCR and PLS are derived within the classical regression framework, i.e. they only take the uncertainty in the dependent variable into account. This severely limits their capability for establishing realistic prediction intervals in practical situations. In this paper, expressions are derived using the method of error propagation that also account for the measurement errors in the independent variables. An exact linear relation is assumed between the dependent and independent variables. The obtained expressions are therefore valid for the classical errors-in-variables (EIV) model. In order to make the presentation reasonably self-contained, relevant expressions are reviewed for the classical regression model as well as the classical EIV model, especially for ordinary least squares (OLS). The consequences for the limit of detection, wavelength selection, sample selection and local modeling are discussed. Diagnostics are proposed to determine the adequacy of the approximations used in the derivations. Finally, PCR and PLS are so-called biased regression methods. Compared with OLS, they yield small variance at the expense of increased bias. It follows that bias may be an important ingredient of the obtained predictions. Therefore considerable attention is paid to the quantification of bias and new stopping rules for model selection in PCR and PLS are proposed. The theoretical ideas are illustrated by the analysis of real data taken from the literature (classical regression model) as well as simulated data (classical EIV model). (C) 1997 by John Wiley & Sons, Ltd.
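As a rough illustration of the classical-regression starting point the abstract refers to (errors in the dependent variable only), the sketch below fits PCR by truncated SVD and builds a prediction standard error from the covariance matrix of the regression vector. It does not reproduce the paper's EIV expressions, which are not given in this record. The function names `pcr_fit` and `pcr_predict`, the mean-centering, and the degrees-of-freedom choice are assumptions made for the example.

```python
import numpy as np

def pcr_fit(X, y, n_components):
    """Hypothetical helper: PCR on mean-centered data via truncated SVD."""
    x_mean = X.mean(axis=0)
    y_mean = y.mean()
    Xc = X - x_mean
    yc = y - y_mean
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    U_a = U[:, :n_components]
    s_a = s[:n_components]
    V_a = Vt[:n_components].T
    # Regression vector: b = V_a diag(1/s_a) U_a^T y
    b = V_a @ ((U_a.T @ yc) / s_a)
    # Residual variance; one degree of freedom is assumed lost to centering
    resid = yc - Xc @ b
    dof = X.shape[0] - n_components - 1
    sigma2 = resid @ resid / dof
    # Classical covariance of b (noise in y only): sigma^2 V_a diag(1/s_a^2) V_a^T
    cov_b = sigma2 * (V_a / s_a**2) @ V_a.T
    return b, cov_b, sigma2, x_mean, y_mean

def pcr_predict(x_new, b, cov_b, sigma2, x_mean, y_mean, n_samples):
    """Point prediction and standard error of the prediction error."""
    xc = x_new - x_mean
    y_hat = y_mean + xc @ b
    # Contributions: regression vector, estimated mean of y, new reference noise
    var = xc @ cov_b @ xc + sigma2 / n_samples + sigma2
    return y_hat, np.sqrt(var)
```

Because this variance ignores noise in `X`, it is exactly the kind of interval the abstract describes as too optimistic when the independent variables carry measurement error.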
Pages: 181-238
Number of pages: 58
Related papers
50 in total
  • [1] Functional principal component regression and functional partial least squares
    Reiss, Philip T.
    Ogden, R. Todd
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2007, 102 (479) : 984 - 996
  • [2] Segmented principal component transform-partial least squares regression
    Barros, António S.
    Pinto, Rui
    Delgadillo, Ivonne
    Rutledge, Douglas N.
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2007, 89 (02) : 59 - 68
  • [3] The pls package: Principal component and partial least squares regression in R
    Mevik, Bjorn-Helge
    Wehrens, Ron
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2007, 18 (02) : 1 - 23
  • [4] Comparison of partial least squares regression and principal component regression for pelvic shape prediction
    Schumann, Steffen
    Nolte, Lutz-P.
    Zheng, Guoyan
    [J]. JOURNAL OF BIOMECHANICS, 2013, 46 (01) : 197 - 199
  • [5] Partial least squares improvement and research principal component regression extraction methods
    Xiong, Wangping
    Du, Jianqiang
    Nie, Wang
    [J]. 2014 IEEE 11TH INTL CONF ON UBIQUITOUS INTELLIGENCE AND COMPUTING AND 2014 IEEE 11TH INTL CONF ON AUTONOMIC AND TRUSTED COMPUTING AND 2014 IEEE 14TH INTL CONF ON SCALABLE COMPUTING AND COMMUNICATIONS AND ITS ASSOCIATED WORKSHOPS, 2014 : 583 - 585
  • [6] The Comparison of Robust Partial Least Squares Regression with Robust Principal Component Regression on a Real Data
    Polat, Esra
    Gunay, Suleyman
    [J]. 11TH INTERNATIONAL CONFERENCE OF NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2013, PTS 1 AND 2 (ICNAAM 2013), 2013, 1558 : 1458 - 1461
  • [7] Determination of Antioxidant Properties of Fruit Juice by Partial Least Squares and Principal Component Regression
    Sahin, Saliha
    Demir, Cevdet
    [J]. INTERNATIONAL JOURNAL OF FOOD PROPERTIES, 2016, 19 (07) : 1455 - 1464
  • [8] The equivalence of partial least squares and principal component regression in the sufficient dimension reduction framework
    Lin, You-Wu
    Deng, Bai-Chuan
    Xu, Qing-Song
    Yun, Yong-Huan
    Liang, Yi-Zeng
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2016, 150 : 58 - 64
  • [9] A partial least squares and principal component regression study of quinone compounds with trypanocidal activity
    Molfetta, F. A.
    Bruni, A. T.
    Rosselli, R. P.
    da Silva, A. B. E.
    [J]. STRUCTURAL CHEMISTRY, 2007, 18 (01) : 49 - 57
  • [10] A partial least squares and principal component regression study of quinone compounds with trypanocidal activity
    F. A. Molfetta
    A. T. Bruni
    F. P. Rosselli
    A. B. F. da Silva
    [J]. Structural Chemistry, 2007, 18 : 49 - 57