A Graphical Method to Evaluate Spectral Preprocessing in Multivariate Regression Calibrations: Example with Savitzky-Golay Filters and Partial Least Squares Regression

被引:53
|
作者
Delwiche, Stephen R. [1 ]
Reeves, James B., III [2 ]
机构
[1] ARS, USDA, Beltsville Agr Res Ctr, Food Qual Lab, Beltsville, MD 20705 USA
[2] ARS, USDA, Environm Management & Byprod Utilizat Lab, Beltsville, MD 20705 USA
关键词
Preprocessing; Savitzky-Golay; Near-infrared spectroscopy; NIR spectroscopy; Partial least squares; PLS; Derivative; Smoothing; Regression; DIFFERENTIATION; SPECTROSCOPY; MIXTURES; WHEAT; NIR;
D O I
10.1366/000370210790572007
中图分类号
TH7 [仪器、仪表];
学科分类号
0804 ; 080401 ; 081102 ;
摘要
In multivariate regression analysis of spectroscopy data, spectral preprocessing is often performed to reduce unwanted background information (offsets, sloped baselines) or accentuate absorption features in intrinsically overlapping bands. These procedures, also known as pretreatments, are commonly smoothing operations or derivatives. While such operations are often useful in reducing the number of latent variables of the actual decomposition and lowering residual error, they also run the risk of misleading the practitioner into accepting calibration equations that are poorly adapted to samples outside of the calibration. The current study developed a graphical method to examine this effect oil partial least squares (PI,S) regression calibrations of near-infrared (NIR) reflection spectra of ground wheat meal with two analytes, protein content and sodium dodecyl sulfate sedimentation (SDS) volume (an indicator of the quantity of the gluten proteins that contribute to strong doughs). These two properties were chosen because of their differing abilities to be modeled by NIR spectroscopy: excellent for protein content, fair for SDS sedimentation volume. To further demonstrate the potential pitfalls of preprocessing, an artificial component, a randomly generated value, was included in PLS regression trials. Savitzky-Golay (digital filter) smoothing, first-derivative, and second-derivative preprocess functions (5 to 25 centrally symmetric convolution points, derived from quadratic polynomials) were applied to PLS calibrations of 1 to 15 factors. The results demonstrated the danger of all over reliance on preprocessing when (1) the number of samples used in a multivariate calibration is low (<50), (2) the spectral response of the analyte is weak, and (3) the goodness of the calibration is based on the coefficient of determination (R(2)) rather than a term based on residual error. The graphical method has application to the evaluation of other preprocess functions and various types of spectroscopy data.
引用
收藏
页码:73 / 82
页数:10
相关论文
共 50 条
  • [1] Savitzky-Golay least-squares polynomial filters in ECG signal processing
    Hargittai, S
    Computers in Cardiology 2005, Vol 32, 2005, 32 : 763 - 766
  • [2] GENERAL LEAST-SQUARES SMOOTHING AND DIFFERENTIATION BY THE CONVOLUTION (SAVITZKY-GOLAY) METHOD
    GORRY, PA
    ANALYTICAL CHEMISTRY, 1990, 62 (06) : 570 - 573
  • [3] Spectral Partial Least Squares Regression
    Chen, Jiangfenng
    Yuan, Baozong
    2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 1351 - 1354
  • [5] On estimation of multivariate prediction regions in partial least squares regression
    Lin, Weilu
    Zhuang, Yingping
    Zhang, Siliang
    Martin, Elaine
    JOURNAL OF CHEMOMETRICS, 2013, 27 (09) : 243 - 250
  • [6] A robust partial least squares regression method with applications
    Gonzalez, Javier
    Pena, Daniel
    Romera, Rosario
    JOURNAL OF CHEMOMETRICS, 2009, 23 (1-2) : 78 - 90
  • [7] Partial least squares regression as a multivariate tool for the interpretation of infrared spectra
    Luinge, H. J.
    Van der Maas, J. H.
    Visser, T.
    Chinese Science Bulletin, 1993, 38 (17):
  • [8] Partial Least Squares Regression Analysis: Example of Motor Fitness Data
    Serbetar, Ivan
    CROATIAN JOURNAL OF EDUCATION-HRVATSKI CASOPIS ZA ODGOJ I OBRAZOVANJE, 2012, 14 (04): : 917 - 932
  • [9] Joint Optimization of Savitzky-Golay Smoothing Models and Partial Least Squares Factors for Near-infrared Spectroscopic Analysis of Serum Glucose
    Xie Jun
    Pan Tao
    Chen Jie-Mei
    Chen Hua-Zhou
    Ren Xiao-Huan
    CHINESE JOURNAL OF ANALYTICAL CHEMISTRY, 2010, 38 (03) : 342 - 346
  • [10] Stacked partial least squares regression analysis for spectral calibration and prediction
    Ni, Wangdong
    Brown, Steven D.
    Man, Ruilin
    JOURNAL OF CHEMOMETRICS, 2009, 23 (9-10) : 505 - 517