Selection of the best calibration sample subset for multivariate regression

被引:44
|
作者
Ferre, J
Rius, FX
机构
[1] Departament de Química, Universitat Rovira I Virgili, 43005-Tarragona, Pl. Imperial Tarraco
关键词
D O I
10.1021/ac950482a
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This paper discusses a methodology for selecting the minimum number of calibration samples in principal component regression (PCR) analysis. The method uses only the instrumental responses of a large set of samples to select the optimal subset, which is then submitted to chemical analysis and calibration. The subset is selected to provide a low variance of the regression coefficients. The methodology has been applied to UV-visible spectroscopy data to determine Ca2+ in water and near-IR spectroscopy data to determine moisture in corn. In both cases, the regression models developed with a reduced number of samples provided accurate results. As far as precision is concerned, a similar root-mean-squared error of cross-validation (RMSECV) is found when comparing the new methodology with the results of the regression models that use the complete set of calibration samples and PCR. The number of analyzed samples in the calibration set can be reduced by up to 50%, which represents a considerable reduction in costs.
引用
收藏
页码:1565 / 1571
页数:7
相关论文
共 50 条
  • [1] SELECTION OF BEST SUBSET IN REGRESSION ANALYSIS
    HOCKING, RR
    LESLIE, RN
    [J]. TECHNOMETRICS, 1967, 9 (04) : 531 - &
  • [2] SELECTION OF BEST SUBSET IN REGRESSION ANALYSIS
    HOCKING, RR
    [J]. TECHNOMETRICS, 1967, 9 (01) : 188 - &
  • [3] Selection of a calibration sample subset by a semi-supervised method
    He, Zhonghai
    Ma, Zhenhe
    Li, Mengchao
    Zhou, Yang
    [J]. JOURNAL OF NEAR INFRARED SPECTROSCOPY, 2018, 26 (02) : 87 - 94
  • [4] Variable selection in linear regression models: Choosing the best subset is not always the best choice
    Hanke, Moritz
    Dijkstra, Louis
    Foraita, Ronja
    Didelez, Vanessa
    [J]. BIOMETRICAL JOURNAL, 2023,
  • [5] Variable selection in linear regression models: Choosing the best subset is not always the best choice
    Hanke, Moritz
    Dijkstra, Louis
    Foraita, Ronja
    Didelez, Vanessa
    [J]. BIOMETRICAL JOURNAL, 2024, 66 (01)
  • [6] Combining support vector regression with feature selection for multivariate calibration
    Guo-Zheng Li
    Hao-Hua Meng
    Mary Qu Yang
    Jack Y. Yang
    [J]. Neural Computing and Applications, 2009, 18 : 813 - 820
  • [7] Combining support vector regression with feature selection for multivariate calibration
    Li, Guo-Zheng
    Meng, Hao-Hua
    Yang, Mary Qu
    Yang, Jack Y.
    [J]. NEURAL COMPUTING & APPLICATIONS, 2009, 18 (07): : 813 - 820
  • [8] Cost-efficient unsupervised sample selection for multivariate calibration
    Diaz, Valeria Fonseca
    De Ketelaere, Bart
    Aernouts, Ben
    Saeys, Wouter
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2021, 215
  • [9] Multivariate Regression with Calibration
    Liu, Han
    Wang, Lie
    Zhao, Tuo
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [10] A Local-branching Heuristic for the Best Subset Selection Problem in Linear Regression
    Bigler, Tamara
    Strub, Oliver
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEE IEEM), 2018, : 511 - 515