A bootstrap-based strategy for spectral interval selection in PLS regression

Cited by: 47
Authors
Bras, Ligia P. [1]
Lopes, Marta [1]
Ferreira, Ana P. [1]
Menezes, Jose C. [1]
Affiliations
[1] Univ Tecn Lisboa, Inst Biotechnol & Bioengn, Ctr Biol & Chem Engn, Inst Super Tecn, P-1049001 Lisbon, Portugal
Keywords
variable selection; bootstrap; spectral intervals; near-infrared; partial least squares
DOI
10.1002/cem.1153
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Bootstrap-based methods have been applied for spectral variable selection in near-infrared (NIR) and mid-infrared (MIR) spectroscopy applications. In this paper, an extension of those methods for the selection of spectral intervals instead of single spectral variables is proposed. This approach, interval partial least squares (PLS)-Bootstrap (iPLS-Bootstrap), was compared against the PLS-Bootstrap method and against the use of the whole spectral region for model development. These methods were tested on a NIR spectral dataset obtained from at-line monitoring of an industrial fermentation process, by correlating the spectra with the concentration of the active pharmaceutical ingredient (API). The performance of the models was evaluated based on their predictive ability in both cross-validation and external validation. For the dataset used, iPLS-Bootstrap improved the model's predictive ability, with a greater impact on external validation. The decrease observed in RMSEP relative to the full-spectrum and PLS-Bootstrap models was 14% and 6%, respectively. Copyright (C) 2008 John Wiley & Sons, Ltd.
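The abstract does not spell out how intervals are scored, so the following is only a rough sketch of one way interval selection from bootstrapped PLS coefficients could look. It assumes, purely for illustration, that a variable counts as significant when its bootstrap percentile interval excludes zero, and that an equal-width spectral interval is kept when at least a chosen fraction of its variables are significant. The function names (`pls1_coefficients`, `bootstrap_coefficients`, `select_intervals`), the parameters `alpha` and `min_fraction`, and the synthetic data are all hypothetical and are not taken from the paper.

```python
import numpy as np


def pls1_coefficients(X, y, n_components):
    """Regression vector of a single-response PLS model (NIPALS, mean-centred)."""
    Xk = X - X.mean(axis=0)
    yk = y - y.mean()
    W, P, q = [], [], []
    for _ in range(n_components):
        w = Xk.T @ yk
        norm = np.linalg.norm(w)
        if norm < 1e-12:          # nothing left to explain, stop early
            break
        w = w / norm
        t = Xk @ w                # scores
        tt = t @ t
        p = Xk.T @ t / tt         # X loadings
        qa = (yk @ t) / tt        # y loading
        Xk = Xk - np.outer(t, p)  # deflate X
        yk = yk - qa * t          # deflate y
        W.append(w)
        P.append(p)
        q.append(qa)
    if not W:
        return np.zeros(X.shape[1])
    W = np.column_stack(W)
    P = np.column_stack(P)
    return W @ np.linalg.solve(P.T @ W, np.array(q))


def bootstrap_coefficients(X, y, n_components, n_boot, rng):
    """Refit the PLS model on n_boot resamples (rows drawn with replacement)."""
    n = X.shape[0]
    coefs = np.empty((n_boot, X.shape[1]))
    for b in range(n_boot):
        idx = rng.integers(0, n, size=n)
        coefs[b] = pls1_coefficients(X[idx], y[idx], n_components)
    return coefs


def select_intervals(coefs, n_intervals, alpha=0.05, min_fraction=0.5):
    """Boolean mask over variables; the interval rule here is an assumption."""
    lo, hi = np.percentile(coefs, [100 * alpha / 2, 100 * (1 - alpha / 2)], axis=0)
    significant = (lo > 0) | (hi < 0)   # percentile interval excludes zero
    mask = np.zeros(coefs.shape[1], dtype=bool)
    for block in np.array_split(np.arange(coefs.shape[1]), n_intervals):
        if significant[block].mean() >= min_fraction:
            mask[block] = True
    return mask


# Synthetic stand-in for spectra: only variables 10..19 carry signal.
rng = np.random.default_rng(0)
X = rng.normal(size=(60, 40))
y = X[:, 10:20].sum(axis=1) + 0.1 * rng.normal(size=60)

coefs = bootstrap_coefficients(X, y, n_components=3, n_boot=200, rng=rng)
mask = select_intervals(coefs, n_intervals=4)
print(mask.reshape(4, 10).any(axis=1))  # which of the 4 intervals were kept
```

A model refitted on `X[:, mask]` could then be compared with the full-spectrum model by RMSEP on held-out samples, which is the kind of comparison the abstract reports.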
Pages: 695-700
Page count: 6