Variable and subset selection in PLS regression

被引:222
|
作者
Höskuldsson, A [1 ]
机构
[1] Tech Univ Denmark, DK-2800 Lyngby, Denmark
关键词
variable selection; partial least squares (PLS); principal component analysis (PCA); H-principle; stepwise regression; Orthogonal Scatter Correction (OSC);
D O I
10.1016/S0169-7439(00)00113-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The purpose of this paper is to present some useful methods for introductory analysis of variables and subsets in relation to PLS regression. We present here methods that are efficient in finding the appropriate variables or subset to use in the PLS regression. The general conclusion is that variable selection is important for successful analysis of chemometric data. An important aspect of the results presented is that lack of variable selection can spoil the PLS regression, and that cross-validation measures using a test set can show larger variation, when we use different subsets of X, than obtained by different methods. We also present an approach to orthogonal scatter correction. The procedures and comparisons are applied to industrial data. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:23 / 38
页数:16
相关论文
共 50 条
  • [31] VARIABLE SELECTION IN QUANTILE REGRESSION
    Wu, Yichao
    Liu, Yufeng
    [J]. STATISTICA SINICA, 2009, 19 (02) : 801 - 817
  • [32] Variable Selection with Regression Trees
    Chang, Youngjae
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2010, 23 (02) : 357 - 366
  • [33] Variable Selection in ROC Regression
    Wang, Binhuan
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2013, 2013
  • [34] Variable Selection in PLS Discriminant Analysis via the Disco
    Simonetti, Biagio
    Lucadamo, Antonio
    Rodriguez, Maria R. G.
    [J]. CURRENT ANALYTICAL CHEMISTRY, 2012, 8 (02) : 266 - 272
  • [35] Relevance measures for subset variable selection in regression problems based on k-additive mutual information
    Kojadinovic, I
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2005, 49 (04) : 1205 - 1227
  • [36] Variable selection with stepwise and best subset approaches
    Zhang, Zhongheng
    [J]. ANNALS OF TRANSLATIONAL MEDICINE, 2016, 4 (07)
  • [37] A simple idea on applying large regression coefficient to improve the genetic algorithm-PLS for variable selection in multivariate calibration
    Yun, Yong-Huan
    Cao, Dong-Sheng
    Tan, Min-Li
    Yan, Jun
    Ren, Da-Bing
    Xu, Qing-Song
    Yu, Ling
    Liang, Yi-Zeng
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2014, 130 : 76 - 83
  • [38] Variable selection by modified IPW (iterative predictor weighting)-PLS (partial least squares) in continuous wavelet regression models
    Chen, D
    Hu, XG
    Shao, XG
    Su, QD
    [J]. ANALYST, 2004, 129 (07) : 664 - 669
  • [39] ON THE OPTIMALITY OF BACKWARD REGRESSION: SPARSE RECOVERY AND SUBSET SELECTION
    Ament, Sebastian
    Gomes, Carla
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5599 - 5603
  • [40] Conditional Uncorrelation and Efficient Subset Selection in Sparse Regression
    Wang, Jianji
    Zhang, Shupei
    Liu, Qi
    Du, Shaoyi
    Guo, Yu-Cheng
    Zheng, Nanning
    Wang, Fei-Yue
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (10) : 10458 - 10467