Feature selection using distributions of orthogonal PLS regression vectors in spectral data

Cited by: 0
Authors
Geonseok Lee
Kichun Lee
Affiliations
[1] Industrial Engineering
[2] Hanyang University
Source
Keywords
Feature selection; PLS; Orthogonal signal correction; Regression vector; Permutation test
DOI
Not available
Chinese Library Classification (CLC) number
Subject classification code
Abstract
Feature selection, which is important for the successful analysis of chemometric data, aims to produce parsimonious and predictive models. Partial least squares (PLS) regression is one of the main methods in chemometrics for analyzing multivariate data with input X and response Y by modeling the covariance structure in the X and Y spaces. Recently, orthogonal projections to latent structures (OPLS) has been widely used in processing multivariate data because OPLS improves the interpretability of PLS models by removing systematic variation in the X space that is not correlated with Y. The purpose of this paper is to present a feature selection method for multivariate data based on orthogonal PLS regression (OPLSR), which combines orthogonal signal correction with PLS. The presented method generates empirical distributions of the features' effects on Y in OPLSR vectors via permutation tests and examines the significance of the effects of the input features on Y. We evaluate the performance of the proposed method in a simulation study in which a three-layer network structure exists, comparing it with the false discovery rate method. To demonstrate the method, we apply it to both real-life NIR spectra data and mass spectrometry data.
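The permutation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' OPLSR procedure: the orthogonal signal correction step is omitted and a plain single-response PLS (NIPALS) regression vector stands in for the OPLSR vector. The function names `pls1_regression_vector` and `permutation_pvalues`, the synthetic data, and the 0.05 threshold are all assumptions made for illustration. The sketch permutes Y to build an empirical null distribution of each feature's regression coefficient, then compares the observed coefficient against it.

```python
import numpy as np


def pls1_regression_vector(X, y, n_components=1):
    """Regression vector of a single-response PLS model (NIPALS).

    Stand-in for the OPLSR vector; no orthogonal signal correction is applied.
    """
    Xk = X - X.mean(axis=0)          # column-centered predictors
    yc = y - y.mean()                # centered response
    W, P, q = [], [], []
    for _ in range(n_components):
        w = Xk.T @ yc                # weight: covariance of each feature with y
        w = w / np.linalg.norm(w)
        t = Xk @ w                   # score vector
        tt = t @ t
        p = Xk.T @ t / tt            # X loading
        W.append(w)
        P.append(p)
        q.append(yc @ t / tt)        # y loading
        Xk = Xk - np.outer(t, p)     # deflate X
    W = np.column_stack(W)
    P = np.column_stack(P)
    return W @ np.linalg.solve(P.T @ W, np.array(q))


def permutation_pvalues(X, y, n_perm=200, n_components=1, seed=0):
    """Empirical two-sided p-value per feature from permuting y (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    b_obs = pls1_regression_vector(X, y, n_components)
    null = np.empty((n_perm, X.shape[1]))
    for i in range(n_perm):
        # Permuting y breaks any X-y association, giving a null coefficient vector.
        null[i] = pls1_regression_vector(X, rng.permutation(y), n_components)
    exceed = (np.abs(null) >= np.abs(b_obs)).sum(axis=0)
    return b_obs, (exceed + 1) / (n_perm + 1)


# Synthetic example (assumed data): only features 0 and 1 drive the response.
rng = np.random.default_rng(42)
X = rng.normal(size=(100, 20))
y = 3.0 * X[:, 0] - 2.0 * X[:, 1] + 0.1 * rng.normal(size=100)

b_obs, pvals = permutation_pvalues(X, y, n_perm=200, n_components=1, seed=0)
selected = np.flatnonzero(pvals < 0.05)  # features whose effect looks significant
print(selected)
```

Adding one to both the exceedance count and the number of permutations keeps every empirical p-value strictly positive, a common convention for permutation tests. No multiple-testing correction is applied here, so some uninformative features may fall below the threshold by chance.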
Related papers
50 in total
  • [1] Feature selection using distributions of orthogonal PLS regression vectors in spectral data
    Lee, Geonseok
    Lee, Kichun
    [J]. BIODATA MINING, 2021, 14 (01)
  • [2] Smooth PLS Regression for Spectral Data
    Kondylis, Athanasios
    [J]. REVSTAT-STATISTICAL JOURNAL, 2022, 20 (04) : 463 - 479
  • [3] Application of genetic algorithm-PLS for feature selection in spectral data sets
    Leardi, R
    [J]. JOURNAL OF CHEMOMETRICS, 2000, 14 (5-6) : 643 - 655
  • [4] Supervised Feature Selection With Orthogonal Regression and Feature Weighting
    Wu, Xia
    Xu, Xueyuan
    Liu, Jianhong
    Wang, Hailing
    Hu, Bin
    Nie, Feiping
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (05) : 1831 - 1838
  • [5] EEG FEATURE SELECTION USING ORTHOGONAL REGRESSION: APPLICATION TO EMOTION RECOGNITION
    Xu, Xueyuan
    Wei, Fulin
    Zhu, Zhiyuan
    Liu, Jianhong
    Wu, Xia
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020 : 1239 - 1243
  • [6] Sorting variables by using informative vectors as a strategy for feature selection in multivariate regression
    Teofilo, Reinaldo F.
    Martins, Joao Paulo A.
    Ferreira, Marcia M. C.
    [J]. JOURNAL OF CHEMOMETRICS, 2009, 23 (1-2) : 32 - 48
  • [7] FEATURE SELECTION UNDER ORTHOGONAL REGRESSION WITH REDUNDANCY MINIMIZING
    Xu, Xueyuan
    Wu, Xia
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020 : 3457 - 3461
  • [8] Regularized Feature Selection in Categorical PLS for Multicollinear Data
    Mehmood, Tahir
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [9] A bootstrap-based strategy for spectral interval selection in PLS regression
    Bras, Ligia P.
    Lopes, Marta
    Ferreira, Ana P.
    Menezes, Jose C.
    [J]. JOURNAL OF CHEMOMETRICS, 2008, 22 (11-12) : 695 - 700
  • [10] Kernel feature selection with side data using a spectral approach
    Shashua, A
    Wolf, L
    [J]. COMPUTER VISION - ECCV 2004, PT 3, 2004, 3023 : 39 - 53