Clustered Variable Selection by Regularized Elimination in PLS

Cited by: 2
Authors
Mehmood, Tahir [1 ]
Snipen, Lars [1 ]
Affiliation
[1] Norwegian Univ Life Sci, Dept Chem Biotechnol & Food Sci, As, Norway
Source
NEW PERSPECTIVES IN PARTIAL LEAST SQUARES AND RELATED METHODS | 2013 / Vol. 56
Keywords
Regularization; High-dimension; Collinearity; Clustering; Power; Parameter estimation; LEAST-SQUARES REGRESSION;
DOI
10.1007/978-1-4614-8283-3_5
Chinese Library Classification
O1 [Mathematics];
Discipline classification code
0701 ; 070101 ;
Abstract
Variable selection is a crucial issue in many sciences, including modern biology, where one example is the selection of genomic markers for classification (diagnosing diseases, recognizing pathogenic bacteria, etc.). The task is complicated because biological variables are generally correlated; genes, for example, tend to be correlated when they share common biological functions. Selecting individual variables may dissolve such group effects and misdirect attention to a single variable instead of a variable cluster. We study the selection and estimation properties of variable clusters in high-dimensional settings where the number of variables exceeds the sample size. To address the issue, a regularized elimination procedure in multiblock PLS (mbPLS) is used, in which highly correlated variables are clustered together and whole groups are selected if they establish a relation with the response.
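The idea of clustering correlated variables and then selecting whole clusters can be illustrated with a small sketch. It is not the authors' mbPLS procedure: it replaces the regularized elimination with a single ranking step, scoring each cluster by the mean squared weight of its variables in the first PLS component (which, for standardized data, is proportional to the correlation of each variable with the response). The function names, the correlation threshold of 0.8, the `keep` parameter, and the simulated data are all assumptions made for illustration.

```python
import math
import random


def standardize(col):
    """Center a column and scale it to unit length, so dot products are correlations.
    Assumes the column is not constant (norm > 0)."""
    m = sum(col) / len(col)
    centered = [v - m for v in col]
    norm = math.sqrt(sum(v * v for v in centered))
    return [v / norm for v in centered]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cluster_variables(zcols, threshold):
    """Group variables linked by |correlation| > threshold (connected components)."""
    p = len(zcols)
    parent = list(range(p))  # union-find forest

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(p):
        for j in range(i + 1, p):
            if abs(dot(zcols[i], zcols[j])) > threshold:
                parent[find(j)] = find(i)
    clusters = {}
    for i in range(p):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


def select_clusters(cols, y, threshold=0.8, keep=1):
    """Rank clusters by mean squared first-component PLS weight; keep the top ones."""
    zcols = [standardize(c) for c in cols]
    zy = standardize(y)
    weights = [dot(c, zy) for c in zcols]  # first PLS weight vector, up to scaling
    clusters = cluster_variables(zcols, threshold)

    def score(members):
        return sum(weights[j] ** 2 for j in members) / len(members)

    ranked = sorted(clusters, key=score, reverse=True)
    return ranked[:keep], clusters


# Simulated data (hypothetical): 50 samples, 60 variables, so p > n.
# Variables 0-14 form three correlated groups of five; only the first
# group (variables 0-4) is related to the response.
rng = random.Random(0)
n, p = 50, 60
latent = [[rng.gauss(0, 1) for _ in range(n)] for _ in range(3)]
cols = []
for j in range(p):
    if j < 15:
        g = j // 5
        cols.append([latent[g][i] + 0.2 * rng.gauss(0, 1) for i in range(n)])
    else:
        cols.append([rng.gauss(0, 1) for _ in range(n)])
y = [latent[0][i] + 0.3 * rng.gauss(0, 1) for i in range(n)]

selected, clusters = select_clusters(cols, y)
print("selected clusters:", selected)
print("number of clusters:", len(clusters))
```

Because selection operates on clusters instead of single variables, the five correlated variables behind the response are kept or dropped together, which is the group effect the abstract is concerned with. A fuller treatment would refit a multi-component model after each elimination step and choose the stopping point by validation error.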
Pages: 95-105
Page count: 11