Clustered Variable Selection by Regularized Elimination in PLS

被引:2
|
作者
Mehmood, Tahir [1 ]
Snipen, Lars [1 ]
机构
[1] Norwegian Univ Life Sci, Dept Chem Biotechnol & Food Sci, As, Norway
来源
NEW PERSPECTIVES IN PARTIAL LEAST SQUARES AND RELATED METHODS | 2013年 / 56卷
关键词
Regularization; High-dimension; Collinearity; Clustering; Power; Parameter estimation; LEAST-SQUARES REGRESSION;
D O I
10.1007/978-1-4614-8283-3_5
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Variable selection is a crucial issue in many sciences, including modern biology, where an example is the selection of genomic markers for classification (diagnosing diseases, recognizing pathogenic bacteria, etc.). This becomes complicated as biological variables are in general correlated. For example, genes may be easily correlated, if they provide common biological functions. Variable selection may dissolve the group effects and mislead the focus onto a specific variable instead of a variable cluster. We study the selection and estimation properties of variable clusters in high dimensional settings when the number of variables exceeds the sample size. To address the issue a regularized elimination procedure in multiblock-PLS (mbPLS) is used, where highly correlated variables are clustered together, and whole groups are selected if they establish a relation with the response.
引用
收藏
页码:95 / 105
页数:11
相关论文
共 50 条
  • [41] Variable-weighted PLS
    Xu, Lu
    Jiang, Jian-Hui
    Wu, Hal-Long
    Shen, Guo-Li
    Yu, Ru-Qin
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2007, 85 (01) : 140 - 143
  • [42] Penalized variable selection in copula survival models for clustered time-to-event data
    Kwon, Sookhee
    Ha, Il Do
    Kim, Jong-Min
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2020, 90 (04) : 657 - 675
  • [43] A new method for clustered survival data: Estimation of treatment effect heterogeneity and variable selection
    Hu, Liangyuan
    BIOMETRICAL JOURNAL, 2024, 66 (01)
  • [44] Regularized win ratio regression for variable selection and risk prediction, with an application to a cardiovascular trial
    Lu Mao
    BMC Medical Research Methodology, 25 (1)
  • [45] Variable Selection for Varying Coefficient Models Via Kernel Based Regularized Rank Regression
    Wang, Kang-ning
    Lin, Lu
    ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2020, 36 (02): : 458 - 470
  • [46] Efficient Regularized Regression with L0 Penalty for Variable Selection and Network Construction
    Liu, Zhenqiu
    Li, Gang
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2016, 2016
  • [47] Variable Selection for Varying Coefficient Models Via Kernel Based Regularized Rank Regression
    Kang-ning Wang
    Lu Lin
    Acta Mathematicae Applicatae Sinica, English Series, 2020, 36 : 458 - 470
  • [48] Variable Selection for Varying Coefficient Models Via Kernel Based Regularized Rank Regression
    Kang-ning WANG
    Lu LIN
    ActaMathematicaeApplicataeSinica, 2020, 36 (02) : 458 - 470
  • [49] Comparative study of L1 regularized logistic regression methods for variable selection
    El Guide, M.
    Jbilou, K.
    Koukouvinos, C.
    Lappa, A.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (09) : 4957 - 4972
  • [50] Dimension reduction and variable selection in case control studies via regularized likelihood optimization
    Buneo, Florentino
    Barbu, Adrian
    ELECTRONIC JOURNAL OF STATISTICS, 2009, 3 : 1257 - 1287