共 50 条
Clustered Variable Selection by Regularized Elimination in PLS
被引:2
|作者:
Mehmood, Tahir
[1
]
Snipen, Lars
[1
]
机构:
[1] Norwegian Univ Life Sci, Dept Chem Biotechnol & Food Sci, As, Norway
来源:
NEW PERSPECTIVES IN PARTIAL LEAST SQUARES AND RELATED METHODS
|
2013年
/
56卷
关键词:
Regularization;
High-dimension;
Collinearity;
Clustering;
Power;
Parameter estimation;
LEAST-SQUARES REGRESSION;
D O I:
10.1007/978-1-4614-8283-3_5
中图分类号:
O1 [数学];
学科分类号:
0701 ;
070101 ;
摘要:
Variable selection is a crucial issue in many sciences, including modern biology, where an example is the selection of genomic markers for classification (diagnosing diseases, recognizing pathogenic bacteria, etc.). This becomes complicated as biological variables are in general correlated. For example, genes may be easily correlated, if they provide common biological functions. Variable selection may dissolve the group effects and mislead the focus onto a specific variable instead of a variable cluster. We study the selection and estimation properties of variable clusters in high dimensional settings when the number of variables exceeds the sample size. To address the issue a regularized elimination procedure in multiblock-PLS (mbPLS) is used, where highly correlated variables are clustered together, and whole groups are selected if they establish a relation with the response.
引用
收藏
页码:95 / 105
页数:11
相关论文