Regularized Multivariate Analysis Framework for Interpretable High-Dimensional Variable Selection

被引:9
|
作者
Munoz-Romero, Sergio [1 ]
Gomez-Verdejo, Vanessa [2 ]
Arenas-Garcia, Jernimo [2 ]
机构
[1] Univ Rey Juan Carlos, Dept Signal Proc & Commun, Madrid, Spain
[2] Univ Carlos III Madrid, Dept Signal Proc & Commun, E-28903 Getafe, Spain
关键词
SPARSE; REGRESSION;
D O I
10.1109/MCI.2016.2601701
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multivariate Analysis (MVA) comprises a family of well-known methods for feature extraction which exploit correlations among input variables representing the data. One important property that is enjoyed by most such methods is uncorrelation among the extracted features. Recently, regularized versions of MVA methods have appeared in the literature, mainly with the goal to gain interpretability of the solution. In these cases, the solutions can no longer be obtained in a closed manner, and more complex optimization methods that rely on the iteration of two steps are frequently used. This paper recurs to an alternative approach to solve efficiently this iterative problem. The main novelty of this approach lies in preserving several properties of the original methods, most notably the uncorrelation of the extracted features. Under this framework, we propose a novel method that takes advantage of the,2,1 norm to perform variable selection during the feature extraction process. Experimental results over different problems corroborate the advantages of the proposed formulation in comparison to state of the art formulations.
引用
收藏
页码:24 / 35
页数:12
相关论文
共 50 条
  • [1] HIGH-DIMENSIONAL VARIABLE SELECTION
    Wasserman, Larry
    Roeder, Kathryn
    ANNALS OF STATISTICS, 2009, 37 (5A): : 2178 - 2201
  • [2] Variable selection in multivariate linear models with high-dimensional covariance matrix estimation
    Perrot-Dockes, Marie
    Levy-Leduc, Celine
    Sansonnet, Laure
    Chiquet, Julien
    JOURNAL OF MULTIVARIATE ANALYSIS, 2018, 166 : 78 - 97
  • [3] The EAS approach to variable selection for multivariate response data in high-dimensional settings
    Koner, Salil
    Williams, Jonathan P.
    ELECTRONIC JOURNAL OF STATISTICS, 2023, 17 (02): : 1947 - 1995
  • [4] PALLADIO: a parallel framework for robust variable selection in high-dimensional data
    Barbieri, Matteo
    Fiorini, Samuele
    Tomasi, Federico
    Barla, Annalisa
    PROCEEDINGS OF PYHPC2016: 6TH WORKSHOP ON PYTHON FOR HIGH-PERFORMANCE AND SCIENTIFIC COMPUTING, 2016, : 19 - 26
  • [5] Variable selection in high-dimensional multivariate binary data with application to the analysis of microbial community DNA fingerprints
    Wilbur, JD
    Ghosh, JK
    Nakatsu, CH
    Brouder, SM
    Doerge, RW
    BIOMETRICS, 2002, 58 (02) : 378 - 386
  • [6] Variable selection and subgroup analysis for high-dimensional censored data
    Zhang, Yu
    Wang, Jiangli
    Zhang, Weiping
    STATISTICAL THEORY AND RELATED FIELDS, 2024, 8 (03) : 211 - 231
  • [7] FACTOR MODELS AND VARIABLE SELECTION IN HIGH-DIMENSIONAL REGRESSION ANALYSIS
    Kneip, Alois
    Sarda, Pascal
    ANNALS OF STATISTICS, 2011, 39 (05): : 2410 - 2447
  • [8] Variable selection and estimation in high-dimensional models
    Horowitz, Joel L.
    CANADIAN JOURNAL OF ECONOMICS-REVUE CANADIENNE D ECONOMIQUE, 2015, 48 (02): : 389 - 407
  • [9] Variable selection for high-dimensional incomplete data
    Liang, Lixing
    Zhuang, Yipeng
    Yu, Philip L. H.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2024, 192
  • [10] High-dimensional multivariate probit analysis
    Bock, RD
    Gibbons, RD
    BIOMETRICS, 1996, 52 (04) : 1183 - 1194