Optimal variable selection in multi-group sparse discriminant analysis

被引:7
|
作者
Gaynanova, Irina [1 ]
Kolar, Mladen [2 ]
机构
[1] Texas A&M Univ, Dept Stat, College Stn, TX 77843 USA
[2] Univ Chicago, Booth Sch Business, Chicago, IL 60637 USA
来源
ELECTRONIC JOURNAL OF STATISTICS | 2015年 / 9卷 / 02期
关键词
Classification; Fisher's discriminant analysis; group penalization; high-dimensional statistics; MODEL SELECTION; CLASSIFICATION; CENTROIDS; RECOVERY;
D O I
10.1214/15-EJS1064
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This article considers the problem of multi-group classification in the setting where the number of variables p is larger than the number of observations n. Several methods have been proposed in the literature that address this problem, however their variable selection performance is either unknown or suboptimal to the results known in the two-group case. In this work we provide sharp conditions for the consistent recovery of relevant variables in the multi-group case using the discriminant analysis proposal of Gaynanova et al. [7]. We achieve the rates of convergence that attain the optimal scaling of the sample size n, number of variables p and the sparsity level s. These rates are significantly faster than the best known results in the multi-group case. Moreover, they coincide with the minimax optimal rates for the two-group case. We validate our theoretical results with numerical analysis.
引用
收藏
页码:2007 / 2034
页数:28
相关论文
共 50 条
  • [31] Sparse optimal discriminant clustering
    Wang, Yanhong
    Fang, Yixin
    Wang, Junhui
    STATISTICS AND COMPUTING, 2016, 26 (03) : 629 - 639
  • [32] Erratum to: A doubly sparse approach for group variable selection
    Sunghoon Kwon
    Jeongyoun Ahn
    Woncheol Jang
    Sangin Lee
    Yongdai Kim
    Annals of the Institute of Statistical Mathematics, 2017, 69 : 1027 - 1027
  • [33] Multi-Group Tensor Canonical Correlation Analysis
    Zhou, Zhuoping
    Tong, Boning
    Tarzanagh, Davoud Ataee
    Hou, Bojian
    Saykin, Andrew J.
    Long, Qi
    Shen, Li
    14TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, BCB 2023, 2023,
  • [34] Model Selection with Lasso in Multi-group Structural Equation Models
    Lindstrom, Jonas Christoffer
    Dahl, Fredrik A.
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2020, 27 (01) : 33 - 42
  • [35] Multi-Group Multicast Beamforming: Optimal Structure and Efficient Algorithms
    Dong, Min
    Wang, Qiqi
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 (68) : 3738 - 3753
  • [36] PREDICTORS OF SEEKING CARE: A MULTI-GROUP ANALYSIS
    May, April
    Casteel, Danielle
    Cronan, Terry A.
    ANNALS OF BEHAVIORAL MEDICINE, 2016, 50 : S33 - S33
  • [37] SPOT: Sparse Optimal Transformations for High Dimensional Variable Selection and Exploratory Regression Analysis
    Huang, Qiming
    Zhu, Michael
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 857 - 865
  • [38] Variable Selection in PLS Discriminant Analysis via the Disco
    Simonetti, Biagio
    Lucadamo, Antonio
    Rodriguez, Maria R. G.
    CURRENT ANALYTICAL CHEMISTRY, 2012, 8 (02) : 266 - 272
  • [39] DALASS: Variable selection in discriminant analysis via the LASSO
    Trendafilov, Nickolay T.
    Jolliffe, Ian T.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 51 (08) : 3718 - 3736
  • [40] Variable selection in model-based discriminant analysis
    Maugis, C.
    Celeux, G.
    Martin-Magniette, M-L
    JOURNAL OF MULTIVARIATE ANALYSIS, 2011, 102 (10) : 1374 - 1387