Penalized classification using Fisher's linear discriminant

被引:254
|
作者
Witten, Daniela M. [1 ]
Tibshirani, Robert [2 ]
机构
[1] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[2] Stanford Univ, Stanford, CA 94305 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Classification; Feature selection; High dimensional problems; Lasso; Linear discriminant analysis; Supervised learning; MULTICLASS CANCER-DIAGNOSIS; SHRUNKEN CENTROIDS; VARIABLE SELECTION; OPTIMIZATION; PREDICTION; REGRESSION;
D O I
10.1111/j.1467-9868.2011.00783.x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider the supervised classification setting, in which the data consist of p features measured on n observations, each of which belongs to one of K classes. Linear discriminant analysis (LDA) is a classical method for this problem. However, in the high dimensional setting where p >> n, LDA is not appropriate for two reasons. First, the standard estimate for the within-class covariance matrix is singular, and so the usual discriminant rule cannot be applied. Second, when p is large, it is difficult to interpret the classification rule that is obtained from LDA, since it involves all p features. We propose penalized LDA, which is a general approach for penalizing the discriminant vectors in Fisher's discriminant problem in a way that leads to greater interpretability. The discriminant problem is not convex, so we use a minorization-maximization approach to optimize it efficiently when convex penalties are applied to the discriminant vectors. In particular, we consider the use of L-1 and fused lasso penalties. Our proposal is equivalent to recasting Fisher's discriminant problem as a biconvex problem. We evaluate the performances of the resulting methods on a simulation study, and on three gene expression data sets. We also survey past methods for extending LDA to the high dimensional setting and explore their relationships with our proposal.
引用
收藏
页码:753 / 772
页数:20
相关论文
共 50 条
  • [1] Classification of MEG signals using Fisher linear discriminant analysis
    [J]. Zhao, H.-B. (hbzhao@mail.neu.edu.cn), 1695, Northeast University (34):
  • [2] The Fisher's Linear Discriminant
    Iatan, Iuliana F.
    [J]. COMBINING SOFT COMPUTING AND STATISTICAL METHODS IN DATA ANALYSIS, 2010, 77 : 345 - 352
  • [3] Palmprint recognition using Fisher's linear discriminant
    Wu, XQ
    Wang, KQ
    Zhang, D
    [J]. 2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 3150 - 3154
  • [4] Classification of EEG-P300 Signals Using Fisher's Linear Discriminant Analysis
    Turnip, Arjon
    Widyotriatmo, Augie
    Suprijanto
    [J]. 2013 3RD INTERNATIONAL CONFERENCE ON INSTRUMENTATION CONTROL AND AUTOMATION (ICA 2013), 2013, : 98 - 103
  • [5] Classification of Encryption Algorithms using Fisher's Discriminant Analysis
    Ray, Prabhat Kumar
    Kant, Shri
    Roy, Bimal K.
    Basu, Ayanendranath
    [J]. DEFENCE SCIENCE JOURNAL, 2017, 67 (01) : 59 - 65
  • [6] Recursive dimensionality reduction using Fisher's Linear Discriminant
    Poston, WL
    Marchette, DJ
    [J]. PATTERN RECOGNITION, 1998, 31 (07) : 881 - 888
  • [7] Unsupervised constrained linear Fisher's discriminant analysis for hyperspectral image classification
    Ji, BH
    Chang, CI
    Jensen, JL
    Jensen, JO
    [J]. IMAGING SPECTROMETRY X, 2004, 5546 : 344 - 353
  • [8] Sparse Fisher's Linear Discriminant Analysis
    Siddiqui, Hasib
    Hwang, Hau
    [J]. COMPUTATIONAL IMAGING IX, 2011, 7873
  • [9] Microcalcifications Detection Using Fisher's Linear Discriminant and Breast Density
    Rodriguez, G. A.
    Gonzalez, J. A.
    Altamirano, L.
    Guichard, J. S.
    Diaz, R.
    [J]. SOFTWARE TOOLS AND ALGORITHMS FOR BIOLOGICAL SYSTEMS, 2011, 696 : 451 - 459
  • [10] Two Class Fisher's Linear Discriminant Analysis Using MapReduce
    Anchalia, Prajesh P.
    Roy, Kaushik
    Roy, Kunal
    [J]. 2015 17TH UKSIM-AMSS INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION (UKSIM), 2015, : 463 - 469