Sparse principal component analysis by choice of norm

被引:26
|
作者
Qi, Xin [1 ]
Luo, Ruiyan [1 ]
Zhao, Hongyu [2 ]
机构
[1] Georgia State Univ, Dept Math & Stat, Atlanta, GA 30303 USA
[2] Yale Univ, Dept Epidemiol & Publ Hlth, New Haven, CT 06520 USA
关键词
Sparse principal component analysis; High-dimensional data; Uncorrelated or orthogonal principal components; Iterative algorithm; Consistency in high dimension;
D O I
10.1016/j.jmva.2012.07.004
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Recent years have seen the developments of several methods for sparse principal component analysis due to its importance in the analysis of high dimensional data. Despite the demonstration of their usefulness in practical applications, they are limited in terms of lack of orthogonality in the loadings (coefficients) of different principal components, the existence of correlation in the principal components, the expensive computation needed, and the lack of theoretical results such as consistency in high-dimensional situations. In this paper, we propose a new sparse principal component analysis method by introducing a new norm to replace the usual norm in traditional eigenvalue problems, and propose an efficient iterative algorithm to solve the optimization problems. With this method, we can efficiently obtain uncorrelated principal components or orthogonal loadings, and achieve the goal of explaining a high percentage of variations with sparse linear combinations. Due to the strict convexity of the new norm, we can prove the convergence of the iterative method and provide the detailed characterization of the limits. We also prove that the obtained principal component is consistent for a single component model in high dimensional situations. As illustration, we apply this method to real gene expression data with competitive results. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:127 / 160
页数:34
相关论文
共 50 条
  • [1] Sparse Nuclear Norm Two Dimensional Principal Component Analysis
    Chen, Yudong
    Lai, Zhihui
    Zhang, Ye
    [J]. BIOMETRIC RECOGNITION, 2016, 9967 : 547 - 555
  • [2] Nuclear norm based two-dimensional sparse principal component analysis
    Chen, Yudong
    Lai, Zhihui
    Wen, Jiajun
    Gao, Can
    [J]. INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2018, 16 (02)
  • [3] OPTIMAL SPARSE L1-NORM PRINCIPAL-COMPONENT ANALYSIS
    Chamadia, Shubham
    Pados, Dimitris A.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2686 - 2690
  • [4] Sparse principal component analysis
    Zou, Hui
    Hastie, Trevor
    Tibshirani, Robert
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2006, 15 (02) : 265 - 286
  • [5] Principal component analysis in an asymmetric norm
    Tran, Ngoc M.
    Burdejova, Petra
    Ospienko, Maria
    Haerdle, Wolfgang K.
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 171 : 1 - 21
  • [6] Robust sparse principal component analysis
    ZHAO Qian
    MENG DeYu
    XU ZongBen
    [J]. Science China(Information Sciences), 2014, 57 (09) : 175 - 188
  • [7] Multilinear Sparse Principal Component Analysis
    Lai, Zhihui
    Xu, Yong
    Chen, Qingcai
    Yang, Jian
    Zhang, David
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (10) : 1942 - 1950
  • [8] Robust Sparse Principal Component Analysis
    Croux, Christophe
    Filzmoser, Peter
    Fritz, Heinrich
    [J]. TECHNOMETRICS, 2013, 55 (02) : 202 - 214
  • [9] Robust sparse principal component analysis
    Zhao Qian
    Meng DeYu
    Xu ZongBen
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2014, 57 (09) : 1 - 14
  • [10] Streaming Sparse Principal Component Analysis
    Yang, Wenzhuo
    Xu, Huan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 494 - 503