Stability approach to selecting the number of principal components

被引:4
|
作者
Song, Jiyeon [1 ]
Shin, Seung Jun [1 ]
机构
[1] Korea Univ, Dept Stat, 45 Anam Ro, Seoul 02841, South Korea
基金
新加坡国家研究基金会;
关键词
Principal component analysis; Stability selection; Structural dimension; Subsampling; SLICED INVERSE REGRESSION; CHOICE;
D O I
10.1007/s00180-018-0826-7
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Principal component analysis (PCA) is a canonical tool that reduces data dimensionality by finding linear transformations that project the data into a lower dimensional subspace while preserving the variability of the data. Selecting the number of principal components (PC) is essential but challenging for PCA since it represents an unsupervised learning problem without a clear target label at the sample level. In this article, we propose a new method to determine the optimal number of PCs based on the stability of the space spanned by PCs. A series of analyses with both synthetic data and real data demonstrates the superior performance of the proposed method.
引用
收藏
页码:1923 / 1938
页数:16
相关论文
共 50 条
  • [1] Stability approach to selecting the number of principal components
    Jiyeon Song
    Seung Jun Shin
    Computational Statistics, 2018, 33 : 1923 - 1938
  • [2] Selecting the Number of Principal Components with SURE
    Ulfarsson, Magnus O.
    Solo, Victor
    IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (02) : 239 - 243
  • [3] Selecting the Number of Principal Components in Functional Data
    Li, Yehua
    Wang, Naisyin
    Carroll, Raymond J.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (504) : 1284 - 1294
  • [4] A Corrected Criterion for Selecting the Optimum Number of Principal Components
    Kazianka, Hannes
    Pilz, Juergen
    AUSTRIAN JOURNAL OF STATISTICS, 2009, 38 (03) : 135 - 150
  • [5] SELECTING THE NUMBER OF PRINCIPAL COMPONENTS: ESTIMATION OF THE TRUE RANK OF A NOISY MATRIX
    Choi, Yunjin
    Taylor, Jonathan
    Tibshirani, Robert
    ANNALS OF STATISTICS, 2017, 45 (06): : 2590 - 2617
  • [6] SELECTING PRINCIPAL COMPONENTS IN REGRESSION
    MASON, RL
    GUNST, RF
    STATISTICS & PROBABILITY LETTERS, 1985, 3 (06) : 299 - 301
  • [7] Selecting the Number of Principal Components on the Basis of Performance Optimization of Fault Detection and Identification
    Xuan, Jiyang
    Xu, Zhengguo
    Sun, Youxian
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2015, 54 (12) : 3145 - 3153
  • [8] Selecting the number of components in principal component analysis using cross-validation approximations
    Josse, Julie
    Husson, Francois
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (06) : 1869 - 1879
  • [9] Stability of principal components
    Al-Ibrahim, A. H.
    Al-Kandari, Noriah M.
    COMPUTATIONAL STATISTICS, 2008, 23 (01) : 153 - 171
  • [10] Stability of principal components
    A. H. Al-Ibrahim
    Noriah M. Al-Kandari
    Computational Statistics, 2008, 23 : 153 - 171