PCA Consistency for Non-Gaussian Data in High Dimension, Low Sample Size Context

Cited by: 33
Authors
Yata, Kazuyoshi [2]
Aoshima, Makoto [1]
Affiliations
[1] Univ Tsukuba, Inst Math, Tsukuba 3058571, Japan
[2] Univ Tsukuba, Grad Sch Pure & Appl Sci, Tsukuba 3058571, Japan
Funding
Japan Society for the Promotion of Science;
Keywords
Consistency; Dual covariance matrix; Eigenvalue distribution; HDLSS; Large p small n; Principal component analysis; Random matrix theory; Sample size; GEOMETRIC REPRESENTATION; COVARIANCE MATRICES; LARGEST EIGENVALUE;
DOI
10.1080/03610910902936083
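Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Subject classification codes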
020208 ; 070103 ; 0714 ;
Abstract
In this article, we investigate both sample eigenvalues and Principal Component (PC) directions along with PC scores when the dimension d and the sample size n both grow to infinity in such a way that n is much lower than d. We consider general settings that include the case when the eigenvalues are all in the range of sphericity. We do not assume either the normality or a ρ-mixing condition. We attempt finding a difference among the eigenvalues by choosing n with a suitable order of d. We give the consistency properties for both the sample eigenvalues and the PC directions along with the PC scores. We also show that the sample eigenvalue has a Gaussian limiting distribution when the population counterpart is of multiplicity one.
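The keywords mention the dual covariance matrix, which is the standard device for computing sample eigenvalues when d is much larger than n. The following is a minimal sketch of that idea, not the authors' procedure: the function name `dual_pca_eigenvalues`, the single-spike population model, the exponent 0.8, and the uniform (non-Gaussian) noise are all assumptions chosen for illustration. It relies only on the fact that the d x d sample covariance matrix and its n x n dual share the same nonzero eigenvalues.

```python
import numpy as np


def dual_pca_eigenvalues(X):
    """Sample eigenvalues via the n x n dual covariance matrix.

    X is a d x n data matrix whose columns are observations.
    Returns n eigenvalues in descending order; after centering,
    at most n - 1 of them are nonzero.
    """
    d, n = X.shape
    # Center each variable (row) across the n observations.
    Xc = X - X.mean(axis=1, keepdims=True)
    # Dual covariance matrix: n x n instead of d x d.
    S_dual = Xc.T @ Xc / (n - 1)
    # eigvalsh returns ascending eigenvalues of a symmetric matrix.
    return np.linalg.eigvalsh(S_dual)[::-1]


# Illustrative HDLSS setting (all values are assumptions).
rng = np.random.default_rng(0)
d, n = 2000, 20

# Hypothetical population eigenvalues: one spike of order d**0.8, rest 1.
lam = np.ones(d)
lam[0] = d ** 0.8

# Non-Gaussian entries: uniform on [-sqrt(3), sqrt(3)] has unit variance.
Z = rng.uniform(-np.sqrt(3), np.sqrt(3), size=(d, n))
X = np.sqrt(lam)[:, None] * Z

dual_vals = dual_pca_eigenvalues(X)
print("largest sample eigenvalue:", dual_vals[0])
print("population counterpart:   ", lam[0])
```

Working with the n x n matrix keeps the eigendecomposition cheap when n is small, which is the practical reason the dual form is used in this regime. Varying `d`, `n`, and the spike exponent in this sketch gives a rough feel for how the sample eigenvalue tracks its population counterpart, but it does not reproduce the conditions or results stated in the article.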
Pages: 2634-2652
Page count: 19