Spectrum estimation: A unified framework for covariance matrix estimation and PCA in large dimensions

被引:105
|
作者
Ledoit, Olivier [1 ]
Wolf, Michael [1 ]
机构
[1] Univ Zurich, Dept Econ, CH-8032 Zurich, Switzerland
关键词
Large-dimensional asymptotics; Covariance matrix eigenvalues; Nonlinear shrinkage; Principal component analysis; EMPIRICAL DISTRIBUTION; REGRESSION-MODELS; EIGENVALUES; ROBUST;
D O I
10.1016/j.jmva.2015.04.006
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Covariance matrix estimation and principal component analysis (PCA) are two cornerstones of multivariate analysis. Classic textbook solutions perform poorly when the dimension of the data is of a magnitude similar to the sample size, or even larger. In such settings, there is a common remedy for both statistical problems: nonlinear shrinkage of the eigenvalues of the sample covariance matrix. The optimal nonlinear shrinkage formula depends on unknown population quantities and is thus not available. It is, however, possible to consistently estimate an oracle nonlinear shrinkage, which is motivated on asymptotic grounds. A key tool to this end is consistent estimation of the set of eigenvalues of the population covariance matrix (also known as the spectrum), an interesting and challenging problem in its own right. Extensive Monte Carlo simulations demonstrate that our methods have desirable finite-sample properties and outperform previous proposals. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:360 / 384
页数:25
相关论文
共 50 条