Spectral analysis of high-dimensional sample covariance matrices with missing observations

被引:4
|
作者
Jurczak, Kamil [1 ]
Rohde, Angelika [1 ]
机构
[1] Ruhr Univ Bochum, Fak Math, D-44780 Bochum, Germany
关键词
almost sure convergence of extremal eigenvalues; characterization of positive definiteness; limiting spectral distribution; sample covariance matrix with missing observations; Stieltjes transform; EMPIRICAL DISTRIBUTION; LARGEST EIGENVALUE; LIMIT; THEOREM;
D O I
10.3150/16-BEJ815
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We study high-dimensional sample covariance matrices based on independent random vectors with missing coordinates. The presence of missing observations is common in modern applications such as climate studies or gene expression micro-arrays. A weak approximation on the spectral distribution in the "large dimension d and large sample size n" asymptotics is derived for possibly different observation probabilities in the coordinates. The spectral distribution turns out to be strongly influenced by the missingness mechanism. In the null case under the missing at random scenario where each component is observed with the same probability p, the limiting spectral distribution is a Marcenko-Pastur law shifted by (1 - p)/p to the left. As d/n -> y is an element of(0, 1), the almost sure convergence of the extremal eigenvalues to the respective boundary points of the support of the limiting spectral distribution is proved, which are explicitly given in terms of y and p. Eventually, the sample covariance matrix is positive definite if p is larger than 1 - (1 - root y)(2), whereas this is not true any longer if p is smaller than this quantity.
引用
收藏
页码:2466 / 2532
页数:67
相关论文
共 50 条
  • [1] High-dimensional covariance matrix estimation with missing observations
    Lounici, Karim
    [J]. BERNOULLI, 2014, 20 (03) : 1029 - 1058
  • [2] TWO SAMPLE TESTS FOR HIGH-DIMENSIONAL COVARIANCE MATRICES
    Li, Jun
    Chen, Song Xi
    [J]. ANNALS OF STATISTICS, 2012, 40 (02): : 908 - 940
  • [3] Weak Convergence of the Empirical Spectral Distribution of High-Dimensional Band Sample Covariance Matrices
    Jurczak, Kamil
    [J]. JOURNAL OF THEORETICAL PROBABILITY, 2018, 31 (03) : 1273 - 1302
  • [4] CLT for linear spectral statistics of high-dimensional sample covariance matrices in elliptical distributions
    Zhang, Yangchun
    Hu, Jiang
    Li, Weiming
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2022, 191
  • [5] Weak Convergence of the Empirical Spectral Distribution of High-Dimensional Band Sample Covariance Matrices
    Kamil Jurczak
    [J]. Journal of Theoretical Probability, 2018, 31 : 1273 - 1302
  • [6] Multi-sample test for high-dimensional covariance matrices
    Zhang, Chao
    Bai, Zhidong
    Hu, Jiang
    Wang, Chen
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2018, 47 (13) : 3161 - 3177
  • [7] High-dimensional sample covariance matrices with Curie-Weiss entries
    Fleermann, Michael
    Heiny, Johannes
    [J]. ALEA-LATIN AMERICAN JOURNAL OF PROBABILITY AND MATHEMATICAL STATISTICS, 2020, 17 (02): : 857 - 876
  • [8] Global one-sample tests for high-dimensional covariance matrices
    Wang, Xiaoyi
    Liu, Baisen
    Shi, Ning-Zhong
    Tian, Guo-Liang
    Zheng, Shurong
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2021, 91 (10) : 2051 - 2073
  • [9] On a model selection problem from high-dimensional sample covariance matrices
    Chen, J.
    Delyon, B.
    Yao, J-F
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2011, 102 (10) : 1388 - 1398
  • [10] Tests for High-Dimensional Covariance Matrices
    Chen, Song Xi
    Zhang, Li-Xin
    Zhong, Ping-Shou
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (490) : 810 - 819