Two-Sample Covariance Matrix Testing and Support Recovery in High-Dimensional and Sparse Settings

被引:151
|
作者
Cai, Tony [1 ]
Liu, Weidong [2 ,3 ]
Xia, Yin [1 ]
机构
[1] Univ Penn, Wharton Sch, Dept Stat, Philadelphia, PA 19104 USA
[2] Shanghai Jiao Tong Univ, Dept Math, Inst Nat Sci, Shanghai 200030, Peoples R China
[3] Shanghai Jiao Tong Univ, MOE LSC, Shanghai 200030, Peoples R China
基金
美国国家科学基金会;
关键词
Extreme value Type I distribution; Gene selection; Hypothesis testing; Sparsity; ASYMPTOTIC-DISTRIBUTION; EQUALITY; DISTRIBUTIONS; COHERENCE;
D O I
10.1080/01621459.2012.758041
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In the high-dimensional setting, this article considers three interrelated problems: (a) testing the equality of two covariance matrices Sigma(1) and Sigma(2); (b) recovering the support of Sigma(1) - Sigma(2); and (c) testing the equality of Sigma(1) and Sigma(2) row by row. We propose a new test for testing the hypothesis H-0: Sigma(1) = Sigma(2) and investigate its theoretical and numerical properties. The limiting null distribution of the test statistic is derived and the power of the test is studied. The test is shown to enjoy certain optimality and to be especially powerful against sparse alternatives. The simulation results show that the test significantly outperforms the existing methods both in terms of size and power. Analysis of a prostate cancer dataset is carried out to demonstrate the application of the testing procedures. When the null hypothesis of equal covariance matrices is rejected, it is often of significant interest to further investigate how they differ from each other. Motivated by applications in genomics, we also consider recovering the support of Sigma(1) - Sigma(2) and testing the equality of the two covariance matrices row by row. New procedures are introduced and their properties are studied. Applications to gene selection are also discussed. Supplementary materials for this article are available online.
引用
收藏
页码:265 / 277
页数:13
相关论文
共 50 条
  • [21] On the behaviour of the smallest eigenvalue of a high-dimensional sample covariance matrix
    Yaskov, P. A.
    [J]. RUSSIAN MATHEMATICAL SURVEYS, 2013, 68 (03) : 569 - 570
  • [22] TWO-SAMPLE TESTING OF HIGH-DIMENSIONAL LINEAR REGRESSION COEFFICIENTS VIA COMPLEMENTARY SKETCHING
    Gao, Fengnan
    Wang, Tengyao
    [J]. ANNALS OF STATISTICS, 2022, 50 (05): : 2950 - 2972
  • [23] A TWO-SAMPLE TEST FOR HIGH-DIMENSIONAL DATA WITH APPLICATIONS TO GENE-SET TESTING
    Chen, Song Xi
    Qin, Ying-Li
    [J]. ANNALS OF STATISTICS, 2010, 38 (02): : 808 - 835
  • [24] Local two-sample testing: a new tool for analysing high-dimensional astronomical data
    Freeman, P. E.
    Kim, I.
    Lee, A. B.
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2017, 471 (03) : 3273 - 3282
  • [25] Testing proportionality of two high-dimensional covariance matrices
    Cheng, Guanghui
    Liu, Baisen
    Tian, Guoliang
    Zheng, Shurong
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 150
  • [26] Testing the Number of Common Factors by Bootstrapped Sample Covariance Matrix in High-Dimensional Factor Models
    Yu, Long
    Zhao, Peng
    Zhou, Wang
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024,
  • [27] Small Populations, High-Dimensional Spaces: Sparse Covariance Matrix Adaptation
    Meyer-Nieberg, Silja
    Kropat, Erik
    [J]. PROCEEDINGS OF THE 2015 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2015, 5 : 525 - 535
  • [28] A simultaneous test of mean vector and covariance matrix in high-dimensional settings
    Cao, Mingxiang
    Sun, Peng
    Park, Junyong
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2021, 212 : 141 - 152
  • [29] HYPOTHESIS TESTING ON LINEAR STRUCTURES OF HIGH-DIMENSIONAL COVARIANCE MATRIX
    Zheng, Shurong
    Chen, Zhao
    Cui, Hengjian
    Li, Runze
    [J]. ANNALS OF STATISTICS, 2019, 47 (06): : 3300 - 3334
  • [30] SHARP OPTIMALITY FOR HIGH-DIMENSIONAL COVARIANCE TESTING UNDER SPARSE SIGNALS
    Chen, Song xi
    Qiu, Yumou
    Zhang, Shuyi
    [J]. ANNALS OF STATISTICS, 2023, 51 (05): : 1921 - 1945