TESTING INDEPENDENCE WITH HIGH-DIMENSIONAL CORRELATED SAMPLES

被引:7
|
作者
Chen, Xi [1 ]
Liu, Weidong [2 ,3 ]
机构
[1] NYU, Stern Sch Business, 44 West 4Th St, New York, NY 10012 USA
[2] Shanghai Jiao Tong Univ, Dept Math, Inst Nat Sci, Shanghai, Peoples R China
[3] Shanghai Jiao Tong Univ, MOE LSC, Shanghai, Peoples R China
来源
ANNALS OF STATISTICS | 2018年 / 46卷 / 02期
基金
澳大利亚研究理事会;
关键词
Independence test; multiple testing of correlations; false discovery rate; matrix-variate normal; quadratic functional estimation; high-dimensional sample correlation matrix; FALSE DISCOVERY RATE; COVARIANCE-MATRIX; PHASE-TRANSITION; OPTIMAL RATES; DISTRIBUTIONS; CONVERGENCE; COHERENCE; FRAMEWORK; STRENGTH; GENES;
D O I
10.1214/17-AOS1571
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Testing independence among a number of (ultra) high-dimensional random samples is a fundamental and challenging problem. By arranging n identically distributed p-dimensional random vectors into a p x n data matrix, we investigate the problem of testing independence among columns under the matrix-variate normal modeling of data. We propose a computationally simple and tuning-free test statistic, characterize its limiting null distribution, analyze the statistical power and prove its minimax optimality. As an important by-product of the test statistic, a ratio-consistent estimator for the quadratic functional of a covariance matrix from correlated samples is developed. We further study the effect of correlation among samples to an important high-dimensional inference problem-large-scale multiple testing of Pearson's correlation coefficients. Indeed, blindly using classical inference results based on the assumed independence of samples will lead to many false discoveries, which suggests the need for conducting independence testing before applying existing methods. To address the challenge arising from correlation among samples, we propose a "sandwich estimator" of Pearson's correlation coefficient by de-correlating the samples. Based on this approach, the resulting multiple testing procedure asymptotically controls the overall false discovery rate at the nominal level while maintaining good statistical power. Both simulated and real data experiments are carried out to demonstrate the advantages of the proposed methods.
引用
收藏
页码:866 / 894
页数:29
相关论文
共 50 条
  • [41] Testing linear hypotheses in high-dimensional regressions
    Bai, Zhidong
    Jiang, Dandan
    Yao, Jian-feng
    Zheng, Shurong
    [J]. STATISTICS, 2013, 47 (06) : 1207 - 1223
  • [42] PROPERTY TESTING IN HIGH-DIMENSIONAL ISING MODELS
    Neykov, Matey
    Liu, Han
    [J]. ANNALS OF STATISTICS, 2019, 47 (05): : 2472 - 2503
  • [43] Testing for High-Dimensional Geometry in Random Graphs
    Bubeck, Sebastien
    Ding, Jian
    Eldan, Ronen
    Racz, Miklos Z.
    [J]. RANDOM STRUCTURES & ALGORITHMS, 2016, 49 (03) : 503 - 532
  • [44] Private Identity Testing for High-Dimensional Distributions
    Canonne, Clement
    Kamath, Gautam
    McMillan, Audra
    Ullman, Jonathan
    Zakynthinou, Lydia
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [45] Testing for Trends in High-Dimensional Time Series
    Chen, Likai
    Wu, Wei Biao
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (526) : 869 - 881
  • [46] High-dimensional testing for proportional covariance matrices
    Tsukuda, Koji
    Matsuura, Shun
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 171 : 412 - 420
  • [47] Hypothesis testing for high-dimensional covariance matrices
    Li, Weiming
    Qin, Yingli
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2014, 128 : 108 - 119
  • [48] Robust Testing in High-Dimensional Sparse Models
    George, Anand Jerry
    Canonne, Clement L.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [49] TESTING HOMOGENEITY OF HIGH-DIMENSIONAL COVARIANCE MATRICES
    Zheng, Shurong
    Lin, Ruitao
    Guo, Jianhua
    Yin, Guosheng
    [J]. STATISTICA SINICA, 2020, 30 (01) : 35 - 53
  • [50] Testing high-dimensional mean vector with applications
    Zhang, Jin-Ting
    Zhou, Bu
    Guo, Jia
    [J]. STATISTICAL PAPERS, 2022, 63 (04) : 1105 - 1137