High-dimensional two-sample mean vectors test and support recovery with factor adjustment

被引:0
|
作者
He, Yong [1 ]
Zhang, Mingjuan [2 ]
Zhang, Xinsheng [3 ]
Zhou, Wang [4 ]
机构
[1] Shandong Univ, Inst Financial Studies, Jinan 250100, Peoples R China
[2] Shanghai Lixin Univ Accounting & Finance, Sch Stat & Math, Shanghai 201209, Peoples R China
[3] Fudan Univ, Sch Management, Shanghai 200433, Peoples R China
[4] Natl Univ Singapore, Dept Stat & Appl Probabil, Singapore 117546, Singapore
基金
美国国家科学基金会;
关键词
False discovery rate; High-dimensional; Multiple testing; Multiplier bootstrap; Two-sample test; FALSE DISCOVERY RATE; LARGE COVARIANCE ESTIMATION; PRINCIPAL-COMPONENTS; FEWER OBSERVATIONS; FACTOR MODELS; NUMBER; MATRIX; PROPORTION; BOOTSTRAP; RETURNS;
D O I
10.1016/j.csda.2020.107004
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Testing the equality of two mean vectors is a classical problem in multivariate analysis. In this article, we consider the test in the high-dimensional setting. Existing tests often assume that the covariance matrix (or its inverse) of the underlying variables is sparse, which is rarely true in social science due to the existence of latent common factors. In the article, we introduce a maximum-type test statistic based on the factor-adjusted data. The factor-adjustment step increases the signal-to-noise ratio and thus results in more powerful test. We obtain the limiting null distribution of the maximum-type test statistic, which is the extreme value distribution of type I. To overcome the well-known slow convergence rate of the test statistic's distribution to the limiting extreme value distribution, we also propose a multiplier bootstrap method to improve the finite-sample performance. In addition, a multiple testing procedure with false discovery rate (FDR) control is proposed for identifying specific locations that differ significantly between the two groups. Thorough numerical studies are conducted to show the superiority of the test over other state-of-the-art tests. The performance of the test is also assessed through a real stock market dataset. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Robust two-sample test of high-dimensional mean vectors under dependence
    Wang, Wei
    Lin, Nan
    Tang, Xiang
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 169 : 312 - 329
  • [2] A high-dimensional two-sample test for the mean using random subspaces
    Thulin, Mans
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 74 : 26 - 38
  • [3] Two-sample mean vector projection test in high-dimensional data
    Huang, Caizhu
    Cui, Xia
    Pagui, Euloge Clovis Kenne
    [J]. COMPUTATIONAL STATISTICS, 2024, 39 (03) : 1061 - 1091
  • [4] Two-sample mean vector projection test in high-dimensional data
    Caizhu Huang
    Xia Cui
    Euloge Clovis Kenne Pagui
    [J]. Computational Statistics, 2024, 39 : 1061 - 1091
  • [5] A note on high-dimensional two-sample test
    Feng, Long
    Sun, Fasheng
    [J]. STATISTICS & PROBABILITY LETTERS, 2015, 105 : 29 - 36
  • [6] Order test for high-dimensional two-sample means
    Lee, Sang H.
    Lim, Johan
    Li, Erning
    Vannucci, Marina
    Petkova, Eva
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2012, 142 (09) : 2719 - 2725
  • [7] An adaptive two-sample test for high-dimensional means
    Xu, Gongjun
    Lin, Lifeng
    Wei, Peng
    Pan, Wei
    [J]. BIOMETRIKA, 2016, 103 (03) : 609 - 624
  • [8] Two-Sample Covariance Matrix Testing and Support Recovery in High-Dimensional and Sparse Settings
    Cai, Tony
    Liu, Weidong
    Xia, Yin
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (501) : 265 - 277
  • [9] Two-sample test for sparse high-dimensional multinomial distributions
    Amanda Plunkett
    Junyong Park
    [J]. TEST, 2019, 28 : 804 - 826
  • [10] Empirical likelihood test for high-dimensional two-sample model
    Ciuperca, Gabriela
    Salloum, Zahraa
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2016, 178 : 37 - 60