Robust two-sample test of high-dimensional mean vectors under dependence

被引:5
|
作者
Wang, Wei [1 ]
Lin, Nan [1 ]
Tang, Xiang [1 ]
机构
[1] Washington Univ, Dept Math & Stat, St Louis, MO 63130 USA
基金
美国国家科学基金会;
关键词
Cell-wise contamination; Robust precision matrix estimation; Sparse and strong alternatives; Two-sample mean test; Trimmed mean; MATRIX ESTIMATION;
D O I
10.1016/j.jmva.2018.09.013
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A basic problem in modern multivariate analysis is testing the equality of two mean vectors in settings where the dimension p increases with the sample size n. This paper proposes a robust two-sample test for high-dimensional data against sparse and strong alternatives, in which the mean vectors of the populations differ in only a few dimensions, but the magnitude of the differences is large. The test is based on trimmed means and robust precision matrix estimators. The asymptotic joint distribution of the trimmed means is established, and the proposed test statistic is shown to have a Gumbel distribution in the limit. Simulation studies suggest that the numerical performance of the proposed test is comparable to that of non-robust tests for uncontaminated data. For cell-wise contaminated data, it outperforms non-robust tests. An illustration involves biomarker identification in an Alzheimer's disease dataset. (C) 2018 Elsevier Inc. All rights reserved.
引用
收藏
页码:312 / 329
页数:18
相关论文
共 50 条
  • [31] A ROBUST AND NONPARAMETRIC TWO-SAMPLE TEST IN HIGH DIMENSIONS
    Qiu, Tao
    Xu, Wangli
    Zhu, Liping
    STATISTICA SINICA, 2021, 31 (04) : 1853 - 1869
  • [32] Test on the linear combinations of mean vectors in high-dimensional data
    Huiqin Li
    Jiang Hu
    Zhidong Bai
    Yanqing Yin
    Kexin Zou
    TEST, 2017, 26 : 188 - 208
  • [33] Test on the linear combinations of mean vectors in high-dimensional data
    Li, Huiqin
    Hu, Jiang
    Bai, Zhidong
    Yin, Yanqing
    Zou, Kexin
    TEST, 2017, 26 (01) : 188 - 208
  • [34] Two-sample tests of high-dimensional means for compositional data
    Cao, Yuanpei
    Lin, Wei
    Li, Hongzhe
    BIOMETRIKA, 2018, 105 (01) : 115 - 132
  • [35] Two-sample tests for sparse high-dimensional binary data
    Plunkett, Amanda
    Park, Junyong
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (22) : 11181 - 11193
  • [36] Multi-sample hypothesis testing of high-dimensional mean vectors under covariance heterogeneity
    Wu, Lixiu
    Hu, Jiang
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2024, 76 (04) : 579 - 615
  • [37] High-dimensional two-sample precision matrices test: an adaptive approach through multiplier bootstrap
    Zhang, Mingjuan
    He, Yong
    Zhou, Cheng
    Zhang, Xinsheng
    STATISTICS AND ITS INTERFACE, 2020, 13 (01) : 37 - 48
  • [38] A robust mean and variance test with application to high-dimensional phenotypes
    James R. Staley
    Frank Windmeijer
    Matthew Suderman
    Matthew S. Lyon
    George Davey Smith
    Kate Tilling
    European Journal of Epidemiology, 2022, 37 : 377 - 387
  • [39] Multiple-Splitting Projection Test for High-Dimensional Mean Vectors
    Liu, Wanjun
    Yu, Xiufan
    Li, Runze
    Journal of Machine Learning Research, 2022, 23
  • [40] A robust mean and variance test with application to high-dimensional phenotypes
    Staley, James R.
    Windmeijer, Frank
    Suderman, Matthew
    Lyon, Matthew S.
    Davey Smith, George
    Tilling, Kate
    EUROPEAN JOURNAL OF EPIDEMIOLOGY, 2022, 37 (04) : 377 - 387