A ROBUST AND NONPARAMETRIC TWO-SAMPLE TEST IN HIGH DIMENSIONS

被引:1
|
作者
Qiu, Tao [1 ]
Xu, Wangli [1 ]
Zhu, Liping [1 ,2 ]
机构
[1] Renmin Univ China, Beijing, Peoples R China
[2] Zhejiang Gongshang Univ, Hangzhou, Zhejiang, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Cramer-von Mises test; equality of distributions; high dimension; homogeneity; two-sample test; U-statistics; MULTIVARIATE; DISTRIBUTIONS;
D O I
10.5705/ss.202019.0170
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Many tests are available to test the homogeneity of two random samples, that is, the exact equivalence of their statistical distributions. When the two random samples are high dimensional or not normally distributed, the asymptotic null distributions of most existing two-sample tests are rarely tractable. This limits their usefulness in high dimensions, even when the sample sizes are sufficiently large. In addition, existing tests require a careful selection of the tuning parameters to enhance their power performance. However, doing so is very challenging, especially in high dimensions. In this paper, we propose a robust and fully nonparametric two-sample test to detect the heterogeneity of two random samples. Our proposed test is free of tuning parameters. It is built upon the Cramer-von Mises distance, and can be readily used in high dimensions. In addition, our proposed test is robust to the presence of outliers or extreme values in that no moment condition is required. The asymptotic null distribution of our proposed test is standard normal when both the sample sizes and the dimensions of the two random samples diverge to infinity. This facilitates the implementation of our proposed test dramatically, in that no bootstrap or re-sampling technique has to be used to decide an appropriate critical value. We demonstrate the power performance of our proposed test through extensive simulations and real-world applications.
引用
收藏
页码:1853 / 1869
页数:17
相关论文
共 50 条
  • [1] A nonparametric two-sample test applicable to high dimensional data
    Biswas, Munmun
    Ghosh, Anil K.
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2014, 123 : 160 - 171
  • [2] A two-sample nonparametric likelihood ratio test
    Marsh, Patrick
    [J]. JOURNAL OF NONPARAMETRIC STATISTICS, 2010, 22 (08) : 1053 - 1065
  • [3] A two-sample nonparametric test with missing observations
    Lee, YJ
    [J]. AMERICAN JOURNAL OF MATHEMATICAL AND MANAGEMENT SCIENCES, VOL 17, NOS 1 AND 2, 1997: MULTIVARIATE STATISTICAL INFERENCE - MSI-2000L MULTIVARIATE STATISTICAL ANALYSIS IN HONOR OF PROFESSOR MINORU SIOTANI ON HIS 70TH BIRTHDAY, 1997, 17 (1&2): : 187 - 200
  • [4] A nonparametric test for the general two-sample problem
    Baumgartner, W
    Weiss, P
    Schindler, H
    [J]. BIOMETRICS, 1998, 54 (03) : 1129 - 1135
  • [5] Two-sample test in high dimensions through random selection
    Qiu, Tao
    Xu, Wangli
    Zhu, Liping
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2021, 160
  • [6] Robust nonparametric tests for the two-sample location problem
    Fried, Roland
    Dehling, Herold
    [J]. STATISTICAL METHODS AND APPLICATIONS, 2011, 20 (04): : 409 - 422
  • [7] Robust nonparametric tests for the two-sample location problem
    Roland Fried
    Herold Dehling
    [J]. Statistical Methods & Applications, 2011, 20 : 409 - 422
  • [8] Two-sample testing in high dimensions
    Stadler, Nicolas
    Mukherjee, Sach
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2017, 79 (01) : 225 - 246
  • [9] Efficient and adaptive nonparametric test for the two-sample problem
    Ducharme, GR
    Ledwina, T
    [J]. ANNALS OF STATISTICS, 2003, 31 (06): : 2036 - 2058
  • [10] Proposed nonparametric test for the mixed two-sample design
    Magel R.C.
    Fu R.
    [J]. Journal of Statistical Theory and Practice, 2014, 8 (2) : 221 - 237