Asymptotic normality of interpoint distances for high-dimensional data with applications to the two-sample problem

被引:17
|
作者
Li, Jun [1 ]
机构
[1] Univ Calif Riverside, Dept Stat, 1337 Olmsted Hall, Riverside, CA 92521 USA
关键词
Asymptotic normality; High-dimensional data; Interpoint distance; Strong mixing condition; Two-sample problem; GEOMETRIC REPRESENTATION; MILD CONDITIONS; MULTIVARIATE; TESTS;
D O I
10.1093/biomet/asy020
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Interpoint distances have applications in many areas of probability and statistics. Thanks to their simplicity of computation, interpoint distance-based procedures are particularly appealing for analysing small samples of high-dimensional data. In this paper, we first study the asymptotic distribution of interpoint distances in the high-dimension, low-sample-size setting and show that it is normal under regularity conditions. We then construct a powerful test for the two-sample problem, which is consistent for detecting location and scale differences. Simulations show that the test compares favourably with existing distance-based tests.
引用
收藏
页码:529 / 546
页数:18
相关论文
共 50 条
  • [1] TWO-SAMPLE BEHRENS-FISHER PROBLEM FOR HIGH-DIMENSIONAL DATA
    Feng, Long
    Zou, Changliang
    Wang, Zhaojun
    Zhu, Lixing
    [J]. STATISTICA SINICA, 2015, 25 (04) : 1297 - 1312
  • [2] Asymptotic distribution of the maximum interpoint distance for high-dimensional data
    Tang, Ping
    Lu, Rongrong
    Xie, Junshan
    [J]. STATISTICS & PROBABILITY LETTERS, 2022, 190
  • [3] A TWO-SAMPLE TEST FOR HIGH-DIMENSIONAL DATA WITH APPLICATIONS TO GENE-SET TESTING
    Chen, Song Xi
    Qin, Ying-Li
    [J]. ANNALS OF STATISTICS, 2010, 38 (02): : 808 - 835
  • [4] Two-sample tests of high-dimensional means for compositional data
    Cao, Yuanpei
    Lin, Wei
    Li, Hongzhe
    [J]. BIOMETRIKA, 2018, 105 (01) : 115 - 132
  • [5] Two-sample tests for sparse high-dimensional binary data
    Plunkett, Amanda
    Park, Junyong
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (22) : 11181 - 11193
  • [6] Asymptotic distribution-free change-point detection based on interpoint distances for high-dimensional data
    Li, Jun
    [J]. JOURNAL OF NONPARAMETRIC STATISTICS, 2020, 32 (01) : 157 - 184
  • [7] Some high-dimensional one-sample tests based on functions of interpoint distances
    Saha, Enakshi
    Sarkar, Soham
    Ghosh, Anil K.
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2017, 161 : 83 - 95
  • [8] Reducing multidimensional two-sample data to one-dimensional interpoint comparisons
    Maa, JF
    Pearl, DK
    Bartoszynski, R
    [J]. ANNALS OF STATISTICS, 1996, 24 (03): : 1069 - 1074
  • [9] Two-sample high-dimensional empirical likelihood
    Fang, Jianglin
    Liu, Wanrong
    Lu, Xuewen
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (13) : 6323 - 6335
  • [10] A note on high-dimensional two-sample test
    Feng, Long
    Sun, Fasheng
    [J]. STATISTICS & PROBABILITY LETTERS, 2015, 105 : 29 - 36