Multivariate Rank-Based Distribution-Free Nonparametric Testing Using Measure Transportation

被引:38
|
作者
Deb, Nabarun [1 ]
Sen, Bodhisattva [1 ]
机构
[1] Columbia Univ, Dept Stat, 1255 Amsterdam Ave,Room 1032, New York, NY 10027 USA
关键词
Asymptotic null distribution; Consistency against fixed alternatives; Distance covariance; Distribution-free inference; Energy distance; Multivariate ranks; Multivariate two-sample testing; Quasi-Monte Carlo sequences; Stein's method for exchangeable pairs; Testing for mutual independence; 2-SAMPLE TESTS; INDEPENDENCE; ASSOCIATION; STATISTICS; DEPTH; COEFFICIENTS; COMPETITORS; HYPOTHESES; EFFICIENCY; DEPENDENCE;
D O I
10.1080/01621459.2021.1923508
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this article, we propose a general framework for distribution-free nonparametric testing in multi-dimensions, based on a notion of multivariate ranks defined using the theory of measure transportation. Unlike other existing proposals in the literature, these multivariate ranks share a number of useful properties with the usual one-dimensional ranks; most importantly, these ranks are distribution-free. This crucial observation allows us to design nonparametric tests that are exactly distribution-free under the null hypothesis. We demonstrate the applicability of this approach by constructing exact distribution-free tests for two classical nonparametric problems: (I) testing for mutual independence between random vectors, and (II) testing for the equality of multivariate distributions. In particular, we propose (multivariate) rank versions of distance covariance and energy statistic for testing scenarios (I) and (II), respectively. In both these problems, we derive the asymptotic null distribution of the proposed test statistics. We further show that our tests are consistent against all fixed alternatives. Moreover, the proposed tests are computationally feasible and are well-defined under minimal assumptions on the underlying distributions (e.g., they do not need any moment assumptions). We also demonstrate the efficacy of these procedures via extensive simulations. In the process of analyzing the theoretical properties of our procedures, we end up proving some new results in the theory of measure transportation and in the limit theory of permutation statistics using Stein's method for exchangeable pairs, which may be of independent interest.
引用
收藏
页码:192 / 207
页数:16
相关论文
共 50 条
  • [1] Rank-based partial autocorrelations are not asymptotically distribution-free
    Garel, B
    Hallin, M
    [J]. STATISTICS & PROBABILITY LETTERS, 2000, 47 (03) : 219 - 227
  • [2] A distribution-free robust method for monitoring linear profiles using rank-based regression
    Zi, Xuemin
    Zou, Changliang
    Tsung, Fugee
    [J]. IIE TRANSACTIONS, 2012, 44 (11) : 949 - 963
  • [3] Rank-based testing for semiparametric VAR models: A measure transportation approach
    Hallin, Marc
    La Vecchia, Davide
    Liu, Hang
    [J]. BERNOULLI, 2023, 29 (01) : 229 - 273
  • [4] A class of simple distribution-free rank-based unit root tests
    Hallin, Marc
    van den Akker, Ramon
    Werker, Bas J. M.
    [J]. JOURNAL OF ECONOMETRICS, 2011, 163 (02) : 200 - 214
  • [5] Distribution-Free Detection of Structured Anomalies: Permutation and Rank-Based Scans
    Arias-Castro, Ery
    Castro, Rui M.
    Tanczos, Ervin
    Wang, Meng
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (522) : 789 - 801
  • [6] Distribution-free and model-free multivariate feature screening via multivariate rank distance correlation
    Zhao, Shaofei
    Fu, Guifang
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2022, 192
  • [7] Distribution-free multivariate process monitoring: A rank-energy statistic-based approach
    Chakraborty, Niladri
    Finkelstein, Maxim
    [J]. QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2024,
  • [8] A rank-based multivariate CUSUM procedure
    Qiu, PH
    Hawkins, D
    [J]. TECHNOMETRICS, 2001, 43 (02) : 120 - 132
  • [9] A DISTRIBUTION-FREE MULTIVARIATE SIGN TEST BASED ON INTERDIRECTIONS
    RANDLES, RH
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1989, 84 (408) : 1045 - 1050
  • [10] Inferring directed networks using a rank-based connectivity measure
    Leguia, Marc G.
    Martinez, Cristina G. B.
    Malvestio, Irene
    Campo, Adria Tauste
    Rocamora, Rodrigo
    Levnajic, Zoran
    Andrzejak, Ralph G.
    [J]. PHYSICAL REVIEW E, 2019, 99 (01)