RANK-BASED INDICES FOR TESTING INDEPENDENCE BETWEEN TWO HIGH-DIMENSIONAL VECTORS

被引:1
|
作者
Zhou, Yeqing [1 ,2 ]
Xu, Kai [3 ]
Zhu, Liping [4 ,5 ]
Li, Runze [6 ]
机构
[1] Tongji Univ, Sch Math Sci, Shanghai, Peoples R China
[2] Tongji Univ, Key Lab Intelligent Comp & Applicat, Shanghai, Peoples R China
[3] Anhui Normal Univ, Sch Math & Stat, Hefei, Peoples R China
[4] Renmin Univ China, Inst Stat & Big Data, Ctr Appl Stat, Beijing, Peoples R China
[5] Zhejiang Gongshang Univ, Zhijiang Inst Big Data & Stat, Sch Stat & Math, Hangzhou, Peoples R China
[6] Penn State Univ, Dept Stat, University Pk, PA USA
来源
ANNALS OF STATISTICS | 2024年 / 52卷 / 01期
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
Bergsma-Dassios-Yanagimoto's t*; Blum-Kiefer-Rosenblatt's R; degenerate U- statistics; Hoeffding's D; SIGN COVARIANCE; DISTANCE; ASSOCIATION; LIMIT; DEPENDENCE; FRAMEWORK; METRICS;
D O I
10.1214/23-AOS2339
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
To test independence between two high-dimensional random vectors, we propose three tests based on the rank-based indices derived from Hoeffding's D, Blum-Kiefer-Rosenblatt's R and Bergsma-Dassios-Yanagimoto's tau(& lowast;). Under the null hypothesis of independence, we show that the distributions of the proposed test statistics converge to normal ones if the dimensions diverge arbitrarily with the sample size. We further derive an explicit rate of convergence. Thanks to the monotone transformation-invariant property, these distribution-free tests can be readily used to generally distributed random vectors including heavily-tailed ones. We further study the local power of the proposed tests and compare their relative efficiencies with two classic distance covariance/correlation based tests in high-dimensional settings. We establish explicit relationships between D, R, tau(& lowast;) and Pearson's correlation for bivariate normal random variables. The relationships serve as a basis for power comparison. Our theoretical results show that under a Gaussian equicorrelation alternative: (i) the proposed tests are superior to the two classic distance covariance/correlation based tests if the components of random vectors have very different scales; (ii) the asymptotic efficiency of the proposed tests based on D, tau(& lowast;) and R are sorted in a descending order.
引用
收藏
页码:184 / 206
页数:23
相关论文
共 50 条
  • [1] Rank-based max-sum tests for mutual independence of high-dimensional random vectors
    Wang, Hongfei
    Liu, Binghui
    Feng, Long
    Ma, Yanyuan
    [J]. JOURNAL OF ECONOMETRICS, 2024, 238 (01)
  • [2] A rank-based adaptive independence test for high-dimensional data
    Shi, Xiangyu
    Cao, Ruiyuan
    Du, Jiang
    Miao, Zhuqing
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024,
  • [3] A rank-based high-dimensional test for equality of mean vectors
    Ouyang, Yanyan
    Liu, Jiamin
    Tong, Tiejun
    Xu, Wangli
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2022, 173
  • [4] High-dimensional rank-based inference
    Kong, Xiaoli
    Harrar, Solomon W.
    [J]. JOURNAL OF NONPARAMETRIC STATISTICS, 2020, 32 (02) : 294 - 322
  • [5] A Simple Method for Testing Independence of High-Dimensional Random Vectors
    Jakimauskas, Gintautas
    Radavicius, Marijus
    Susinskas, Jurgis
    [J]. AUSTRIAN JOURNAL OF STATISTICS, 2008, 37 (01) : 101 - 108
  • [6] HIGH-DIMENSIONAL CONSISTENT INDEPENDENCE TESTING WITH MAXIMA OF RANK CORRELATIONS
    Drton, Mathias
    Han, Fang
    Shi, Hongjian
    [J]. ANNALS OF STATISTICS, 2020, 48 (06): : 3206 - 3227
  • [7] Rank-based score tests for high-dimensional regression coefficients
    Feng, Long
    Zou, Changliang
    Wang, Zhaojun
    Chen, Bin
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2013, 7 : 2131 - 2149
  • [8] Testing Independence Among a Large Number of High-Dimensional Random Vectors
    Pan, Guangming
    Gao, Jiti
    Yang, Yanrong
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (506) : 600 - 612
  • [9] Rank-based classifiers for extremely high-dimensional gene expression data
    Ludwig Lausser
    Florian Schmid
    Lyn-Rouven Schirra
    Adalbert F. X. Wilhelm
    Hans A. Kestler
    [J]. Advances in Data Analysis and Classification, 2018, 12 : 917 - 936
  • [10] Rank-based classifiers for extremely high-dimensional gene expression data
    Lausser, Ludwig
    Schmid, Florian
    Schirra, Lyn-Rouven
    Wilhelm, Adalbert F. X.
    Kestler, Hans A.
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2018, 12 (04) : 917 - 936