Semi-Distance Correlation and Its Applications

Cited by: 0
Authors
Zhong, Wei [1, 2]
Li, Zhuoxi [2 ]
Guo, Wenwen [3 ]
Cui, Hengjian [3 ]
Affiliations
[1] Xiamen Univ, MOE, WISE, Key Lab Econometr, Xiamen, Peoples R China
[2] Xiamen Univ, Dept Stat & Data Sci, SOE, Xiamen, Peoples R China
[3] Capital Normal Univ, Sch Math Sci, Beijing, Peoples R China
Funding
Beijing Natural Science Foundation; National Natural Science Foundation of China;
Keywords
Groupwise variable screening; High dimensionality; Measures of dependence; Test of independence; SELECTION; MODELS; ASSOCIATION; DENSITY;
DOI
10.1080/01621459.2023.2284988
Chinese Library Classification
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification codes
020208; 070103; 0714;
Abstract
We propose a new measure of dependence between a categorical random variable and a random vector with potentially high dimensions, named semi-distance correlation. It is an interesting extension of distance correlation to accommodate the information of the categorical random variable. It equals zero if and only if the categorical random variable and the other random vector are independent. Two important applications of semi-distance correlation are considered. First, we develop a semi-distance independence test between a categorical random variable and a random vector and derive its asymptotic distributions. When the dimension of the random vector tends to infinity, we derive the explicit asymptotic normal distribution of the test statistic under the null hypothesis, which allows us to compute p-values in an efficient and fast way for high dimensional data. Second, we propose to use the semi-distance correlation as a marginal utility between the response and a group of covariates to do groupwise variable screening for ultrahigh dimensional classification problems. The sure screening property has also been established. Monte Carlo simulations and a real data application are presented to demonstrate the excellent finite sample property of the proposed procedures. A new R package semidist is also developed to implement the proposed methods. Supplementary materials for this article are available online.
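To make the idea of testing independence between a categorical variable and a random vector concrete, the following is a minimal sketch, not the estimator or test from the article or the semidist package. It assumes a stand-in statistic (overall mean pairwise Euclidean distance minus the class-proportion-weighted mean within-class distance) and uses a permutation p-value rather than the asymptotic distributions described in the abstract. The function names `pairwise_dist`, `between_within_stat`, and `permutation_pvalue` are hypothetical.

```python
import numpy as np


def pairwise_dist(x):
    """Euclidean distance matrix for the rows of x (shape n x p)."""
    diff = x[:, None, :] - x[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def between_within_stat(dist, labels):
    """Illustrative statistic (an assumption, not the paper's estimator):
    mean off-diagonal distance over all pairs, minus the weighted mean
    within-class distance, with weights n_r / n."""
    n = len(labels)
    off_diag = ~np.eye(n, dtype=bool)
    overall = dist[off_diag].mean()
    within = 0.0
    for lab in np.unique(labels):
        idx = np.flatnonzero(labels == lab)
        m = len(idx)
        if m < 2:
            continue  # a singleton class has no within-class pairs
        sub = dist[np.ix_(idx, idx)]
        # sub.sum() counts each unordered pair twice; the diagonal is zero
        within += (m / n) * sub.sum() / (m * (m - 1))
    return overall - within


def permutation_pvalue(x, labels, n_perm=199, seed=0):
    """Permutation test: shuffle the labels to mimic independence."""
    rng = np.random.default_rng(seed)
    dist = pairwise_dist(x)
    observed = between_within_stat(dist, labels)
    exceed = 0
    for _ in range(n_perm):
        if between_within_stat(dist, rng.permutation(labels)) >= observed:
            exceed += 1
    return observed, (1 + exceed) / (1 + n_perm)
```

Under independence, within-class distances should look like overall distances, so the statistic should be near zero; a large positive value suggests that class membership carries information about the vector. Permutation is used here only because it needs no distributional theory; it is slower than a closed-form normal approximation of the kind the abstract mentions for high dimensions.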
Pages: 2919-2933
Page count: 15
Related papers
50 in total
  • [31] A semi-classical limit and its applications
    Yu, YL
    GEOMETRY AND TOPOLOGY OF SUBMANIFOLDS X: DIFFERENTIAL GEOMETRY IN HONOR OF PROF S.S. CHERN, 2000, : 315 - 335
  • [32] On semi-R-boundedness and its applications
    Veraar, Mark
    Weis, Lutz
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2010, 363 (02) : 431 - 443
  • [33] The holodisc distance transform and its applications in image analysis
    Pirard, E
    MICROSCOPY MICROANALYSIS MICROSTRUCTURES, 1996, 7 (5-6): : 453 - 460
  • [34] Image ruler and its applications in distance and area measurement
    Wang, Ti-Ho
    Lu, Ming-Chih
    Hsu, Chen-Chien
    Lu, Yin Yu
    WSEAS Transactions on Systems, 2007, 6 (05): : 901 - 907
  • [35] The weak lower semicontinuity of the Kobayashi distance and its applications
    Tadeusz Kuczumow
    Mathematische Zeitschrift, 2001, 236 : 1 - 9
  • [36] Privacy-preserving distance measurement and its applications
    Luo, YL
    Huang, LS
    Chen, GL
    Shen, H
    CHINESE JOURNAL OF ELECTRONICS, 2006, 15 (02): : 237 - 241
  • [37] The Generalized Laplacian Distance and its Applications for Visual Matching
    Elboher, Elhanan
    Werman, Michael
    Hel-Or, Yacov
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 2315 - 2322
  • [38] The weak lower semicontinuity of the Kobayashi distance and its applications
    Kuczumow, T
    MATHEMATISCHE ZEITSCHRIFT, 2001, 236 (01) : 1 - 9
  • [39] Secure Hamming Distance Based Computation and Its Applications
    Jarrous, Ayman
    Pinkas, Benny
    APPLIED CRYPTOGRAPHY AND NETWORK SECURITY, 2009, 5536 : 107 - 124
  • [40] Torque and its applications in the designs of microprocessor distance relays
    Wang, LC
    Price, E
    1999 IEEE TRANSMISSION AND DISTRIBUTION CONFERENCE, VOLS 1 & 2, 1999, : 433 - 440