Semi-Distance Correlation and Its Applications

被引:0
|
作者
Zhong, Wei [1 ,2 ]
Li, Zhuoxi [2 ]
Guo, Wenwen [3 ]
Cui, Hengjian [3 ]
机构
[1] Xiamen Univ, MOE, WISE, Key Lab Econometr, Xiamen, Peoples R China
[2] Xiamen Univ, Dept Stat & Data Sci, SOE, Xiamen, Peoples R China
[3] Capital Normal Univ, Sch Math Sci, Beijing, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Groupwise variable screening; High dimensionality; Measures of dependence; Test of independence; SELECTION; MODELS; ASSOCIATION; DENSITY;
D O I
10.1080/01621459.2023.2284988
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a new measure of dependence between a categorical random variable and a random vector with potentially high dimensions, named semi-distance correlation. It is an interesting extension of distance correlation to accommodate the information of the categorical random variable. It equals zero if and only if the categorical random variable and the other random vector are independent. Two important applications of semi-distance correlation are considered. First, we develop a semi-distance independence test between a categorical random variable and a random vector and derive its asymptotic distributions. When the dimension of the random vector tends to infinity, we derive the explicit asymptotic normal distribution of the test statistic under the null hypothesis, which allows us to compute p-values in an efficient and fast way for high dimensional data. Second, we propose to use the semi-distance correlation as a marginal utility between the response and a group of covariates to do groupwise variable screening for ultrahigh dimensional classification problems. The sure screening property has also been established. Monte Carlo simulations and a real data application are presented to demonstrate the excellent finite sample property of the proposed procedures. A new R package semidist is also developed to implement the proposed methods. Supplementary materials for this article are available online.
引用
收藏
页码:2919 / 2933
页数:15
相关论文
共 50 条
  • [21] Correlation Wavelet and its Applications
    徐长发
    蔡超
    皮明红
    朱春喜
    李国宽
    数学季刊, 1999, (01) : 5 - 9
  • [22] On the Average Taxicab Distance Function and Its Applications
    Csaba Vincze
    Ábris Nagy
    Acta Applicandae Mathematicae, 2019, 161 : 201 - 220
  • [23] KNOWLEDGE MANAGEMENT & ITS APPLICATIONS IN DISTANCE EDUCATION
    Saxena, Anurag
    TURKISH ONLINE JOURNAL OF DISTANCE EDUCATION, 2007, 8 (04): : 96 - 101
  • [24] Distance measure of uncertain sets and its applications
    Wang, Xiao
    Ning, Yufu
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (03) : 1933 - 1945
  • [25] On the Average Taxicab Distance Function and Its Applications
    Vincze, Csaba
    Nagy, Abris
    ACTA APPLICANDAE MATHEMATICAE, 2019, 161 (01) : 201 - 220
  • [26] Fractional triple correlation and its applications
    Department of Physical Electronics, Faculty of Engineering, Tel Aviv University, 69978 Tel Aviv, Israel
    不详
    不详
    J Opt Soc Am A, 6 (1658-1661):
  • [27] Fractional triple correlation and its applications
    Mendlovic, David
    Mas, David
    Lohmann, Adolf W.
    Zalevsky, Zeev
    Shabtay, Gal
    Journal of the Optical Society of America A: Optics and Image Science, and Vision, 1998, 15 (06):
  • [28] On Kuneth's correlation and its applications
    Mdzinarishvili, Leonard
    GEORGIAN MATHEMATICAL JOURNAL, 2019, 26 (02) : 295 - 301
  • [29] Fractional triple correlation and its applications
    Mendlovic, D
    Mas, D
    Lohmann, AW
    Zalevsky, Z
    Shabtay, G
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1998, 15 (06): : 1658 - 1661
  • [30] The Semi-Hyperbolic Distribution and Its Applications
    Ivanov, Roman V.
    STATS, 2023, 6 (04): : 1126 - 1146