Gini Correlation for Feature Screening

被引:0
|
作者
Jun-ying ZHANG [1 ]
Xiao-feng LIU [2 ]
Ri-quan ZHANG [3 ,4 ]
Hang-WANG [1 ]
机构
[1] Department of Mathematics, Taiyuan University of Technology  2. College of Data Science, Taiyuan University of Technology
[2] School of Finance and Statistics, East China Normal University
[3] Department of Mathematics, Shanxi Datong University
基金
中国国家自然科学基金; 国家教育部博士点专项基金资助;
关键词
ultrahigh dimension; Gini correlation coefficient; variable screening; feature ranking;
D O I
暂无
中图分类号
O212 [数理统计];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper we propose the Gini correlation screening(GCS) method to select the important variables with ultrahigh dimensional data. The new procedure is based on the Gini correlation coefficient via the covariance between the response and the rank of the predictor variables rather than the Pearson correlation and the Kendall τ correlation coefficient. The new method does not require imposing a specific model structure on regression functions and only needs the condition which the predictors and response have continuous distribution function. We demonstrate that, with the number of predictors growing at an exponential rate of the sample size, the proposed procedure possesses consistency in ranking, which is both useful in its own right and can lead to consistency in selection. The procedure is computationally efficient and simple, and exhibits a competent empirical performance in our intensive simulations and real data analysis.
引用
收藏
页码:590 / 601
页数:12
相关论文
共 50 条
  • [1] Gini Correlation for Feature Screening
    Zhang, Jun-ying
    Liu, Xiao-feng
    Zhang, Ri-quan
    Wang, Hang
    [J]. ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2021, 37 (03): : 590 - 601
  • [2] Gini Correlation for Feature Screening
    Jun-ying Zhang
    Xiao-feng Liu
    Ri-quan Zhang
    Hang Wang
    [J]. Acta Mathematicae Applicatae Sinica, English Series, 2021, 37 : 590 - 601
  • [3] Grouped feature screening for ultrahigh-dimensional classification via Gini distance correlation
    Sang, Yongli
    Dang, Xin
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2024, 204
  • [4] Stable correlation and robust feature screening
    Guo, Xu
    Li, Runze
    Liu, Wanjun
    Zhu, Lixing
    [J]. SCIENCE CHINA-MATHEMATICS, 2022, 65 (01) : 153 - 168
  • [5] Stable correlation and robust feature screening
    Xu Guo
    Runze Li
    Wanjun Liu
    Lixing Zhu
    [J]. Science China Mathematics, 2022, 65 (01) : 153 - 168
  • [6] Stable correlation and robust feature screening
    Xu Guo
    Runze Li
    Wanjun Liu
    Lixing Zhu
    [J]. Science China Mathematics, 2022, 65 : 153 - 168
  • [7] On the proper bounds of the Gini correlation
    Schechtman, E
    Yitzhaki, S
    [J]. ECONOMICS LETTERS, 1999, 63 (02) : 133 - 138
  • [8] Symmetric Gini covariance and correlation
    Sang, Yongli
    Dang, Xin
    Sang, Hailin
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2016, 44 (03): : 323 - 342
  • [9] Group feature screening based on Gini impurity for ultrahigh-dimensional multi-classification
    Wang, Zhongzheng
    Deng, Guangming
    Xu, Haiyun
    [J]. AIMS MATHEMATICS, 2023, 8 (02): : 4342 - 4362
  • [10] Feature Screening via Distance Correlation Learning
    Li, Runze
    Zhong, Wei
    Zhu, Liping
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (499) : 1129 - 1139