Group Feature Screening Based on Information Gain Ratio for Ultrahigh-Dimensional Data

被引:3
|
作者
Wang, Zhongzheng [1 ]
Deng, Guangming [1 ,2 ]
Yu, Jianqi [1 ]
机构
[1] Guilin Univ Technol, Coll Sci, Guilin 541000, Peoples R China
[2] Guilin Univ Technol, Appl Stat Inst, Guilin 541000, Peoples R China
基金
中国国家自然科学基金;
关键词
REGRESSION; SELECTION; LASSO;
D O I
10.1155/2022/1600986
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Most model-free feature screening approaches focus on the -individual predictor; therefore, they are not able to incorporate structured predictors like grouped variables. In this article, we propose a group screening procedure via the information gain ratio for a classification model, which is a direct extension of the original sure independence screening procedure and also model-free. The proposed method yields a better screening performance and classification accuracy. It is demonstrated that the proposed group screening method possesses the sure screening property and ranking consistency properties under certain regularity conditions. Through simulation studies and real-world data analysis, we demonstrate the proposed method with the finite sample performance.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Group feature screening for ultrahigh-dimensional data missing at random
    He, Hanji
    Li, Meini
    Deng, Guangming
    [J]. AIMS MATHEMATICS, 2024, 9 (02): : 4032 - 4056
  • [2] A selective overview of feature screening for ultrahigh-dimensional data
    JingYuan Liu
    Wei Zhong
    RunZe Li
    [J]. Science China Mathematics, 2015, 58 : 1 - 22
  • [3] A selective overview of feature screening for ultrahigh-dimensional data
    Liu JingYuan
    Zhong Wei
    Li RunZe
    [J]. SCIENCE CHINA-MATHEMATICS, 2015, 58 (10) : 2033 - 2054
  • [4] A selective overview of feature screening for ultrahigh-dimensional data
    LIU JingYuan
    ZHONG Wei
    LI RunZe
    [J]. Science China Mathematics, 2015, 58 (10) : 2033 - 2054
  • [5] Model-Free Feature Screening for Ultrahigh-Dimensional Data
    Zhu, Li-Ping
    Li, Lexin
    Li, Runze
    Zhu, Li-Xing
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (496) : 1464 - 1475
  • [6] Nonparametric independence feature screening for ultrahigh-dimensional survival data
    Pan, Jing
    Yu, Yuan
    Zhou, Yong
    [J]. METRIKA, 2018, 81 (07) : 821 - 847
  • [7] Nonparametric independence feature screening for ultrahigh-dimensional survival data
    Jing Pan
    Yuan Yu
    Yong Zhou
    [J]. Metrika, 2018, 81 : 821 - 847
  • [8] Quantile-Composited Feature Screening for Ultrahigh-Dimensional Data
    Chen, Shuaishuai
    Lu, Jun
    [J]. MATHEMATICS, 2023, 11 (10)
  • [9] Nonparametric independence feature screening for ultrahigh-dimensional missing data
    Fang, Jianglin
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (10) : 5670 - 5689
  • [10] Group feature screening based on Gini impurity for ultrahigh-dimensional multi-classification
    Wang, Zhongzheng
    Deng, Guangming
    Xu, Haiyun
    [J]. AIMS MATHEMATICS, 2023, 8 (02): : 4342 - 4362