Entropy-based model-free feature screening for ultrahigh-dimensional multiclass classification

被引:24
|
作者
Ni, Lyu [1 ]
Fang, Fang [1 ]
机构
[1] East China Normal Univ, Sch Stat, Shanghai 200241, Peoples R China
关键词
entropy; feature screening; information gain; multiclass classification; sure screening property; VARYING COEFFICIENT MODELS; KOLMOGOROV FILTER;
D O I
10.1080/10485252.2016.1167206
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Most feature screening methods for ultrahigh-dimensional classification explicitly or implicitly assume the covariates are continuous. However, in the practice, it is quite common that both categorical and continuous covariates appear in the data, and applicable feature screening method is very limited. To handle this non-trivial situation, we propose an entropy-based feature screening method, which is model free and provides a unified screening procedure for both categorical and continuous covariates. We establish the sure screening and ranking consistency properties of the proposed procedure. We investigate the finite sample performance of the proposed procedure by simulation studies and illustrate the method by a real data analysis.
引用
收藏
页码:515 / 530
页数:16
相关论文
共 50 条
  • [1] Model-Free Feature Screening for Ultrahigh-Dimensional Data
    Zhu, Li-Ping
    Li, Lexin
    Li, Runze
    Zhu, Li-Xing
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (496) : 1464 - 1475
  • [2] A Robust Model-Free Feature Screening Method for Ultrahigh-Dimensional Data
    Xue, Jingnan
    Liang, Faming
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2017, 26 (04) : 803 - 813
  • [3] Model-free feature screening for ultrahigh dimensional classification
    Sheng, Ying
    Wang, Qihua
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2020, 178
  • [4] Model-free feature screening for ultrahigh-dimensional data conditional on some variables
    Yi Liu
    Qihua Wang
    [J]. Annals of the Institute of Statistical Mathematics, 2018, 70 : 283 - 301
  • [5] Model-free feature screening for ultrahigh-dimensional data conditional on some variables
    Liu, Yi
    Wang, Qihua
    [J]. ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2018, 70 (02) : 283 - 301
  • [6] Model-free slice screening for ultrahigh-dimensional survival data
    Zhang, Jing
    Liu, Yanyan
    [J]. JOURNAL OF APPLIED STATISTICS, 2021, 48 (10) : 1755 - 1774
  • [7] On Exact Feature Screening in Ultrahigh-Dimensional Binary Classification
    Roy, Sarbojit
    Sarkar, Soham
    Dutta, Subhajit
    Ghosh, Anil K.
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024, 33 (02) : 448 - 462
  • [8] Survival Impact Index and Ultrahigh-Dimensional Model-Free Screening with Survival Outcomes
    Li, Jialiang
    Zheng, Qi
    Peng, Limin
    Huang, Zhipeng
    [J]. BIOMETRICS, 2016, 72 (04) : 1145 - 1154
  • [9] Model-free feature screening for ultrahigh dimensional censored regression
    Tingyou Zhou
    Liping Zhu
    [J]. Statistics and Computing, 2017, 27 : 947 - 961
  • [10] A NEW MODEL-FREE FEATURE SCREENING PROCEDURE FOR ULTRAHIGH-DIMENSIONAL INTERVAL-CENSORED FAILURE TIME DATA
    Zhang, Jing
    Du, Mingyue
    Liu, Yanyan
    Sun, Jianguo
    [J]. STATISTICA SINICA, 2023, 33 (03) : 1809 - 1830