Quantile-Composited Feature Screening for Ultrahigh-Dimensional Data

被引:0
|
作者
Chen, Shuaishuai [1 ]
Lu, Jun [2 ]
机构
[1] Shandong Univ, Sch Math, Jinan 250100, Peoples R China
[2] Natl Univ Def & Technol, Sch Sci, Changsha 410000, Peoples R China
基金
中国国家自然科学基金;
关键词
feature screening; discriminative analysis; quantile-composited; CLASSIFICATION;
D O I
10.3390/math11102398
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Ultrahigh-dimensional grouped data are frequently encountered by biostatisticians working on multi-class categorical problems. To rapidly screen out the null predictors, this paper proposes a quantile-composited feature screening procedure. The new method first transforms the continuous predictor to a Bernoulli variable, by thresholding the predictor at a certain quantile. Consequently, the independence between the response and each predictor is easy to judge, by employing the Pearson chi-square statistic. The newly proposed method has the following salient features: (1) it is robust against high-dimensional heterogeneous data; (2) it is model-free, without specifying any regression structure between the covariate and outcome variable; (3) it enjoys a low computational cost, with the computational complexity controlled at the sample size level. Under some mild conditions, the new method was shown to achieve the sure screening property without imposing any moment condition on the predictors. Numerical studies and real data analyses further confirmed the effectiveness of the new screening procedure.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Conditional quantile screening in ultrahigh-dimensional heterogeneous data
    Wu, Yuanshan
    Yin, Guosheng
    [J]. BIOMETRIKA, 2015, 102 (01) : 65 - 76
  • [2] A selective overview of feature screening for ultrahigh-dimensional data
    JingYuan Liu
    Wei Zhong
    RunZe Li
    [J]. Science China Mathematics, 2015, 58 : 1 - 22
  • [3] A selective overview of feature screening for ultrahigh-dimensional data
    Liu JingYuan
    Zhong Wei
    Li RunZe
    [J]. SCIENCE CHINA-MATHEMATICS, 2015, 58 (10) : 2033 - 2054
  • [4] A selective overview of feature screening for ultrahigh-dimensional data
    LIU JingYuan
    ZHONG Wei
    LI RunZe
    [J]. Science China Mathematics, 2015, 58 (10) : 2033 - 2054
  • [5] Model-Free Feature Screening for Ultrahigh-Dimensional Data
    Zhu, Li-Ping
    Li, Lexin
    Li, Runze
    Zhu, Li-Xing
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (496) : 1464 - 1475
  • [6] Nonparametric independence feature screening for ultrahigh-dimensional survival data
    Pan, Jing
    Yu, Yuan
    Zhou, Yong
    [J]. METRIKA, 2018, 81 (07) : 821 - 847
  • [7] Nonparametric independence feature screening for ultrahigh-dimensional survival data
    Jing Pan
    Yuan Yu
    Yong Zhou
    [J]. Metrika, 2018, 81 : 821 - 847
  • [8] Group feature screening for ultrahigh-dimensional data missing at random
    He, Hanji
    Li, Meini
    Deng, Guangming
    [J]. AIMS MATHEMATICS, 2024, 9 (02): : 4032 - 4056
  • [9] Nonparametric independence feature screening for ultrahigh-dimensional missing data
    Fang, Jianglin
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (10) : 5670 - 5689
  • [10] The Sparse MLE for Ultrahigh-Dimensional Feature Screening
    Xu, Chen
    Chen, Jiahua
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (507) : 1257 - 1269