Quantile-Composited Feature Screening for Ultrahigh-Dimensional Data

Cited by: 0
Authors
Chen, Shuaishuai [1]
Lu, Jun [2]
Affiliations
[1] Shandong Univ, Sch Math, Jinan 250100, Peoples R China
[2] Natl Univ Def & Technol, Sch Sci, Changsha 410000, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
feature screening; discriminative analysis; quantile-composited; classification
DOI
10.3390/math11102398
Chinese Library Classification
O1 [Mathematics]
Discipline classification codes
0701; 070101
Abstract
Ultrahigh-dimensional grouped data are frequently encountered by biostatisticians working on multi-class categorical problems. To rapidly screen out the null predictors, this paper proposes a quantile-composited feature screening procedure. The new method first transforms each continuous predictor into a Bernoulli variable by thresholding the predictor at a certain quantile. The independence between the response and each predictor can then be assessed easily with the Pearson chi-square statistic. The proposed method has the following salient features: (1) it is robust against high-dimensional heterogeneous data; (2) it is model-free, requiring no regression structure to be specified between the covariates and the outcome variable; (3) it has a low computational cost, with the computational complexity controlled at the sample size level. Under some mild conditions, the new method is shown to achieve the sure screening property without imposing any moment condition on the predictors. Numerical studies and real data analyses further confirm the effectiveness of the new screening procedure.
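The two steps named in the abstract (thresholding each predictor at a quantile, then testing independence from the class label with the Pearson chi-square statistic) can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation. The function names `pearson_chi2` and `quantile_screen`, the quantile grid `taus`, and the choice to sum the statistics across quantiles are assumptions made here; this record does not say how the paper composites across quantiles.

```python
import numpy as np


def pearson_chi2(b, y_idx, n_classes):
    """Pearson chi-square statistic for a 2 x K contingency table.

    b         : boolean array, the thresholded (Bernoulli) predictor
    y_idx     : integer class index in 0..n_classes-1 for each sample
    n_classes : number of response classes K
    """
    n = b.size
    table = np.zeros((2, n_classes))
    # Count samples in each (indicator, class) cell.
    np.add.at(table, (b.astype(int), y_idx), 1.0)
    # Expected counts under independence: row total * column total / n.
    expected = np.outer(table.sum(axis=1), table.sum(axis=0)) / n
    valid = expected > 0  # skip empty rows/columns to avoid dividing by zero
    return float(np.sum((table[valid] - expected[valid]) ** 2 / expected[valid]))


def quantile_screen(X, y, taus=(0.25, 0.5, 0.75), top_k=10):
    """Rank predictors by a chi-square score summed over quantile levels.

    Summing over `taus` is an assumed way of compositing; a single
    quantile level can be used by passing e.g. taus=(0.5,).
    """
    X = np.asarray(X, dtype=float)
    classes, y_idx = np.unique(np.asarray(y).ravel(), return_inverse=True)
    scores = np.zeros(X.shape[1])
    for j in range(X.shape[1]):
        col = X[:, j]
        for tau in taus:
            # Bernoulli transform: 1 if the value exceeds the tau-quantile.
            b = col > np.quantile(col, tau)
            scores[j] += pearson_chi2(b, y_idx, classes.size)
    ranked = np.argsort(-scores)  # largest score first
    return ranked[:top_k], scores
```

Because each predictor enters only through comparisons with its own sample quantiles, the score is unchanged by monotone increasing transformations of that predictor and does not depend on its moments, which matches the robustness and moment-free claims in the abstract. The work per predictor grows with the sample size, since each quantile level needs one quantile computation and one pass to fill the contingency table.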
Pages: 21