Model-free feature screening via a modified composite quantile correlation

被引:4
|
作者
Xu, Kai [1 ]
机构
[1] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai 200433, Peoples R China
基金
中国国家自然科学基金;
关键词
Modified composite quantile correlation; Feature screening; Sure screening property; Rank consistency property; ELASTIC-NET; SELECTION; REGRESSION; LASSO;
D O I
10.1016/j.jspi.2017.03.006
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper we introduce a modified composite quantile correlation (MCQC for short) to rank the relative importance of each predictor in ultrahigh dimensional regressions. We advocate using the MCQC for three reasons. First, our metric is a natural extension of quantile correlation (QC) and composite quantile correlation (CQC) considered by Li et al. (2015) and Ma and Zhang (2016), respectively. Second, the MCQC uses local information flows of model variables and is nonnegative and equals zero if and only if two random variables are independent. This indicates that the MCQC can detect nonlinear effects including interactions and heterogeneity. Third, the MCQC is conceptually simple, easy to implement and robust to the presence of extreme values and outliers in the observations. We also show that, under mild conditions, the MCQC-based procedure has the desirable sure screening property, which guarantees that all important predictors can be retained after screening with probability approaching one, and rank consistency property. Simulation results demonstrate that in comparison with the existing counterparts, the MCQC-based screening procedure has an excellent capability of detecting nonlinear dependence relationships especially when the variables are highly correlated. We also illustrate the MCQC-based screening procedure through an empirical example. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:22 / 35
页数:14
相关论文
共 50 条
  • [1] Robust model-free feature screening via quantile correlation
    Ma, Xuejun
    Zhang, Jingxiao
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2016, 143 : 472 - 480
  • [2] Model-free feature screening via distance correlation for ultrahigh dimensional survival data
    Jing Zhang
    Yanyan Liu
    Hengjian Cui
    [J]. Statistical Papers, 2021, 62 : 2711 - 2738
  • [3] Model-free feature screening via distance correlation for ultrahigh dimensional survival data
    Zhang, Jing
    Liu, Yanyan
    Cui, Hengjian
    [J]. STATISTICAL PAPERS, 2021, 62 (06) : 2711 - 2738
  • [4] Model-free sure screening via maximum correlation
    Huang, Qiming
    Zhu, Yu
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2016, 148 : 89 - 106
  • [5] Distribution-free and model-free multivariate feature screening via multivariate rank distance correlation
    Zhao, Shaofei
    Fu, Guifang
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2022, 192
  • [6] Model-free conditional screening via conditional distance correlation
    Jun Lu
    Lu Lin
    [J]. Statistical Papers, 2020, 61 : 225 - 244
  • [7] MODEL-FREE FEATURE SCREENING FOR ULTRAHIGH DIMENSIONAL DATATHROUGH A MODIFIED BLUM-KIEFER-ROSENBLATT CORRELATION
    Zhou, Yeqing
    Zhu, Liping
    [J]. STATISTICA SINICA, 2018, 28 (03) : 1351 - 1370
  • [8] Model-free conditional screening via conditional distance correlation
    Lu, Jun
    Lin, Lu
    [J]. STATISTICAL PAPERS, 2020, 61 (01) : 225 - 244
  • [9] A note on quantile feature screening via distance correlation
    Xiaolin Chen
    Xiaojing Chen
    Yi Liu
    [J]. Statistical Papers, 2019, 60 : 1741 - 1762
  • [10] A note on quantile feature screening via distance correlation
    Chen, Xiaolin
    Chen, Xiaojing
    Liu, Yi
    [J]. STATISTICAL PAPERS, 2019, 60 (05) : 1741 - 1762