Model-free feature screening via a modified composite quantile correlation

被引:4
|
作者
Xu, Kai [1 ]
机构
[1] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai 200433, Peoples R China
基金
中国国家自然科学基金;
关键词
Modified composite quantile correlation; Feature screening; Sure screening property; Rank consistency property; ELASTIC-NET; SELECTION; REGRESSION; LASSO;
D O I
10.1016/j.jspi.2017.03.006
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper we introduce a modified composite quantile correlation (MCQC for short) to rank the relative importance of each predictor in ultrahigh dimensional regressions. We advocate using the MCQC for three reasons. First, our metric is a natural extension of quantile correlation (QC) and composite quantile correlation (CQC) considered by Li et al. (2015) and Ma and Zhang (2016), respectively. Second, the MCQC uses local information flows of model variables and is nonnegative and equals zero if and only if two random variables are independent. This indicates that the MCQC can detect nonlinear effects including interactions and heterogeneity. Third, the MCQC is conceptually simple, easy to implement and robust to the presence of extreme values and outliers in the observations. We also show that, under mild conditions, the MCQC-based procedure has the desirable sure screening property, which guarantees that all important predictors can be retained after screening with probability approaching one, and rank consistency property. Simulation results demonstrate that in comparison with the existing counterparts, the MCQC-based screening procedure has an excellent capability of detecting nonlinear dependence relationships especially when the variables are highly correlated. We also illustrate the MCQC-based screening procedure through an empirical example. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:22 / 35
页数:14
相关论文
共 50 条
  • [41] Model-free feature screening for ultrahigh dimensional data via a Pearson chi-square based index
    Ma, Weidong
    Xiao, Jingsong
    Yang, Ying
    Ye, Fei
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2022, 92 (15) : 3222 - 3248
  • [42] Variable Screening via Quantile Partial Correlation
    Ma, Shujie
    Li, Runze
    Tsai, Chih-Ling
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) : 650 - 663
  • [43] QUANTILE-ADAPTIVE MODEL-FREE VARIABLE SCREENING FOR HIGH-DIMENSIONAL HETEROGENEOUS DATA
    He, Xuming
    Wang, Lan
    Hong, Hyokyoung Grace
    [J]. ANNALS OF STATISTICS, 2013, 41 (01): : 342 - 369
  • [44] A Robust Model-Free Feature Screening Method for Ultrahigh-Dimensional Data
    Xue, Jingnan
    Liang, Faming
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2017, 26 (04) : 803 - 813
  • [46] A model-free conditional screening approach via sufficient dimension reduction
    Huo, Lei
    Wen, Xuerong Meggie
    Yu, Zhou
    [J]. JOURNAL OF NONPARAMETRIC STATISTICS, 2020, 32 (04) : 970 - 988
  • [47] On the predictability of model-free implied correlation
    Markopoulou, Chryssa
    Skintzi, Vasiliki
    Refenes, Apostolos
    [J]. INTERNATIONAL JOURNAL OF FORECASTING, 2016, 32 (02) : 527 - 547
  • [48] Model-free data screening and cleaning
    Tarter, Michael E.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2011, 3 (02): : 168 - 176
  • [49] Model-free feature screening for ultrahigh-dimensional data conditional on some variables
    Yi Liu
    Qihua Wang
    [J]. Annals of the Institute of Statistical Mathematics, 2018, 70 : 283 - 301
  • [50] Model-free feature screening for ultra-high dimensional competing risks data
    Chen, Xiaolin
    Zhang, Yahui
    Liu, Yi
    Chen, Xiaojing
    [J]. STATISTICS & PROBABILITY LETTERS, 2020, 164