Model-free feature screening for ultrahigh-dimensional data conditional on some variables

被引:0
|
作者
Yi Liu
Qihua Wang
机构
[1] Chinese Academy of Sciences,Academy of Mathematics and Systems Science
[2] China University of Petroleum,College of Science
[3] Shenzhen University,Institute of Statistical Science
关键词
Conditional distance correlation; Feature selection; Sure screening property; High-dimensional data;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, the conditional distance correlation (CDC) is used as a measure of correlation to develop a conditional feature screening procedure given some significant variables for ultrahigh-dimensional data. The proposed procedure is model free and is called conditional distance correlation-sure independence screening (CDC-SIS for short). That is, we do not specify any model structure between the response and the predictors, which is appealing in some practical problems of ultrahigh-dimensional data analysis. The sure screening property of the CDC-SIS is proved and a simulation study was conducted to evaluate the finite sample performances. Real data analysis is used to illustrate the proposed method. The results indicate that CDC-SIS performs well.
引用
收藏
页码:283 / 301
页数:18
相关论文
共 50 条
  • [41] Feature Screening for Ultrahigh-dimensional Censored Data with Varying Coefficient Single-index Model
    Yi LIU
    [J]. Acta Mathematicae Applicatae Sinica, 2019, 35 (04) : 845 - 861
  • [42] Feature Screening for Ultrahigh-dimensional Censored Data with Varying Coefficient Single-index Model
    Yi Liu
    [J]. Acta Mathematicae Applicatae Sinica, English Series, 2019, 35 : 845 - 861
  • [43] Group Feature Screening Based on Information Gain Ratio for Ultrahigh-Dimensional Data
    Wang, Zhongzheng
    Deng, Guangming
    Yu, Jianqi
    [J]. JOURNAL OF MATHEMATICS, 2022, 2022
  • [44] On Exact Feature Screening in Ultrahigh-Dimensional Binary Classification
    Roy, Sarbojit
    Sarkar, Soham
    Dutta, Subhajit
    Ghosh, Anil K.
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024, 33 (02) : 448 - 462
  • [45] Feature screening for ultrahigh-dimensional additive logistic models
    Wang, Lei
    Ma, Xuejun
    Zhang, Jingxiao
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2020, 205 : 306 - 317
  • [46] Independent feature screening for ultrahigh-dimensional models with interactions
    Yunquan Song
    Xuehu Zhu
    Lu Lin
    [J]. Journal of the Korean Statistical Society, 2014, 43 : 567 - 583
  • [47] Independent feature screening for ultrahigh-dimensional models with interactions
    Song, Yunquan
    Zhu, Xuehu
    Lin, Lu
    [J]. JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2014, 43 (04) : 567 - 583
  • [48] Conditional distance correlation screening for sparse ultrahigh-dimensional models
    Song, Fengli
    Chen, Yurong
    Lai, Peng
    [J]. APPLIED MATHEMATICAL MODELLING, 2020, 81 : 232 - 252
  • [49] Model-free feature screening for ultrahigh dimensional data via a Pearson chi-square based index
    Ma, Weidong
    Xiao, Jingsong
    Yang, Ying
    Ye, Fei
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2022, 92 (15) : 3222 - 3248
  • [50] Model-free feature screening for high-dimensional survival data
    Lin, Yuanyuan
    Liu, Xianhui
    Hao, Meiling
    [J]. SCIENCE CHINA-MATHEMATICS, 2018, 61 (09) : 1617 - 1636