Model-free feature screening for ultrahigh-dimensional data conditional on some variables

Cited by: 10
Authors
Liu, Yi [1 ,2 ]
Wang, Qihua [1 ,3 ]
Affiliations
[1] Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R China
[2] China Univ Petr, Coll Sci, Qingdao 266580, Peoples R China
[3] Shenzhen Univ, Inst Stat Sci, Shenzhen 518006, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Conditional distance correlation; Feature selection; Sure screening property; High-dimensional data; VARYING COEFFICIENT MODELS; DISTANCE CORRELATION; FEATURE-SELECTION;
DOI
10.1007/s10463-016-0597-2
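Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Discipline classification codes
020208 ; 070103 ; 0714 ;
Abstract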
In this paper, the conditional distance correlation (CDC) is used as the measure of dependence to develop a conditional feature screening procedure for ultrahigh-dimensional data, given some variables already known to be significant. The proposed procedure, called conditional distance correlation-sure independence screening (CDC-SIS for short), is model free: it does not specify any model structure between the response and the predictors, which is appealing in some practical problems of ultrahigh-dimensional data analysis. The sure screening property of CDC-SIS is proved, and a simulation study is conducted to evaluate its finite-sample performance. A real data analysis illustrates the proposed method. The results indicate that CDC-SIS performs well.
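The screening idea in the abstract can be illustrated with a minimal sketch. The abstract does not state the CDC statistic, so the code below swaps in the ordinary (unconditional) sample distance correlation and therefore omits the conditioning variables; it is not the authors' CDC-SIS. The function names `distance_correlation` and `dc_screen`, the simulated data, and the choice to keep the top `d` predictors are all assumptions made for illustration.

```python
import numpy as np


def _centered_distances(v):
    """Double-centered Euclidean distance matrix of a sample."""
    v = np.asarray(v, dtype=float)
    v = v.reshape(len(v), -1)  # treat 1-D input as n x 1
    d = np.linalg.norm(v[:, None, :] - v[None, :, :], axis=2)
    return d - d.mean(axis=0) - d.mean(axis=1)[:, None] + d.mean()


def distance_correlation(x, y):
    """Sample distance correlation (V-statistic form), a value in [0, 1]."""
    a = _centered_distances(x)
    b = _centered_distances(y)
    dcov2 = max((a * b).mean(), 0.0)  # guard against tiny negative rounding
    dvar2 = (a * a).mean() * (b * b).mean()
    if dvar2 <= 0.0:
        return 0.0  # a constant sample carries no dependence information
    return float(np.sqrt(dcov2 / np.sqrt(dvar2)))


def dc_screen(X, y, d):
    """Rank the columns of X by distance correlation with y; keep the top d.

    Unconditional analogue only: CDC-SIS would instead rank predictors by
    their conditional distance correlation given the known variables.
    """
    scores = np.array(
        [distance_correlation(X[:, j], y) for j in range(X.shape[1])]
    )
    order = np.argsort(-scores)
    return order[:d], scores


# Hypothetical data: the response depends nonlinearly on column 0 only.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 30))
y = 2.0 * X[:, 0] ** 2 + 0.1 * rng.standard_normal(200)
kept, scores = dc_screen(X, y, d=5)
```

Because the ranking uses no regression model, a purely quadratic signal such as the one simulated here can still be picked up, which is the sense in which this kind of screening is model free.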
Pages: 283 - 301
Number of pages: 19
Related papers
50 items in total
  • [31] Model-free conditional feature screening for ultra-high dimensional right censored data
    Chen, Xiaolin
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2018, 88 (12) : 2425 - 2446
  • [32] The Sparse MLE for Ultrahigh-Dimensional Feature Screening
    Xu, Chen
    Chen, Jiahua
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (507) : 1257 - 1269
  • [33] Interaction Screening for Ultrahigh-Dimensional Data
    Hao, Ning
    Zhang, Hao Helen
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (507) : 1285 - 1301
  • [34] Feature screening in ultrahigh-dimensional varying-coefficient Cox model
    Yang, Guangren
    Zhang, Ling
    Li, Runze
    Huang, Yuan
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 171 : 284 - 297
  • [35] The cumulative Kolmogorov filter for model-free screening in ultrahigh dimensional data
    Kim, Arlene Kyoung Hee
    Shin, Seung Jun
    [J]. STATISTICS & PROBABILITY LETTERS, 2017, 126 : 238 - 243
  • [36] Conditional screening for ultrahigh-dimensional survival data in case-cohort studies
    Zhang, Jing
    Zhou, Haibo
    Liu, Yanyan
    Cai, Jianwen
    [J]. LIFETIME DATA ANALYSIS, 2021, 27 (04) : 632 - 661
  • [37] Feature Screening for Ultrahigh-dimensional Censored Data with Varying Coefficient Single-index Model
    Liu, Yi
    [J]. ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2019, 35 (04) : 845 - 861
  • [38] Model-Free Conditional Feature Screening with FDR Control
    Tong, Zhaoxue
    Cai, Zhanrui
    Yang, Songshan
    Li, Runze
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (544) : 2575 - 2587
  • [39] A simple model-free survival conditional feature screening
    Chen, Xiaolin
    Zhang, Yahui
    Chen, Xiaojing
    Liu, Yi
    [J]. STATISTICS & PROBABILITY LETTERS, 2019, 146 : 156 - 160
  • [40] Conditional screening for ultrahigh-dimensional survival data in case-cohort studies
    Jing Zhang
    Haibo Zhou
    Yanyan Liu
    Jianwen Cai
    [J]. Lifetime Data Analysis, 2021, 27 : 632 - 661