A nonparametric feature screening method for ultrahigh-dimensional missing response

被引:9
|
作者
Li, Xiaoxia [1 ,2 ]
Tang, Niansheng [1 ,2 ]
Xie, Jinhan [1 ,2 ]
Yan, Xiaodong [1 ,2 ]
机构
[1] Yunnan Univ, Yunnan Key Lab Stat Modeling & Data Anal, Kunming 650500, Yunnan, Peoples R China
[2] Shandong Univ, Sch Econ, Jinan 250100, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature screening; Imputation; Marginal Spearman rank correlation; Missing at random; Ultrahigh-dimensional data; VARIABLE SELECTION; KOLMOGOROV FILTER; MODEL SELECTION; LINEAR-MODELS; LIKELIHOOD; SURVIVAL;
D O I
10.1016/j.csda.2019.106828
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper addresses the feature screening issue for ultrahigh-dimensional data with responses missing at random. A novel nonparametric feature screening procedure is developed to identify the important features via the conditionally imputing marginal Spearman rank correlation. The proposed nonparametric screening approach has several desirable merits. First, it is nonparametric without assuming any regression form of predictors on response variable. Second, it is robust to outliers and heavy-tailed data. Third, under some regularity conditions, it is shown that the proposed feature screening procedure has the sure screening and ranking consistency properties. Simulation studies evidence that the proposed screening procedure outperforms several existing model-free screening procedures. An example taken from the microarray diffuse large-B-cell lymphoma study is used to illustrate the proposed methodologies. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Nonparametric independence feature screening for ultrahigh-dimensional missing data
    Fang, Jianglin
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (10) : 5670 - 5689
  • [2] Nonparametric independence feature screening for ultrahigh-dimensional survival data
    Pan, Jing
    Yu, Yuan
    Zhou, Yong
    [J]. METRIKA, 2018, 81 (07) : 821 - 847
  • [3] Nonparametric independence feature screening for ultrahigh-dimensional survival data
    Jing Pan
    Yuan Yu
    Yong Zhou
    [J]. Metrika, 2018, 81 : 821 - 847
  • [4] Feature Screening for Nonparametric and Semiparametric Models with Ultrahigh-Dimensional Covariates
    Zhang Junying
    Zhang Riquan
    Zhang Jiajia
    [J]. JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2018, 31 (05) : 1350 - 1361
  • [5] Feature Screening for Nonparametric and Semiparametric Models with Ultrahigh-Dimensional Covariates
    Junying Zhang
    Riquan Zhang
    Jiajia Zhang
    [J]. Journal of Systems Science and Complexity, 2018, 31 : 1350 - 1361
  • [6] Feature Screening for Nonparametric and Semiparametric Models with Ultrahigh-Dimensional Covariates
    ZHANG Junying
    ZHANG Riquan
    ZHANG Jiajia
    [J]. Journal of Systems Science & Complexity, 2018, 31 (05) : 1350 - 1361
  • [7] Group feature screening for ultrahigh-dimensional data missing at random
    He, Hanji
    Li, Meini
    Deng, Guangming
    [J]. AIMS MATHEMATICS, 2024, 9 (02): : 4032 - 4056
  • [8] A new nonparametric screening method for ultrahigh-dimensional survival data
    Liu, Yanyan
    Zhang, Jing
    Zhao, Xingqiu
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 119 : 74 - 85
  • [9] Feature screening in ultrahigh-dimensional partially linear models with missing responses at random
    Tang, Niansheng
    Xia, Linli
    Yan, Xiaodong
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2019, 133 : 208 - 227
  • [10] The Sparse MLE for Ultrahigh-Dimensional Feature Screening
    Xu, Chen
    Chen, Jiahua
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (507) : 1257 - 1269