A nonparametric feature screening method for ultrahigh-dimensional missing response

被引:9
|
作者
Li, Xiaoxia [1 ,2 ]
Tang, Niansheng [1 ,2 ]
Xie, Jinhan [1 ,2 ]
Yan, Xiaodong [1 ,2 ]
机构
[1] Yunnan Univ, Yunnan Key Lab Stat Modeling & Data Anal, Kunming 650500, Yunnan, Peoples R China
[2] Shandong Univ, Sch Econ, Jinan 250100, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature screening; Imputation; Marginal Spearman rank correlation; Missing at random; Ultrahigh-dimensional data; VARIABLE SELECTION; KOLMOGOROV FILTER; MODEL SELECTION; LINEAR-MODELS; LIKELIHOOD; SURVIVAL;
D O I
10.1016/j.csda.2019.106828
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper addresses the feature screening issue for ultrahigh-dimensional data with responses missing at random. A novel nonparametric feature screening procedure is developed to identify the important features via the conditionally imputing marginal Spearman rank correlation. The proposed nonparametric screening approach has several desirable merits. First, it is nonparametric without assuming any regression form of predictors on response variable. Second, it is robust to outliers and heavy-tailed data. Third, under some regularity conditions, it is shown that the proposed feature screening procedure has the sure screening and ranking consistency properties. Simulation studies evidence that the proposed screening procedure outperforms several existing model-free screening procedures. An example taken from the microarray diffuse large-B-cell lymphoma study is used to illustrate the proposed methodologies. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Covariate Information Number for Feature Screening in Ultrahigh-Dimensional Supervised Problems
    Nandy, Debmalya
    Chiaromonte, Francesca
    Li, Runze
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (539) : 1516 - 1529
  • [32] Unified mean-variance feature screening for ultrahigh-dimensional regression
    Wang, Liming
    Li, Xingxiang
    Wang, Xiaoqing
    Lai, Peng
    COMPUTATIONAL STATISTICS, 2022, 37 (04) : 1887 - 1918
  • [33] Fast robust feature screening for ultrahigh-dimensional varying coefficient models
    Ma, Xuejun
    Chen, Xin
    Zhang, Jingxiao
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2017, 87 (04) : 724 - 732
  • [34] Interaction Screening for Ultrahigh-Dimensional Data
    Hao, Ning
    Zhang, Hao Helen
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (507) : 1285 - 1301
  • [35] Group Feature Screening Based on Information Gain Ratio for Ultrahigh-Dimensional Data
    Wang, Zhongzheng
    Deng, Guangming
    Yu, Jianqi
    JOURNAL OF MATHEMATICS, 2022, 2022
  • [36] Grouped feature screening for ultrahigh-dimensional classification via Gini distance correlation
    Sang, Yongli
    Dang, Xin
    JOURNAL OF MULTIVARIATE ANALYSIS, 2024, 204
  • [37] An efficient algorithm for joint feature screening in ultrahigh-dimensional Cox’s model
    Xiaolin Chen
    Catherine Chunling Liu
    Sheng Xu
    Computational Statistics, 2021, 36 : 885 - 910
  • [38] An efficient algorithm for joint feature screening in ultrahigh-dimensional Cox's model
    Chen, Xiaolin
    Liu, Catherine Chunling
    Xu, Sheng
    COMPUTATIONAL STATISTICS, 2021, 36 (02) : 885 - 910
  • [39] Robust feature screening for multi-response trans-elliptical regression model with ultrahigh-dimensional covariates
    He, Yong
    Sun, Hao
    Ji, Jiadong
    Zhang, Xinsheng
    RANDOM MATRICES-THEORY AND APPLICATIONS, 2020, 9 (04)
  • [40] Model-free feature screening for ultrahigh-dimensional data conditional on some variables
    Yi Liu
    Qihua Wang
    Annals of the Institute of Statistical Mathematics, 2018, 70 : 283 - 301