A nonparametric feature screening method for ultrahigh-dimensional missing response

Cited by: 9
Authors
Li, Xiaoxia [1 ,2 ]
Tang, Niansheng [1 ,2 ]
Xie, Jinhan [1 ,2 ]
Yan, Xiaodong [1 ,2 ]
Affiliations
[1] Yunnan Univ, Yunnan Key Lab Stat Modeling & Data Anal, Kunming 650500, Yunnan, Peoples R China
[2] Shandong Univ, Sch Econ, Jinan 250100, Shandong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Feature screening; Imputation; Marginal Spearman rank correlation; Missing at random; Ultrahigh-dimensional data; VARIABLE SELECTION; KOLMOGOROV FILTER; MODEL SELECTION; LINEAR-MODELS; LIKELIHOOD; SURVIVAL;
DOI
10.1016/j.csda.2019.106828
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
This paper addresses the feature screening issue for ultrahigh-dimensional data with responses missing at random. A novel nonparametric feature screening procedure is developed to identify the important features via the conditionally imputing marginal Spearman rank correlation. The proposed nonparametric screening approach has several desirable merits. First, it is nonparametric without assuming any regression form of predictors on response variable. Second, it is robust to outliers and heavy-tailed data. Third, under some regularity conditions, it is shown that the proposed feature screening procedure has the sure screening and ranking consistency properties. Simulation studies evidence that the proposed screening procedure outperforms several existing model-free screening procedures. An example taken from the microarray diffuse large-B-cell lymphoma study is used to illustrate the proposed methodologies. (C) 2019 Elsevier B.V. All rights reserved.
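The screening idea in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: the abstract does not spell out the conditional imputation step, so this sketch assumes a simpler complete-case variant that computes the marginal Spearman rank correlation using only the observations whose response is present. The function names `ranks` and `spearman_screen`, the simulated data, and the cutoff `d = 20` are all hypothetical choices made for illustration.

```python
import numpy as np

def ranks(v):
    """Ranks 1..n of a 1-D array (assumes continuous data, so no ties)."""
    order = np.argsort(v)
    r = np.empty(len(v), dtype=float)
    r[order] = np.arange(1, len(v) + 1)
    return r

def spearman_screen(X, y, d):
    """Keep the d features with the largest |Spearman correlation| with y.

    Assumption: rows with a missing response (NaN) are dropped; the
    paper's conditional imputation is NOT reproduced here.
    """
    observed = ~np.isnan(y)
    Xo, yo = X[observed], y[observed]
    ry = ranks(yo)
    ry -= ry.mean()
    scores = np.empty(X.shape[1])
    for j in range(X.shape[1]):
        rx = ranks(Xo[:, j])
        rx -= rx.mean()
        # Pearson correlation of the ranks is the Spearman correlation
        scores[j] = abs(rx @ ry) / np.sqrt((rx @ rx) * (ry @ ry))
    keep = np.argsort(-scores)[:d]  # indices sorted by decreasing score
    return keep, scores

# Simulated example: n = 200 samples, p = 1000 features, two active features.
rng = np.random.default_rng(0)
n, p = 200, 1000
X = rng.standard_normal((n, p))
# Nonlinear but monotone signal with heavy-tailed noise
y = np.exp(X[:, 0] + X[:, 1]) + 0.1 * rng.standard_t(3, size=n)
# Missing at random: the missingness probability depends only on X[:, 2]
p_miss = 0.4 / (1.0 + np.exp(-X[:, 2]))
y[rng.random(n) < p_miss] = np.nan

keep, scores = spearman_screen(X, y, d=20)
print("top-ranked features:", keep[:5])
```

Because the score depends on the data only through ranks, a monotone transformation of the response or a few extreme values leaves the ordering of features largely unchanged, which is the robustness property the abstract refers to.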
Pages: 18
Related papers
50 in total
  • [41] Model-free feature screening for ultrahigh-dimensional data conditional on some variables
    Liu, Yi
    Wang, Qihua
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2018, 70 (02) : 283 - 301
  • [42] Non-marginal feature screening for additive hazard model with ultrahigh-dimensional covariates
    Liu, Zili
    Xiong, Zikang
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2022, 51 (06) : 1876 - 1894
  • [43] FEATURE SCREENING FOR TIME-VARYING COEFFICIENT MODELS WITH ULTRAHIGH-DIMENSIONAL LONGITUDINAL DATA
    Chu, Wanghuan
    Li, Runze
    Reimherr, Matthew
    ANNALS OF APPLIED STATISTICS, 2016, 10 (02) : 596 - 617
  • [44] FORWARD ADDITIVE REGRESSION FOR ULTRAHIGH-DIMENSIONAL NONPARAMETRIC ADDITIVE MODELS
    Zhong, Wei
    Duan, Sunpeng
    Zhu, Liping
    STATISTICA SINICA, 2020, 30 (01) : 175 - 192
  • [45] Feature screening and variable selection for partially linear models with ultrahigh-dimensional longitudinal data
    Liu, Jingyuan
    NEUROCOMPUTING, 2016, 195 : 202 - 210
  • [46] Group feature screening based on Gini impurity for ultrahigh-dimensional multi-classification
    Wang, Zhongzheng
    Deng, Guangming
    Xu, Haiyun
    AIMS MATHEMATICS, 2023, 8 (02) : 4342 - 4362
  • [47] Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers
    Peng, Mengjiao
    Xiang, Liming
    STATISTICS IN MEDICINE, 2023, 42 (13) : 2082 - 2100
  • [48] Support vector machine in ultrahigh-dimensional feature space
    Kazemi, Mohammad
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2024, 94 (03) : 517 - 535
  • [49] Feature Screening and Error Variance Estimation for Ultrahigh-Dimensional Linear Model with Measurement Errors
    Cui, Hengjian
    Zou, Feng
    Ling, Li
    COMMUNICATIONS IN MATHEMATICS AND STATISTICS, 2025, 13 (01) : 139 - 171
  • [50] Communication-Efficient Feature Screening for Ultrahigh-Dimensional Data Under Quantile Regression
    Diao, Tianbo
    Li, Bo
    STAT, 2025, 14 (02):