Robust model-free feature screening for ultrahigh dimensional surrogate data

被引:4
|
作者
Lai, Peng [1 ]
Chen, Yuanxing [2 ]
Zhang, Jie [1 ]
Dai, Bingying [2 ]
Zhang, Qingzhao [2 ,3 ,4 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Math & Stat, Nanjing, Jiangsu, Peoples R China
[2] Xiamen Univ, Sch Econ, Dept Stat, Xiamen, Fujian, Peoples R China
[3] Xiamen Univ, Key Lab Econometr, Minist Educ, Xiamen, Fujian, Peoples R China
[4] Xiamen Univ, Wang Yanan Inst Studies Econ, Xiamen, Fujian, Peoples R China
基金
中国国家自然科学基金;
关键词
Ultrahigh dimensional data; missing at random; feature screening; sure screening property; VARIABLE SELECTION; EFFICIENT;
D O I
10.1080/00949655.2019.1690492
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper is concerned with the feature screening for the ultrahigh dimensional data with covariates missing at random, and some surrogate variables are available. We propose a marginal screening procedure based on the augmented inverse probability weighted methods and the nonparametric imputation technique. Our proposed screening method utilizes the surrogate information efficiently to overcome the missing data problem. It is model free and possesses the sure screening property under some regular conditions. Monte Carlo simulation studies and a real data application are conducted to examine the performance of the proposed procedure.
引用
收藏
页码:550 / 569
页数:20
相关论文
共 50 条
  • [41] Model free feature screening with dependent variable in ultrahigh dimensional binary classification
    Lai, Peng
    Song, Fengli
    Chen, Kaiwen
    Liu, Zhi
    [J]. STATISTICS & PROBABILITY LETTERS, 2017, 125 : 141 - 148
  • [42] Model-free data screening and cleaning
    Tarter, Michael E.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2011, 3 (02): : 168 - 176
  • [43] An efficient model-free approach to interaction screening for high dimensional data
    Xiong, Wei
    Pan, Han
    Wang, Jianrong
    Tian, Maozai
    [J]. STATISTICS IN MEDICINE, 2023, 42 (10) : 1583 - 1605
  • [44] Joint model-free feature screening for ultra-high dimensional semi-competing risks data
    Lu, Shuiyun
    Chen, Xiaolin
    Xu, Sheng
    Liu, Chunling
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 147
  • [45] Feature Screening for Ultrahigh Dimensional Categorical Data With Applications
    Huang, Danyang
    Li, Runze
    Wang, Hansheng
    [J]. JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2014, 32 (02) : 237 - 244
  • [46] Model-Free Conditional Feature Screening with FDR Control
    Tong, Zhaoxue
    Cai, Zhanrui
    Yang, Songshan
    Li, Runze
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (544) : 2575 - 2587
  • [47] A simple model-free survival conditional feature screening
    Chen, Xiaolin
    Zhang, Yahui
    Chen, Xiaojing
    Liu, Yi
    [J]. STATISTICS & PROBABILITY LETTERS, 2019, 146 : 156 - 160
  • [48] Model-free conditional feature screening with exposure variables
    Zhou, Yeqing
    Liu, Jingyuan
    Hao, Zhihui
    Zhui, Liping
    [J]. STATISTICS AND ITS INTERFACE, 2019, 12 (02) : 239 - 251
  • [49] FEATURE SCREENING IN ULTRAHIGH DIMENSIONAL COX'S MODEL
    Yang, Guangren
    Yu, Ye
    Lie, Runze
    Buu, Anne
    [J]. STATISTICA SINICA, 2016, 26 (03) : 881 - 901
  • [50] A selective overview of feature screening for ultrahigh-dimensional data
    JingYuan Liu
    Wei Zhong
    RunZe Li
    [J]. Science China Mathematics, 2015, 58 : 1 - 22