Model free feature screening for ultrahigh dimensional data with responses missing at random

被引:28
|
作者
Lai, Peng [1 ]
Liu, Yiming [2 ]
Liu, Zhi [3 ,4 ]
Wan, Yi [3 ,4 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Math & Stat, Nanjing, Jiangsu, Peoples R China
[2] Nanyang Technol Univ, Div Math Sci, Singapore, Singapore
[3] Univ Macau, Zhuhai, Peoples R China
[4] UMacau Res Inst, Zhuhai, Peoples R China
基金
中国国家自然科学基金;
关键词
Ultrahigh dimensional data; Missing at random; Feature screening; Sure screening property; NONCONCAVE PENALIZED LIKELIHOOD; COX REGRESSION-ANALYSIS; GENE-EXPRESSION DATA; ORACLE PROPERTIES; SELECTION; LASSO;
D O I
10.1016/j.csda.2016.08.008
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The paper concerns the feature screening for the ultrahigh dimensional data with responses missing at random. A model free feature screening procedure based on the inverse probability weighted methods has been proposed, where the Kolmogorov filter method is used to screen the important features under an unknown propensity score function. The suggested screening procedure has several desirable advantages. First, it has property of robust to heavy-tailed distributions of predictors and the presence of potential outliers. Second, it is a model free procedure with mild model assumptions. Third, it can deal with the missing data problem with responses missing at random. Monte Carlo simulation studies are conducted to examine the performance of the proposed procedure and a real data application is also conducted to evaluate and illustrate the proposed methods. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:201 / 216
页数:16
相关论文
共 50 条
  • [1] FEATURE SCREENING VIA DISTANCE CORRELATION FOR ULTRAHIGH DIMENSIONAL DATA WITH RESPONSES MISSING AT RANDOM
    Xia, Linli
    Tang, Niansheng
    [J]. STATISTICA SINICA, 2023, 33 : 1169 - 1191
  • [2] Group feature screening for ultrahigh-dimensional data missing at random
    He, Hanji
    Li, Meini
    Deng, Guangming
    [J]. AIMS MATHEMATICS, 2024, 9 (02): : 4032 - 4056
  • [3] Feature screening for ultrahigh dimensional categorical data with covariates missing at random
    Ni, Lyu
    Fang, Fang
    Shao, Jun
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 142
  • [4] Feature screening in ultrahigh-dimensional partially linear models with missing responses at random
    Tang, Niansheng
    Xia, Linli
    Yan, Xiaodong
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2019, 133 : 208 - 227
  • [5] Model-Free Feature Screening for Ultrahigh-Dimensional Data
    Zhu, Li-Ping
    Li, Lexin
    Li, Runze
    Zhu, Li-Xing
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (496) : 1464 - 1475
  • [6] Feature screening for ultrahigh-dimensional survival data when failure indicators are missing at random
    Fang, Jianglin
    [J]. STATISTICAL PAPERS, 2021, 62 (03) : 1141 - 1166
  • [7] Feature screening for ultrahigh-dimensional survival data when failure indicators are missing at random
    Jianglin Fang
    [J]. Statistical Papers, 2021, 62 : 1141 - 1166
  • [8] Model-free conditional independence feature screening for ultrahigh dimensional data
    WANG Lu Heng
    LIU Jing Yuan
    LI Yong
    LI Run Ze
    [J]. Science China Mathematics, 2017, 60 (03) : 551 - 568
  • [9] Robust model-free feature screening for ultrahigh dimensional surrogate data
    Lai, Peng
    Chen, Yuanxing
    Zhang, Jie
    Dai, Bingying
    Zhang, Qingzhao
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2020, 90 (03) : 550 - 569
  • [10] Model-free conditional independence feature screening for ultrahigh dimensional data
    Wang LuHeng
    Liu JingYuan
    Li Yong
    Li RunZe
    [J]. SCIENCE CHINA-MATHEMATICS, 2017, 60 (03) : 551 - 568