Unified mean-variance feature screening for ultrahigh-dimensional regression

被引:0
|
作者
Liming Wang
Xingxiang Li
Xiaoqing Wang
Peng Lai
机构
[1] Nanjing University of Finance and Economics Hongshan College,School of Mathematics and Statistics
[2] Nanjing University of Information Science and Technology,School of Mathematics and Statistics
[3] Xi’an Jiaotong University,School of Public Administration
[4] Nanjing University of Finance and Economics,undefined
来源
Computational Statistics | 2022年 / 37卷
关键词
Ultrahigh-dimensional data; Mean-variance; Kernel smoothing estimate; Unified marginal utility; Sure screening property;
D O I
暂无
中图分类号
学科分类号
摘要
Feature screening is a popular and efficient statistical technique in processing ultrahigh-dimensional data. When a regression model consists both categorical and continuous predictors, a unified feature screening procedure is needed. Thus, we propose a unified mean-variance sure independence screening (UMV-SIS) for this setup. The mean-variance (MV), an effective utility to measure the dependence between two random variables, is widely used in feature screening for discriminant analysis. In this paper, we advocate using the kernel smoothing method to estimate MV between two continuous variables, thereby extending it to screen categorical and continuous predictors simultaneously. Besides the uniformity for screening, UMV-SIS is a model-free procedure without any specification of a regression model; this broadens the scope of its application. In theory, we show that the UMV-SIS procedure has the sure screening and ranking consistency properties under mild conditions. To solve some difficulties in marginal feature screening for linear model and further enhance the screening performance of our proposed method, an iterative UMV-SIS procedure is developed. The promising performances of the new method are supported by extensive numerical examples.
引用
收藏
页码:1887 / 1918
页数:31
相关论文
共 50 条
  • [21] Quantile-Composited Feature Screening for Ultrahigh-Dimensional Data
    Chen, Shuaishuai
    Lu, Jun
    [J]. MATHEMATICS, 2023, 11 (10)
  • [22] Efficient feature screening for ultrahigh-dimensional varying coefficient models
    Chen, Xin
    Ma, Xuejun
    Wang, Xueqin
    Zhang, Jingxiao
    [J]. STATISTICS AND ITS INTERFACE, 2017, 10 (03) : 407 - 412
  • [23] Feature Screening for Nonparametric and Semiparametric Models with Ultrahigh-Dimensional Covariates
    ZHANG Junying
    ZHANG Riquan
    ZHANG Jiajia
    [J]. Journal of Systems Science & Complexity, 2018, 31 (05) : 1350 - 1361
  • [24] Nonparametric independence feature screening for ultrahigh-dimensional missing data
    Fang, Jianglin
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (10) : 5670 - 5689
  • [25] Feature screening for ultrahigh-dimensional binary classification via linear projection
    Lai, Peng
    Wang, Mingyue
    Song, Fengli
    Zhou, Yanqiu
    [J]. AIMS MATHEMATICS, 2023, 8 (06): : 14270 - 14287
  • [26] Feature screening in ultrahigh-dimensional varying-coefficient Cox model
    Yang, Guangren
    Zhang, Ling
    Li, Runze
    Huang, Yuan
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 171 : 284 - 297
  • [27] FEATURE SCREENING IN ULTRAHIGH-DIMENSIONAL GENERALIZED VARYING-COEFFICIENT MODELS
    Yang, Guangren
    Yang, Songshan
    Li, Runze
    [J]. STATISTICA SINICA, 2020, 30 (02) : 1049 - 1067
  • [28] Covariate Information Number for Feature Screening in Ultrahigh-Dimensional Supervised Problems
    Nandy, Debmalya
    Chiaromonte, Francesca
    Li, Runze
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (539) : 1516 - 1529
  • [29] Fast robust feature screening for ultrahigh-dimensional varying coefficient models
    Ma, Xuejun
    Chen, Xin
    Zhang, Jingxiao
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2017, 87 (04) : 724 - 732
  • [30] Interaction Screening for Ultrahigh-Dimensional Data
    Hao, Ning
    Zhang, Hao Helen
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (507) : 1285 - 1301