Unified mean-variance feature screening for ultrahigh-dimensional regression

被引:0
|
作者
Liming Wang
Xingxiang Li
Xiaoqing Wang
Peng Lai
机构
[1] Nanjing University of Finance and Economics Hongshan College,School of Mathematics and Statistics
[2] Nanjing University of Information Science and Technology,School of Mathematics and Statistics
[3] Xi’an Jiaotong University,School of Public Administration
[4] Nanjing University of Finance and Economics,undefined
来源
Computational Statistics | 2022年 / 37卷
关键词
Ultrahigh-dimensional data; Mean-variance; Kernel smoothing estimate; Unified marginal utility; Sure screening property;
D O I
暂无
中图分类号
学科分类号
摘要
Feature screening is a popular and efficient statistical technique in processing ultrahigh-dimensional data. When a regression model consists both categorical and continuous predictors, a unified feature screening procedure is needed. Thus, we propose a unified mean-variance sure independence screening (UMV-SIS) for this setup. The mean-variance (MV), an effective utility to measure the dependence between two random variables, is widely used in feature screening for discriminant analysis. In this paper, we advocate using the kernel smoothing method to estimate MV between two continuous variables, thereby extending it to screen categorical and continuous predictors simultaneously. Besides the uniformity for screening, UMV-SIS is a model-free procedure without any specification of a regression model; this broadens the scope of its application. In theory, we show that the UMV-SIS procedure has the sure screening and ranking consistency properties under mild conditions. To solve some difficulties in marginal feature screening for linear model and further enhance the screening performance of our proposed method, an iterative UMV-SIS procedure is developed. The promising performances of the new method are supported by extensive numerical examples.
引用
收藏
页码:1887 / 1918
页数:31
相关论文
共 50 条
  • [1] Unified mean-variance feature screening for ultrahigh-dimensional regression
    Wang, Liming
    Li, Xingxiang
    Wang, Xiaoqing
    Lai, Peng
    [J]. COMPUTATIONAL STATISTICS, 2022, 37 (04) : 1887 - 1918
  • [2] A modified mean-variance feature-screening procedure for ultrahigh-dimensional discriminant analysis
    He, Shengmei
    Ma, Shuangge
    Xu, Wangli
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2019, 137 : 155 - 169
  • [3] The Sparse MLE for Ultrahigh-Dimensional Feature Screening
    Xu, Chen
    Chen, Jiahua
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (507) : 1257 - 1269
  • [4] Feature Screening and Error Variance Estimation for Ultrahigh-Dimensional Linear Model with Measurement Errors
    Cui, Hengjian
    Zou, Feng
    Ling, Li
    [J]. COMMUNICATIONS IN MATHEMATICS AND STATISTICS, 2023,
  • [5] Fused mean-variance filter for feature screening
    Yan, Xiaodong
    Tang, Niansheng
    Xie, Jinhan
    Ding, Xianwen
    Wang, Zhiqiang
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 122 : 18 - 32
  • [6] On Exact Feature Screening in Ultrahigh-Dimensional Binary Classification
    Roy, Sarbojit
    Sarkar, Soham
    Dutta, Subhajit
    Ghosh, Anil K.
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024, 33 (02) : 448 - 462
  • [7] Feature screening for ultrahigh-dimensional additive logistic models
    Wang, Lei
    Ma, Xuejun
    Zhang, Jingxiao
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2020, 205 : 306 - 317
  • [8] A selective overview of feature screening for ultrahigh-dimensional data
    LIU JingYuan
    ZHONG Wei
    LI RunZe
    [J]. Science China Mathematics, 2015, 58 (10) : 2033 - 2054
  • [9] A selective overview of feature screening for ultrahigh-dimensional data
    Liu JingYuan
    Zhong Wei
    Li RunZe
    [J]. SCIENCE CHINA-MATHEMATICS, 2015, 58 (10) : 2033 - 2054
  • [10] A selective overview of feature screening for ultrahigh-dimensional data
    JingYuan Liu
    Wei Zhong
    RunZe Li
    [J]. Science China Mathematics, 2015, 58 : 1 - 22