A semi-parametric approach to feature selection in high-dimensional linear regression models

被引:2
|
作者
Liu, Yuyang [1 ]
Pi, Pengfei [1 ]
Luo, Shan [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-parametric; Sequential feature selection; Estimated partial profile score; Score matching; Selection consistency; VARIABLE SELECTION; ROBUST; LASSO;
D O I
10.1007/s00180-022-01254-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a novel semi-parametric approach to feature selection in high-dimensional linear regression models. This sequential procedure is robust to the unknown error distribution including heavy-tailed distributions. At each step of this procedure, we add the feature with the largest absolute value of the estimated partial profile score into the model. The procedure terminates when a model selection criterion is met. Theoretically, we establish this procedure's selection consistency under regular conditions. Computationally, extensive numerical studies together with a real data application are provided to demonstrate its advantage over existing representative methods in terms of selection accuracy and computation cost.
引用
收藏
页码:979 / 1000
页数:22
相关论文
共 50 条
  • [31] Meta-heuristic algorithms for parameter estimation of semi-parametric linear regression models
    Zheng, Guoqing
    Zhang, Pingjian
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 51 (02) : 801 - 808
  • [32] Unbiased Prediction and Feature Selection in High-Dimensional Survival Regression
    Laimighofer, Michael
    Krumsiek, Jan
    Buettner, Florian
    Theis, Fabian J.
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2016, 23 (04) : 279 - 290
  • [33] TESTING HIGH-DIMENSIONAL REGRESSION COEFFICIENTS IN LINEAR MODELS
    Zhao, Alex
    Li, Changcheng
    Li, Runze
    Zhang, Zhe
    ANNALS OF STATISTICS, 2024, 52 (05): : 2034 - 2058
  • [34] Tests for high-dimensional partially linear regression models
    Shi, Hongwei
    Yang, Weichao
    Sun, Bowen
    Guo, Xu
    STATISTICAL PAPERS, 2025, 66 (03)
  • [35] Empirical likelihood for high-dimensional linear regression models
    Hong Guo
    Changliang Zou
    Zhaojun Wang
    Bin Chen
    Metrika, 2014, 77 : 921 - 945
  • [36] Empirical likelihood for high-dimensional linear regression models
    Guo, Hong
    Zou, Changliang
    Wang, Zhaojun
    Chen, Bin
    METRIKA, 2014, 77 (07) : 921 - 945
  • [37] A semi-parametric estimator for censored selection models with endogeneity
    Lee, MJ
    Vella, F
    JOURNAL OF ECONOMETRICS, 2006, 130 (02) : 235 - 252
  • [38] A STEPWISE REGRESSION METHOD AND CONSISTENT MODEL SELECTION FOR HIGH-DIMENSIONAL SPARSE LINEAR MODELS
    Ing, Ching-Kang
    Lai, Tze Leung
    STATISTICA SINICA, 2011, 21 (04) : 1473 - 1513
  • [39] RELATIVE COST BASED MODEL SELECTION FOR SPARSE HIGH-DIMENSIONAL LINEAR REGRESSION MODELS
    Gohain, Prakash B.
    Jansson, Magnus
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 5515 - 5519
  • [40] Variable selection in high-dimensional partially linear additive models for composite quantile regression
    Guo, Jie
    Tang, Manlai
    Tian, Maozai
    Zhu, Kai
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2013, 65 : 56 - 67