A semi-parametric approach to feature selection in high-dimensional linear regression models

被引:2
|
作者
Liu, Yuyang [1 ]
Pi, Pengfei [1 ]
Luo, Shan [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-parametric; Sequential feature selection; Estimated partial profile score; Score matching; Selection consistency; VARIABLE SELECTION; ROBUST; LASSO;
D O I
10.1007/s00180-022-01254-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a novel semi-parametric approach to feature selection in high-dimensional linear regression models. This sequential procedure is robust to the unknown error distribution including heavy-tailed distributions. At each step of this procedure, we add the feature with the largest absolute value of the estimated partial profile score into the model. The procedure terminates when a model selection criterion is met. Theoretically, we establish this procedure's selection consistency under regular conditions. Computationally, extensive numerical studies together with a real data application are provided to demonstrate its advantage over existing representative methods in terms of selection accuracy and computation cost.
引用
收藏
页码:979 / 1000
页数:22
相关论文
共 50 条
  • [21] Generalized dynamic semi-parametric factor models for high-dimensional non-stationary time series
    Song, Song
    Haerdle, Wolfgang K.
    Ritov, Ya'acov
    ECONOMETRICS JOURNAL, 2014, 17 (02): : S101 - S131
  • [22] Mixtures of Semi-Parametric Generalised Linear Models
    Millard, Salomon M.
    Kanfer, Frans H. J.
    SYMMETRY-BASEL, 2022, 14 (02):
  • [23] Drawing inferences for high-dimensional linear models: A selection-assisted partial regression and smoothing approach
    Fei, Zhe
    Zhu, Ji
    Banerjee, Moulinath
    Li, Yi
    BIOMETRICS, 2019, 75 (02) : 551 - 561
  • [24] A One Covariate at a Time, Multiple Testing Approach to Variable Selection in High-Dimensional Linear Regression Models
    Chudik, A.
    Kapetanios, G.
    Pesaran, M. Hashem
    ECONOMETRICA, 2018, 86 (04) : 1479 - 1512
  • [25] A Semi-parametric Model for Decision Making in High-Dimensional Sensory Discrimination Tasks
    Keeley, Stephen
    Letham, Benjamin
    Sanders, Craig
    Tymms, Chase
    Shvartsman, Michael
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 40 - 47
  • [26] Measurement errors in semi-parametric generalised regression models
    Hattab, Mohammad W.
    Ruppert, David
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2023, 65 (04) : 344 - 363
  • [27] Feature selection in finite mixture of sparse normal linear models in high-dimensional feature space
    Khalili, Abbas
    Chen, Jiahua
    Lin, Shili
    BIOSTATISTICS, 2011, 12 (01) : 156 - 172
  • [28] A Model Selection Criterion for High-Dimensional Linear Regression
    Owrang, Arash
    Jansson, Magnus
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (13) : 3436 - 3446
  • [29] Consistent group selection in high-dimensional linear regression
    Wei, Fengrong
    Huang, Jian
    BERNOULLI, 2010, 16 (04) : 1369 - 1384
  • [30] Consistency of the semi-parametric MLE in linear regression models with interval-censored data
    Yu, QQ
    Wong, GYC
    Kong, FH
    SCANDINAVIAN JOURNAL OF STATISTICS, 2006, 33 (02) : 367 - 378