A semi-parametric approach to feature selection in high-dimensional linear regression models

被引:2
|
作者
Liu, Yuyang [1 ]
Pi, Pengfei [1 ]
Luo, Shan [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-parametric; Sequential feature selection; Estimated partial profile score; Score matching; Selection consistency; VARIABLE SELECTION; ROBUST; LASSO;
D O I
10.1007/s00180-022-01254-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a novel semi-parametric approach to feature selection in high-dimensional linear regression models. This sequential procedure is robust to the unknown error distribution including heavy-tailed distributions. At each step of this procedure, we add the feature with the largest absolute value of the estimated partial profile score into the model. The procedure terminates when a model selection criterion is met. Theoretically, we establish this procedure's selection consistency under regular conditions. Computationally, extensive numerical studies together with a real data application are provided to demonstrate its advantage over existing representative methods in terms of selection accuracy and computation cost.
引用
收藏
页码:979 / 1000
页数:22
相关论文
共 50 条
  • [21] A One Covariate at a Time, Multiple Testing Approach to Variable Selection in High-Dimensional Linear Regression Models
    Chudik, A.
    Kapetanios, G.
    Pesaran, M. Hashem
    [J]. ECONOMETRICA, 2018, 86 (04) : 1479 - 1512
  • [22] Drawing inferences for high-dimensional linear models: A selection-assisted partial regression and smoothing approach
    Fei, Zhe
    Zhu, Ji
    Banerjee, Moulinath
    Li, Yi
    [J]. BIOMETRICS, 2019, 75 (02) : 551 - 561
  • [23] Mixtures of Semi-Parametric Generalised Linear Models
    Millard, Salomon M.
    Kanfer, Frans H. J.
    [J]. SYMMETRY-BASEL, 2022, 14 (02):
  • [24] A Semi-parametric Model for Decision Making in High-Dimensional Sensory Discrimination Tasks
    Keeley, Stephen
    Letham, Benjamin
    Sanders, Craig
    Tymms, Chase
    Shvartsman, Michael
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 40 - 47
  • [25] Feature selection in finite mixture of sparse normal linear models in high-dimensional feature space
    Khalili, Abbas
    Chen, Jiahua
    Lin, Shili
    [J]. BIOSTATISTICS, 2011, 12 (01) : 156 - 172
  • [26] Consistent group selection in high-dimensional linear regression
    Wei, Fengrong
    Huang, Jian
    [J]. BERNOULLI, 2010, 16 (04) : 1369 - 1384
  • [27] Measurement errors in semi-parametric generalised regression models
    Hattab, Mohammad W.
    Ruppert, David
    [J]. AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2023, 65 (04) : 344 - 363
  • [28] A Model Selection Criterion for High-Dimensional Linear Regression
    Owrang, Arash
    Jansson, Magnus
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (13) : 3436 - 3446
  • [29] Consistency of the semi-parametric MLE in linear regression models with interval-censored data
    Yu, QQ
    Wong, GYC
    Kong, FH
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2006, 33 (02) : 367 - 378
  • [30] Meta-heuristic algorithms for parameter estimation of semi-parametric linear regression models
    Zheng, Guoqing
    Zhang, Pingjian
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 51 (02) : 801 - 808