A semi-parametric approach to feature selection in high-dimensional linear regression models

Cited by: 2
Authors
Liu, Yuyang [1]
Pi, Pengfei [1]
Luo, Shan [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Semi-parametric; Sequential feature selection; Estimated partial profile score; Score matching; Selection consistency; VARIABLE SELECTION; ROBUST; LASSO;
DOI
10.1007/s00180-022-01254-z
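Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714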
Abstract
We propose a novel semi-parametric approach to feature selection in high-dimensional linear regression models. This sequential procedure is robust to the unknown error distribution including heavy-tailed distributions. At each step of this procedure, we add the feature with the largest absolute value of the estimated partial profile score into the model. The procedure terminates when a model selection criterion is met. Theoretically, we establish this procedure's selection consistency under regular conditions. Computationally, extensive numerical studies together with a real data application are provided to demonstrate its advantage over existing representative methods in terms of selection accuracy and computation cost.
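The sequential procedure described in the abstract (add the feature with the largest absolute score, stop when a model selection criterion is met) can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not define the estimated partial profile score or name the stopping criterion, so the sketch substitutes the absolute inner product of each standardized feature with the current least-squares residual as the score, and an EBIC-style criterion as the stopping rule. The function name `sequential_select` and the parameter `gamma` are hypothetical.

```python
import numpy as np

def sequential_select(X, y, gamma=0.5, max_steps=None):
    """Forward selection: repeatedly add the feature with the largest
    absolute score; stop once the criterion no longer improves.

    ASSUMPTIONS (not taken from the paper):
      * score    = |x_j^T r| for the current least-squares residual r,
                   a stand-in for the estimated partial profile score;
      * stopping = EBIC-style criterion with hypothetical parameter gamma.
    """
    n, p = X.shape
    # Center and scale columns so scores are comparable across features.
    Xc = X - X.mean(axis=0)
    norms = np.linalg.norm(Xc, axis=0)
    norms[norms == 0] = 1.0
    Xs = Xc / norms
    yc = y - y.mean()
    if max_steps is None:
        max_steps = min(n - 2, p)

    selected = []
    in_model = np.zeros(p, dtype=bool)
    resid = yc.copy()
    best_crit = n * np.log(resid @ resid / n)  # criterion of the empty model

    for _ in range(max_steps):
        scores = np.abs(Xs.T @ resid)
        scores[in_model] = -np.inf             # never re-select a feature
        j = int(np.argmax(scores))

        # Refit by least squares on the candidate model.
        cand = selected + [j]
        beta, *_ = np.linalg.lstsq(Xs[:, cand], yc, rcond=None)
        r = yc - Xs[:, cand] @ beta
        k = len(cand)
        crit = n * np.log(r @ r / n) + k * np.log(n) + 2 * gamma * k * np.log(p)

        if crit >= best_crit:                  # no improvement: terminate
            break
        selected, resid, best_crit = cand, r, crit
        in_model[j] = True
    return selected
```

Calling `sequential_select(X, y)` with an `n x p` design matrix and a length-`n` response returns the selected column indices in the order they entered the model. Robustness to heavy-tailed errors, which the abstract claims for the proposed procedure, comes from its score and is not reproduced by this least-squares stand-in.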
Pages: 979-1000
Number of pages: 22
Related papers
50 in total
  • [1] A semi-parametric approach to feature selection in high-dimensional linear regression models
    Yuyang Liu
    Pengfei Pi
    Shan Luo
    [J]. Computational Statistics, 2023, 38 : 979 - 1000
  • [2] Semi-parametric Approach to Random Forests for High-Dimensional Bayesian Optimisation
    Kuzmanovski, Vladimir
    Hollmen, Jaakko
    [J]. DISCOVERY SCIENCE (DS 2022), 2022, 13601 : 418 - 428
  • [3] Cluster feature selection in high-dimensional linear models
    Lin, Bingqing
    Pang, Zhen
    Wang, Qihua
    [J]. RANDOM MATRICES-THEORY AND APPLICATIONS, 2018, 7 (01)
  • [4] Variable selection in finite mixture of semi-parametric regression models
    Ormoz, Ehsan
    Eskandari, Farzad
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2016, 45 (03) : 695 - 711
  • [5] Evaluation of a Semi-Parametric Model for High-Dimensional FES Control
    Schearer, Eric M.
    Liao, Yu-Wei
    Perreault, Eric J.
    Tresch, Matthew C.
    Memberg, William D.
    Kirsch, Robert F.
    Lynch, Kevin M.
    [J]. 2015 7TH INTERNATIONAL IEEE/EMBS CONFERENCE ON NEURAL ENGINEERING (NER), 2015, : 304 - 307
  • [6] Variable selection in semi-parametric models
    Zhang, Hongmei
    Maity, Arnab
    Arshad, Hasan
    Holloway, John
    Karmaus, Wilfried
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2016, 25 (04) : 1736 - 1752
  • [7] A sequential approach to feature selection in high-dimensional additive models
    Gong, Yuan
    Chen, Zehua
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2021, 215 : 289 - 298
  • [8] A semi-parametric Bayesian approach to generalized linear mixed models
    Kleinman, KP
    Ibrahim, JG
    [J]. STATISTICS IN MEDICINE, 1998, 17 (22) : 2579 - 2596
  • [9] Variable selection in high-dimensional sparse multiresponse linear regression models
    Luo, Shan
    [J]. STATISTICAL PAPERS, 2020, 61 (03) : 1245 - 1267