A semi-parametric approach to feature selection in high-dimensional linear regression models

被引:2
|
作者
Liu, Yuyang [1 ]
Pi, Pengfei [1 ]
Luo, Shan [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-parametric; Sequential feature selection; Estimated partial profile score; Score matching; Selection consistency; VARIABLE SELECTION; ROBUST; LASSO;
D O I
10.1007/s00180-022-01254-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a novel semi-parametric approach to feature selection in high-dimensional linear regression models. This sequential procedure is robust to the unknown error distribution including heavy-tailed distributions. At each step of this procedure, we add the feature with the largest absolute value of the estimated partial profile score into the model. The procedure terminates when a model selection criterion is met. Theoretically, we establish this procedure's selection consistency under regular conditions. Computationally, extensive numerical studies together with a real data application are provided to demonstrate its advantage over existing representative methods in terms of selection accuracy and computation cost.
引用
收藏
页码:979 / 1000
页数:22
相关论文
共 50 条