Robust Variable Selection Based on Relaxed Lad Lasso

被引:0
|
作者
Li, Hongyu [1 ]
Xu, Xieting [2 ]
Lu, Yajun [3 ]
Yu, Xi [4 ]
Zhao, Tong [2 ]
Zhang, Rufei [5 ,6 ,7 ]
机构
[1] Woosuk Univ, Grad Sch, Wanju Gun 55338, South Korea
[2] Hebei GEO Univ, Coll Econ, Shijiazhuang 050031, Hebei, Peoples R China
[3] Chongqing Technol & Business Univ, Sch Business Adm, Chongqing 400067, Peoples R China
[4] HBIS Supply Chain Management Co Ltd, Shijiazhuang 050001, Hebei, Peoples R China
[5] Hebei GEO Univ, Hebei Ctr Ecol & Environm Geol Res, Shijiazhuang 050031, Hebei, Peoples R China
[6] Hebei GEO Univ, Reaserch Ctr Nutural Resources Assets, Shijiazhuang 050031, Hebei, Peoples R China
[7] Hebei Prov Mineral Resources Dev & Management & T, Shijiazhuang 050031, Hebei, Peoples R China
来源
SYMMETRY-BASEL | 2022年 / 14卷 / 10期
关键词
variable selection; relaxed lasso; least absolute deviation; consistency; heavy-tailed; REGRESSION SHRINKAGE;
D O I
10.3390/sym14102161
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Least absolute deviation is proposed as a robust estimator to solve the problem when the error has an asymmetric heavy-tailed distribution or outliers. In order to be insensitive to the above situation and select the truly important variables from a large number of predictors in the linear regression, this paper introduces a two-stage variable selection method named relaxed lad lasso, which enables the model to obtain robust sparse solutions in the presence of outliers or heavy-tailed errors by combining least absolute deviation with relaxed lasso. Compared with lasso, this method is not only immune to the rapid growth of noise variables but also maintains a better convergence rate, which is O-p (n(-1/2)). In addition, we prove that the relaxed lad lasso estimator has the property of consistency at large samples; that is, the model selects the number of important variables with a high probability of convergence to one. Through the simulation and empirical results, we further verify the outstanding performance of relaxed lad lasso in terms of prediction accuracy and the correct selection of informative variables under the heavy-tailed distribution.
引用
收藏
页数:18
相关论文
共 50 条