Robust Methods for High-Dimensional Linear Learning

被引:0
|
作者
Merad, Ibrahim [1 ]
Gaiffas, Stephane [1 ,2 ]
机构
[1] Univ Paris Diderot, LPSM, UMR 8001, Paris, France
[2] Ecole Normale Super, DMA, Paris, France
关键词
robust methods; heavy-tailed data; outliers; sparse recovery; mirror descent; general; ization error; VARIABLE SELECTION; REGRESSION SHRINKAGE; ORACLE INEQUALITIES; LASSO; ESTIMATORS; SPARSITY; SLOPE; REGULARIZATION; RECOVERY; BOUNDS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose statistically robust and computationally efficient linear learning methods in the highdimensional batch setting, where the number of features d may exceed the sample size n. We employ, in a generic learning setting, two algorithms depending on whether the considered loss function is gradient-Lipschitz or not. Then, we instantiate our framework on several applications including vanilla sparse, group-sparse and low-rank matrix recovery. This leads, for each application, to efficient and robust learning algorithms, that reach near-optimal estimation rates under heavy-tailed distributions and the presence of outliers. For vanilla s-sparsity, we are able to reach the s log(d)/n rate under heavy-tails and eta-corruption, at a computational cost comparable to that of non-robust analogs. We provide an efficient implementation of our algorithms in an open-source Python library called linlearn, by means of which we carry out numerical experiments which confirm our theoretical findings together with a comparison to other recent approaches proposed in the literature.
引用
收藏
页数:44
相关论文
共 50 条