Robust Ridge Regression for High-Dimensional Data

被引:68
|
作者
Maronna, Ricardo A. [1 ,2 ]
机构
[1] Univ La Plata, Dept Math, RA-1900 La Plata, Argentina
[2] CICPBA, Buenos Aires, DF, Argentina
关键词
MM estimate; S estimate; Shrinking; SELECTION;
D O I
10.1198/TECH.2010.09114
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Ridge regression, being based on the minimization of a quadratic loss function, is sensitive to outliers. Current proposals for robust ridge-regression estimators are sensitive to "bad leverage observations," cannot be employed when the number of predictors p is larger than the number of observations n, and have a low robustness when the ratio pin is large. In this article a ridge-regression estimate based on repeated M estimation ("MM estimation") is proposed. It is a penalized regression MM estimator, in which the quadratic loss is replaced by an average of rho(r(i)/(sigma) over cap), where r(i) are the residuals and (sigma) over cap the residual scale from an initial estimator, which is a penalized S estimator; and rho is a bounded function. The MM estimator can be computed for p > n and is robust for large p/n. A fast algorithm is proposed. The advantages of the proposed approach over its competitors are demonstrated through both simulated and real data. Supplemental materials are available online.
引用
收藏
页码:44 / 53
页数:10
相关论文
共 50 条