A new multiple outliers identification method in linear regression

Cited by: 1
Authors
Bagdonavicius, Vilijandas [1]
Petkevicius, Linas [2]
Affiliations
[1] Vilnius Univ, Inst Appl Math, Naugarduko St 24, LT-03225 Vilnius, Lithuania
[2] Vilnius Univ, Inst Comp Sci, Didlaukio Str 47, LT-08303 Vilnius, Lithuania
Keywords
Outlier identification; Linear regression; Multiple outliers; Outlier region; Robust estimators; Influential observations
DOI
10.1007/s00184-019-00731-8
Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Subject classification codes
020208; 070103; 0714
Abstract
A new method for multiple outliers identification in linear regression models is developed. It is relatively simple and easy to use. The method is based on a result giving asymptotic properties of extreme studentized residuals. This result is proved under rather general conditions on the estimation procedure and the covariate distribution. An extensive simulation study shows that the proposed method outperforms various existing methods in terms of masking and swamping values. The advantage of the method is particularly visible for large datasets and/or large numbers of outliers. The analysis of several well-known real data examples confirms that in most cases the new method identifies outliers better than other commonly used methods.
Pages: 275-296
Number of pages: 22