A procedure for the detection of multivariate outliers

被引:30
|
作者
Kosinski, AS [1 ]
机构
[1] Emory Univ, Rollins Sch Publ Hlth, Dept Biostat, Atlanta, GA 30322 USA
关键词
data analysis; robust estimation; Mahalanobis distance; elemental subset; forward procedure; minimum volume ellipsoid; masking;
D O I
10.1016/S0167-9473(98)00073-5
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Single case diagnostics are susceptible to a masking effect. This has led to the development of methods for detecting of multiple multivariate outliers. The available methods work well but may not be able to always detect outliers in data with contamination fraction greater than 35%, as reported by Rocke and Woodruff (1996, J. Am. Statist. Assoc. 91, 1047-1061). In this paper we propose a new method for detection of outliers which is very resistant to such high contamination of data with outliers. The simulation results indicate that, while maintaining the nominal level, the proposed method is never worse and detects outliers better than the Rocke and Woodruff method for data highly contaminated (35-45%) with outliers. Improved performance was also noted for data with smaller contamination fraction (15-20%) when outliers were situated closer to the "good" data. Several data sets are used to illustrate the proposed procedure. (C) 1999 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:145 / 161
页数:17
相关论文
共 50 条