Outliers in multilevel data

被引:87
|
作者
Langford, IH [1 ]
Lewis, T
机构
[1] Univ E Anglia, Ctr Social & Econ Res Global Environm, Sch Environm Sci, Norwich NR4 7TJ, Norfolk, England
[2] Univ London, Inst Educ, London WC1N 1AZ, England
关键词
cluster analysis; hierarchical data; influential data points; leverage; multilevel modelling; outlier detection; reduction in deviance; studentized residuals;
D O I
10.1111/1467-985X.00094
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
This paper offers the data analyst a range of practical procedures for dealing with outliers in multilevel data. It first develops several techniques for data exploration for outliers and outlier analysis and then applies these to the detailed analysis of outliers in two large scale multilevel data sets from educational contexts. The techniques include the use of deviance reduction, measures based on residuals, leverage values, hierarchical cluster analysis and a measure called DFITS. Outlier analysis is more complex in a multilevel data set than in, say, a univariate sample or a set of regression data, where the concept of an outlying value is straightforward. In the multilevel situation one has to consider, for example, at what level o(-) levels a particular response is ou!lying, and in respect of which explanatory variables; furthermore, the treatment of a particular response at one level may affect its status or the status of other units at other levels in the model.
引用
收藏
页码:121 / 153
页数:33
相关论文
共 50 条
  • [31] Qualitative Data Clustering to Detect Outliers
    Nowak-Brzezinska, Agnieszka
    Lazarz, Weronika
    [J]. ENTROPY, 2021, 23 (07)
  • [32] PROCESSING OUTLIERS IN STATISTICAL-DATA
    MUHLBAUER, JA
    [J]. ACS SYMPOSIUM SERIES, 1985, 284 : 37 - 47
  • [33] Logical Approach to Finding Outliers in Data
    Lyutikova L.A.
    [J]. Journal of Mathematical Sciences, 2022, 260 (2) : 202 - 209
  • [34] A Method for Detecting Outliers in Functional Data
    Yu, Fengmin
    Liu, Liming
    Jin, Liying
    Yu, Nanxiang
    Shang, Hua
    [J]. IECON 2017 - 43RD ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2017, : 7405 - 7410
  • [35] Interpretation of multivariate outliers for compositional data
    Filzmoser, Peter
    Hron, Karel
    Reimann, Clemens
    [J]. COMPUTERS & GEOSCIENCES, 2012, 39 : 77 - 85
  • [36] From outliers to prototypes:: Ordering data
    Harmeling, Stefan
    Dornhege, Guido
    Tax, David
    Meinecke, Frank
    Mueller, Klaus-Robert
    [J]. NEUROCOMPUTING, 2006, 69 (13-15) : 1608 - 1618
  • [37] Robust Data Processing in the Presence of Outliers
    Griszin, Jurij
    [J]. PRZEGLAD ELEKTROTECHNICZNY, 2010, 86 (03): : 25 - 27
  • [38] Exponential regression for censored data with outliers
    Zhang, Jing
    Liu, Yanyan
    Wu, Yuanshan
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2016, 86 (03) : 431 - 442
  • [39] OUTLIERS IN EXPERIMENTAL-DATA AND THEIR TREATMENT
    MILLER, JN
    [J]. ANALYST, 1993, 118 (05) : 455 - 461
  • [40] Detecting Outliers in Multivariate Laboratory Data
    Southworth, Harry
    [J]. JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2008, 18 (06) : 1178 - 1183