Outliers in multilevel data

Cited by: 87
Authors
Langford, IH [1]
Lewis, T
Affiliations
[1] Univ E Anglia, Ctr Social & Econ Res Global Environm, Sch Environm Sci, Norwich NR4 7TJ, Norfolk, England
[2] Univ London, Inst Educ, London WC1N 1AZ, England
Keywords
cluster analysis; hierarchical data; influential data points; leverage; multilevel modelling; outlier detection; reduction in deviance; studentized residuals
DOI
10.1111/1467-985X.00094
Chinese Library Classification
O1 [Mathematics]; C [Social Sciences, General]
Discipline classification codes
03; 0303; 0701; 070101
Abstract
This paper offers the data analyst a range of practical procedures for dealing with outliers in multilevel data. It first develops several techniques for data exploration for outliers and outlier analysis and then applies these to the detailed analysis of outliers in two large-scale multilevel data sets from educational contexts. The techniques include the use of deviance reduction, measures based on residuals, leverage values, hierarchical cluster analysis and a measure called DFITS. Outlier analysis is more complex in a multilevel data set than in, say, a univariate sample or a set of regression data, where the concept of an outlying value is straightforward. In the multilevel situation one has to consider, for example, at what level or levels a particular response is outlying, and in respect of which explanatory variables; furthermore, the treatment of a particular response at one level may affect its status or the status of other units at other levels in the model.
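Illustrative note (not part of the original record): the abstract names leverage values, studentized residuals and DFITS among its techniques. The sketch below shows the familiar single-level regression analogues of these three quantities, assuming an ordinary least-squares fit. It is not the multilevel procedure developed in the paper; the function name `ols_diagnostics` and the toy data are hypothetical.

```python
import numpy as np

def ols_diagnostics(X, y):
    """Single-level OLS diagnostics (an assumption-laden sketch, not the
    paper's multilevel method).

    X is an (n, p) design matrix that already contains an intercept column.
    Returns leverage values, externally studentized residuals and DFFITS.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n, p = X.shape

    # Leverage: diagonal of the hat matrix, computed from the thin QR factor.
    Q, _ = np.linalg.qr(X)
    h = np.sum(Q**2, axis=1)

    # Ordinary least-squares fit and raw residuals.
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    e = y - X @ beta

    # Leave-one-out residual variance for each observation.
    sse = e @ e
    s2_loo = (sse - e**2 / (1.0 - h)) / (n - p - 1)

    # Externally studentized residuals and DFFITS.
    t = e / np.sqrt(s2_loo * (1.0 - h))
    dffits = t * np.sqrt(h / (1.0 - h))
    return h, t, dffits

# Toy usage: a straight line with one planted outlier at the last index.
rng = np.random.default_rng(0)
x = np.linspace(0.0, 1.0, 20)
y = 1.0 + 2.0 * x + 0.1 * rng.normal(size=x.size)
y[-1] += 5.0
X = np.column_stack([np.ones_like(x), x])
h, t, dffits = ols_diagnostics(X, y)
print("most influential index:", int(np.argmax(np.abs(dffits))))
```

In a multilevel model the same questions have to be asked separately at each level (for example pupils and schools), which is the complication the abstract describes.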
Pages: 121-153
Page count: 33
Related papers
50 in total
  • [1] Outliers in multilevel data - Discussion on the paper by Langford and Lewis
    Prescott, P
    Longford, NT
    Raab, G
    Parpia, T
    Herzberg, AM
    Bell, JF
    Clarke, BR
    Fung, WK
    Wakefield, J
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1998, 161 : 153 - 160
  • [2] Detection of outliers in multilevel models
    Shi, Lei
    Chen, Gemai
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2008, 138 (10) : 3189 - 3199
  • [3] ON THE DETECTION OF MULTIVARIATE DATA OUTLIERS AND REGRESSION OUTLIERS
    LAZRAQ, A
    CLEROUX, R
    [J]. DATA ANALYSIS, LEARNING SYMBOLIC AND NUMERIC KNOWLEDGE, 1989 : 133 - 140
  • [4] PROCESSING DATA FOR OUTLIERS
    DIXON, WJ
    [J]. BIOMETRICS, 1953, 9 (01) : 74 - 89
  • [5] Outliers in Smartphone Sensor Data Reveal Outliers in Daily Happiness
    Buda, Teodora Sandra
    Khwaja, Mohammed
    Matic, Aleksandar
    [J]. PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2021, 5 (01):
  • [6] Multilevel modeling in the presence of outliers: A comparison of robust estimation methods
    Finch, Holmes
    [J]. PSICOLOGICA, 2017, 38 (01) : 57 - 92
  • [7] Correlation of Outliers in Multivariate Data
    Kaszuba, Bartosz
    [J]. DATA ANALYSIS, MACHINE LEARNING AND KNOWLEDGE DISCOVERY, 2014 : 265 - 272
  • [8] Outliers in microarray data analysis
    Pearson, RK
    Gonye, GE
    Schwaber, JS
    [J]. METHODS OF MICROARRAY DATA ANALYSIS III, 2003 : 41 - 55
  • [9] Finding the Outliers in Scanpath Data
    Burch, Michael
    Kumar, Ayush
    Mueller, Klaus
    Kervezee, Titus
    Nuijten, Wouter
    Oostenbach, Rens
    Peeters, Lucas
    Smit, Gijs
    [J]. ETRA 2019: 2019 ACM SYMPOSIUM ON EYE TRACKING RESEARCH & APPLICATIONS, 2019,
  • [10] OUTLIERS IN TIME SERIES DATA
    Deneshkumar, V.
    Kannan, K. Senthamarai
    [J]. INTERNATIONAL JOURNAL OF AGRICULTURAL AND STATISTICAL SCIENCES, 2011, 7 (02) : 685 - 691