Categorical data analysis: Away from ANOVAs (transformation or not) and towards logit mixed models

被引:2415
|
作者
Jaeger, T. Florian [1 ]
机构
[1] Univ Rochester, Rochester, NY 14627 USA
关键词
Arcsine-square-root transformation; Logistic regression; Mixed logit models; Categorical data analysis;
D O I
10.1016/j.jml.2007.11.007
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
This paper identifies several serious problems with the widespread use of ANOVAs for the analysis of categorical outcome variables such as forced-choice variables, question-answer accuracy, choice in production (e.g. in syntactic priming research), et cetera. I show that even after applying the arcsine-square-root transformation to proportional data, ANOVA can yield spurious results. I discuss conceptual issues underlying these problems and alternatives provided by modern statistics. Specifically, I introduce ordinary logit models (i.e. logistic regression), which are well-suited to analyze categorical data and offer many advantages over ANOVA. Unfortunately, ordinary logit models do not include random effect modeling. To address this issue, I describe mixed logit models (Generalized Linear Mixed Models for binomially distributed outcomes, Breslow and Clayton [Breslow, N. E. & Clayton, D. G. (1993). Approximate inference in generalized linear mixed models. Journal of the American Statistical Society 88(421), 9-25]), which combine the advantages of ordinary logit models with the ability to account for random subject and item effects in one step of analysis. Throughout the paper, I use a psycholinguistic data set to compare the different statistical methods. (C) 2007 Elsevier Inc. All rights reserved.
引用
收藏
页码:434 / 446
页数:13
相关论文
共 50 条