Type I Error and Statistical Power of the Mantel-Haenszel Procedure for Detecting DIF: A Meta-Analysis

被引:18
|
作者
Guilera, Georgina [1 ,2 ]
Gomez-Benito, Juana [1 ,2 ]
Dolores Hidalgo, Maria [3 ]
Sanchez-Meca, Julio [3 ]
机构
[1] Univ Barcelona, Fac Psychol, Dept Behav Sci Methods, E-08007 Barcelona, Spain
[2] Univ Barcelona, Inst Brain Cognit & Behav IR3C, E-08007 Barcelona, Spain
[3] Univ Murcia, Fac Psychol, Dept Basic Psychol & Methodol, Murcia, Spain
关键词
Type I error; statistical power; Mantel-Haenszel; differential item functioning; meta-analysis; ITEM FUNCTIONING DETECTION; CHI-SQUARE TEST; LOGISTIC-REGRESSION; SAMPLE-SIZE; METHODOLOGICAL RESEARCH; ERRONEOUS DETECTION; NONUNIFORM DIF; IRT; IDENTIFICATION; SIBTEST;
D O I
10.1037/a0034306
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
This article presents a meta-analysis of studies investigating the effectiveness of the Mantel-Haenszel (MH) procedure when used to detect differential item functioning (DIF). Studies were located electronically in the main databases, representing the codification of 3,774 different simulation conditions, 1,865 related to Type I error and 1,909 to statistical power. The homogeneity of effect-size distributions was assessed by the Q statistic. The extremely high heterogeneity in both error rates (I-2 = 94.70) and power (I-2 = 99.29), due to the fact that numerous studies test the procedure in extreme conditions, means that the main interest of the results lies in explaining the variability in detection rates. One-way analysis of variance was used to determine the effects of each variable on detection rates, showing that the MH test was more effective when purification procedures were used, when the data fitted the Rasch model, when test contamination was below 20%, and with sample sizes above 500. The results imply a series of recommendations for practitioners who wish to study DIF with the MH test. A limitation, one inherent to all meta-analyses, is that not all the possible moderator variables, or the levels of variables, have been explored. This serves to remind us of certain gaps in the scientific literature (i.e., regarding the direction of DIF or variances in ability distribution) and is an aspect that methodologists should consider in future simulation studies.
引用
收藏
页码:553 / 571
页数:19
相关论文
共 50 条