Impact of differential item functioning on group score reporting in the context of large-scale assessments

被引:0
|
作者
Sean Joo
Usama Ali
Frederic Robin
Hyo Jeong Shin
机构
[1] University of Kansas,
[2] Educational Testing Service,undefined
[3] Sogang University,undefined
[4] South Valley University,undefined
关键词
Large-scale assessment; Programme for International Student Assessment; Differential item functioning; Group score reporting; Jackknife sampling;
D O I
暂无
中图分类号
学科分类号
摘要
We investigated the potential impact of differential item functioning (DIF) on group-level mean and standard deviation estimates using empirical and simulated data in the context of large-scale assessment. For the empirical investigation, PISA 2018 cognitive domains (Reading, Mathematics, and Science) data were analyzed using Jackknife sampling to explore the impact of DIF on the country scores and their standard errors. We found that the countries that have a large number of DIF items tend to increase the difference of the country scores computed with and without the DIF adjustment. In addition, standard errors of the country score differences also increased with the number of DIF items. For the simulation study, we evaluated bias and root mean squared error (RMSE) of the group mean and standard deviation estimates using the multigroup item response theory (IRT) model to explore the extent to which DIF items create a bias of the group mean scores and how effectively the DIF adjustment corrects the bias under various conditions. We found that the DIF adjustment reduced the bias by 50% on average. The implications and limitations of the study are further discussed.
引用
收藏
相关论文
共 50 条