Examining severity and centrality effects in TestDaF writing and speaking assessments: An extended Bayesian many-facet Rasch analysis

被引:7
|
作者
Eckes, Thomas [1 ]
Jin, Kuan-Yu [2 ]
机构
[1] Univ Bochum, TestDaF Inst, Univ Str 134, D-44799 Bochum, Germany
[2] Hong Kong Examinat & Assessment Author, Hong Kong, Peoples R China
关键词
Rater effects; rater centrality; facets models; performance assessment; Bayesian statistics; MCMC estimation; RATER TYPES; MODEL; QUALITY;
D O I
10.1080/15305058.2021.1963260
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
Severity and centrality are two main kinds of rater effects posing threats to the validity and fairness of performance assessments. Adopting Jin and Wang's (2018) extended facets modeling approach, we separately estimated the magnitude of rater severity and centrality effects in the web-based TestDaF (Test of German as a Foreign Language) writing and speaking assessments using Bayesian MCMC methods. The findings revealed that (a) the extended facets model had a better data-model fit than models that ignored either or both kinds of rater effects, (b) rating scale and partial credit versions of the extended model differed in terms of data-model fit for writing and speaking, (c) rater severity and centrality estimates were not significantly correlated with each other, and (d) centrality effects had a demonstrable impact on examinee rank orderings. The discussion focuses on implications for the analysis and evaluation of rating quality in performance assessments.
引用
收藏
页码:131 / 153
页数:23
相关论文
共 9 条