The reliability of single task assessment in longitudinal L2 writing research

Cited by: 2
Authors
Wu, May Y. [1]
Steinkrauss, Rasmus [1]
Lowie, Wander [1]
Affiliations
[1] Univ Groningen, Dept Appl Linguist, Fac Arts, Groningen, Netherlands
Keywords
L2 writing assessment; Generalizability theory; CAF; Complex dynamic systems theory; Task topic; SYNTACTIC COMPLEXITY; GENERALIZABILITY; ACCURACY; DECISION; FLUENCY; TOOLS
DOI
10.1016/j.jslw.2022.100950
Chinese Library Classification number
H0 [Linguistics]
Discipline classification codes
030303; 0501; 050102
Abstract
Single task writing assessments used in longitudinal studies have raised concerns regarding their reliability. By means of Generalizability Theory (GT), this study investigated the reliability of L2 writing assessments scored on different CAF measures, focusing on a) the reliability of single task writing assessments and on the effects of b) task topics and c) task-taking occasions on assessment reliability. We investigated analytic quantitative scores obtained from five CAF measures through a 1-day dataset and a 21-day dataset, consisting of 90 essays from 18 Chinese learners of English who did not follow any formal language instruction during the investigation. The results show that although some CAF scores (e.g., fluency) of single task assessments have distinctly higher reliability than other scores, the general conclusion is that single task assessments are not reliable from a GT perspective. Task topic introduces some score variance to the assessment result, yet this amount of variance differs profoundly between the CAF measures due to the functional variability, which corresponds with Complex Dynamic Systems Theory assumptions suggesting sub-systems of an L2 do not develop synchronously. Finally, occasion, i.e., whether two samples were written on the same day or within 21 days, barely introduces score variance.
Pages: 13