Essay Selection Methods for Adaptive Rater Monitoring

被引:3
|
作者
Wang, Chun [1 ]
Song, Tian [2 ]
Wang, Zhuoran [1 ]
Wolfe, Edward [3 ]
机构
[1] Univ Minnesota, Minneapolis, MN USA
[2] Pearson VUE, Bloomington, MN USA
[3] ETS, Princeton, NJ USA
关键词
Rasch partial credit model; essay selection; Fisher information matrix; interim scoring; RASCH MODEL; ITEM; INFORMATION; AGREEMENT;
D O I
10.1177/0146621616672855
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Constructed-response items are commonly used in educational and psychological testing, and the answers to those items are typically scored by human raters. In the current rater monitoring processes, validity scoring is used to ensure that the scores assigned by raters do not deviate severely from the standards of rating quality. In this article, an adaptive rater monitoring approach that may potentially improve the efficiency of current rater monitoring practice is proposed. Based on the Rasch partial credit model and known development in multidimensional computerized adaptive testing, two essay selection methods-namely, the D-optimal method and the Single Fisher information method-are proposed. These two methods intend to select the most appropriate essays based on what is already known about a rater's performance. Simulation studies, using a simulated essay bank and a cloned real essay bank, show that the proposed adaptive rater monitoring methods can recover rater parameters with much fewer essay questions. Future challenges and potential solutions are discussed in the end.
引用
收藏
页码:60 / 79
页数:20
相关论文
共 50 条
  • [41] The Effects of Different Rater Training Procedures on ESL Essay Raters' Rating Accuracy
    Rethinasamy, Souba
    PERTANIKA JOURNAL OF SOCIAL SCIENCE AND HUMANITIES, 2021, 29 : 401 - 419
  • [42] Stability of rater judgments in holistic and analytic essay-coding in primary school
    Boehme, Katrin
    Robitzsch, Alexander
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2008, 43 (3-4) : 263 - 263
  • [43] Analysing rater agreement: Manifest variable methods
    Hudry, Kristelle
    BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2007, 60 : 421 - 422
  • [44] A Comparison of Constrained Item Selection Methods in Multidimensional Computerized Adaptive Testing
    Su, Ya-Hui
    APPLIED PSYCHOLOGICAL MEASUREMENT, 2016, 40 (05) : 346 - 360
  • [45] New Item Selection Methods for Cognitive Diagnosis Computerized Adaptive Testing
    Kaplan, Mehmet
    de la Torre, Jimmy
    Ramon Barrada, Juan
    APPLIED PSYCHOLOGICAL MEASUREMENT, 2015, 39 (03) : 167 - 188
  • [46] Stratified Item Selection Methods in Cognitive Diagnosis Computerized Adaptive Testing
    Yang, Jing
    Chang, Hua-Hua
    Tao, Jian
    Shi, Ningzhong
    APPLIED PSYCHOLOGICAL MEASUREMENT, 2020, 44 (05) : 346 - 361
  • [47] Analysis of nonlinear networks using domain methods and an adaptive selection of functions
    Wenzler, A
    Lüder, E
    FREQUENZ, 1999, 53 (1-2) : 12 - 17
  • [48] A comparison of item-selection methods for adaptive tests with content constraints
    van der Linden, WJ
    JOURNAL OF EDUCATIONAL MEASUREMENT, 2005, 42 (03) : 283 - 302
  • [49] The selection of design methods for river water quality monitoring networks: a review
    Thuy Hoang Nguyen
    Björn Helm
    Hiroshan Hettiarachchi
    Serena Caucci
    Peter Krebs
    Environmental Earth Sciences, 2019, 78
  • [50] The selection of design methods for river water quality monitoring networks: a review
    Thuy Hoang Nguyen
    Helm, Bjoern
    Hettiarachchi, Hiroshan
    Caucci, Serena
    Krebs, Peter
    ENVIRONMENTAL EARTH SCIENCES, 2019, 78 (03)