Essay Selection Methods for Adaptive Rater Monitoring

被引:3
|
作者
Wang, Chun [1 ]
Song, Tian [2 ]
Wang, Zhuoran [1 ]
Wolfe, Edward [3 ]
机构
[1] Univ Minnesota, Minneapolis, MN USA
[2] Pearson VUE, Bloomington, MN USA
[3] ETS, Princeton, NJ USA
关键词
Rasch partial credit model; essay selection; Fisher information matrix; interim scoring; RASCH MODEL; ITEM; INFORMATION; AGREEMENT;
D O I
10.1177/0146621616672855
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Constructed-response items are commonly used in educational and psychological testing, and the answers to those items are typically scored by human raters. In the current rater monitoring processes, validity scoring is used to ensure that the scores assigned by raters do not deviate severely from the standards of rating quality. In this article, an adaptive rater monitoring approach that may potentially improve the efficiency of current rater monitoring practice is proposed. Based on the Rasch partial credit model and known development in multidimensional computerized adaptive testing, two essay selection methods-namely, the D-optimal method and the Single Fisher information method-are proposed. These two methods intend to select the most appropriate essays based on what is already known about a rater's performance. Simulation studies, using a simulated essay bank and a cloned real essay bank, show that the proposed adaptive rater monitoring methods can recover rater parameters with much fewer essay questions. Future challenges and potential solutions are discussed in the end.
引用
收藏
页码:60 / 79
页数:20
相关论文
共 50 条
  • [1] Imaging methods in monitoring gout - a pictorial essay
    Korzen, Maria
    Nowakowska-Plaza, Anna
    Leszkiewicz, Marek
    Sudol-Szopinska, Iwona
    JOURNAL OF ULTRASONOGRAPHY, 2024, 24 (97) : 8 - 8
  • [2] Measuring Essay Assessment: Intra-rater and Inter-rater Reliability
    Kayapinar, Ulas
    EURASIAN JOURNAL OF EDUCATIONAL RESEARCH, 2014, (57): : 113 - 135
  • [3] Effects of marking method and rater experience on ESL essay scores and rater performance
    Barkaoui, Khaled
    ASSESSMENT IN EDUCATION-PRINCIPLES POLICY & PRACTICE, 2011, 18 (03) : 279 - 293
  • [4] A Comparison of Methods for Adaptive Treatment Selection
    Friede, Tim
    Stallard, Nigel
    BIOMETRICAL JOURNAL, 2008, 50 (05) : 767 - 781
  • [5] Adaptive Sensor Selection for Monitoring Stochastic Processes
    Park, Shinkyu
    Ratti, Carlo
    Rus, Daniela
    2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 6766 - 6772
  • [6] Adaptive Selection of Latent Variables for Process Monitoring
    Luo, Lijia
    Bao, Shiyi
    Mao, Jianfeng
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2019, 58 (21) : 9075 - 9086
  • [7] Rater Effects on Essay Scoring: A Multilevel Analysis of Severity Drift, Central Tendency, and Rater Experience
    Leckie, George
    Baird, Jo-Anne
    JOURNAL OF EDUCATIONAL MEASUREMENT, 2011, 48 (04) : 399 - 418
  • [8] Adaptive floating search methods in feature selection
    Somol, P
    Pudil, P
    Novovicová, J
    Paclík, P
    PATTERN RECOGNITION LETTERS, 1999, 20 (11-13) : 1157 - 1163
  • [9] Comparison of e-rater (R) Automated Essay Scoring Model Calibration Methods Based on Distributional Targets
    Zhang, Mo
    Williamson, David M.
    Breyer, F. Jay
    Trapani, Catherine
    INTERNATIONAL JOURNAL OF TESTING, 2012, 12 (04) : 345 - 364
  • [10] Selection of optimal methods for intelligent process monitoring
    Shapovalov, R
    Whiteley, JR
    PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL, 2003, : 679 - 684