Essay Selection Methods for Adaptive Rater Monitoring

被引:3
|
作者
Wang, Chun [1 ]
Song, Tian [2 ]
Wang, Zhuoran [1 ]
Wolfe, Edward [3 ]
机构
[1] Univ Minnesota, Minneapolis, MN USA
[2] Pearson VUE, Bloomington, MN USA
[3] ETS, Princeton, NJ USA
关键词
Rasch partial credit model; essay selection; Fisher information matrix; interim scoring; RASCH MODEL; ITEM; INFORMATION; AGREEMENT;
D O I
10.1177/0146621616672855
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Constructed-response items are commonly used in educational and psychological testing, and the answers to those items are typically scored by human raters. In the current rater monitoring processes, validity scoring is used to ensure that the scores assigned by raters do not deviate severely from the standards of rating quality. In this article, an adaptive rater monitoring approach that may potentially improve the efficiency of current rater monitoring practice is proposed. Based on the Rasch partial credit model and known development in multidimensional computerized adaptive testing, two essay selection methods-namely, the D-optimal method and the Single Fisher information method-are proposed. These two methods intend to select the most appropriate essays based on what is already known about a rater's performance. Simulation studies, using a simulated essay bank and a cloned real essay bank, show that the proposed adaptive rater monitoring methods can recover rater parameters with much fewer essay questions. Future challenges and potential solutions are discussed in the end.
引用
收藏
页码:60 / 79
页数:20
相关论文
共 50 条
  • [21] SELECTION OF ANALYTIC METHODS FOR THERAPEUTIC DRUG-MONITORING
    JATLOW, P
    HUMAN PATHOLOGY, 1984, 15 (05) : 404 - 414
  • [22] SELECTION OF OPTIMUM CUTTING TOOL MONITORING METHODS.
    Poduraev, V.N.
    1600, (06):
  • [23] Online monitoring with local smoothing methods and adaptive ridging
    Einbeck, J
    Kauermann, G
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2003, 73 (12) : 913 - 929
  • [24] A model of rater behavior in essay grading based on signal detection theory
    DeCarlo, LT
    JOURNAL OF EDUCATIONAL MEASUREMENT, 2005, 42 (01) : 53 - 76
  • [25] A Hierarchical Rater Model Approach for Integrating Automated Essay Scoring Models
    Fink, Aron
    Gombert, Sebastian
    Liu, Tuo
    Drachsler, Hendrik
    Frey, Andreas
    ZEITSCHRIFT FUR PSYCHOLOGIE-JOURNAL OF PSYCHOLOGY, 2024, 232 (03): : 209 - 218
  • [26] Stumping e-rater:: challenging the validity of automated essay scoring
    Powers, DE
    Burstein, JC
    Chodorow, M
    Fowles, ME
    Kukich, K
    COMPUTERS IN HUMAN BEHAVIOR, 2002, 18 (02) : 103 - 134
  • [27] Predictive Modeling of Rater Behavior: Implications for Quality Assurance in Essay Scoring
    Bejar, Isaac I.
    Li, Chen
    McCaffrey, Daniel
    APPLIED MEASUREMENT IN EDUCATION, 2020, 33 (03) : 234 - 247
  • [28] Metric-based methods for adaptive model selection and regularization
    Schuurmans, D
    Southey, F
    MACHINE LEARNING, 2002, 48 (1-3) : 51 - 84
  • [29] Adaptive model selection using orthogonal least squares methods
    Stark, J
    PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 1997, 453 (1956): : 21 - 42
  • [30] Metric-Based Methods for Adaptive Model Selection and Regularization
    Dale Schuurmans
    Finnegan Southey
    Machine Learning, 2002, 48 : 51 - 84