Essay Selection Methods for Adaptive Rater Monitoring

Cited by: 3
Authors
Wang, Chun [1]
Song, Tian [2]
Wang, Zhuoran [1]
Wolfe, Edward [3]
Affiliations
[1] Univ Minnesota, Minneapolis, MN USA
[2] Pearson VUE, Bloomington, MN USA
[3] ETS, Princeton, NJ USA
Keywords
Rasch partial credit model; essay selection; Fisher information matrix; interim scoring; RASCH MODEL; ITEM; INFORMATION; AGREEMENT;
DOI
10.1177/0146621616672855
Chinese Library Classification number
O1 [Mathematics]; C [Social Sciences, General];
Discipline classification code
03 ; 0303 ; 0701 ; 070101 ;
Abstract
Constructed-response items are commonly used in educational and psychological testing, and the answers to those items are typically scored by human raters. In current rater monitoring processes, validity scoring is used to ensure that the scores assigned by raters do not deviate severely from the standards of rating quality. In this article, an adaptive rater monitoring approach that may improve the efficiency of current rater monitoring practice is proposed. Based on the Rasch partial credit model and known developments in multidimensional computerized adaptive testing, two essay selection methods, namely the D-optimal method and the Single Fisher information method, are proposed. These two methods are intended to select the most appropriate essays based on what is already known about a rater's performance. Simulation studies, using a simulated essay bank and a cloned real essay bank, show that the proposed adaptive rater monitoring methods can recover rater parameters with far fewer essay questions. Future challenges and potential solutions are discussed at the end.
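The idea of information-based selection mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: it assumes a plain unidimensional Rasch partial credit model in which a single latent value `theta` is being estimated and each candidate essay is described by a list of step parameters. The function names, the toy bank, and this simplified parameterization are all hypothetical. Under that assumption, the Fisher information about `theta` equals the variance of the score, and the next essay is the unused one with the largest information at the current estimate.

```python
import math

def pcm_probs(theta, steps):
    """Category probabilities under a Rasch partial credit model.

    `steps` holds the step parameters delta_1..delta_m (hypothetical values);
    category k has logit sum_{j<=k} (theta - delta_j), with category 0 at 0.
    """
    logits = [0.0]
    for d in steps:
        logits.append(logits[-1] + (theta - d))
    m = max(logits)  # subtract the max for numerical stability
    w = [math.exp(x - m) for x in logits]
    total = sum(w)
    return [x / total for x in w]

def pcm_information(theta, steps):
    """Fisher information about theta: the variance of the score category."""
    p = pcm_probs(theta, steps)
    mean = sum(k * pk for k, pk in enumerate(p))
    return sum((k - mean) ** 2 * pk for k, pk in enumerate(p))

def select_essay(theta_hat, bank, used=()):
    """Index of the unused bank entry with maximum information at theta_hat."""
    candidates = [i for i in range(len(bank)) if i not in used]
    return max(candidates, key=lambda i: pcm_information(theta_hat, bank[i]))

# Toy bank (assumed values): one list of step parameters per essay.
bank = [[-3.0], [0.5], [2.0], [-1.0, 1.0]]
next_essay = select_essay(0.0, bank)
```

When several parameters are estimated jointly, the scalar information is replaced by an information matrix, and a D-optimal rule picks the candidate that maximizes the determinant of the accumulated matrix. That extension is not shown here.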
Pages: 60-79
Page count: 20
Related papers
50 in total
  • [31] Online Feature Selection by Adaptive Sub-gradient Methods
    Zhai, Tingting
    Wang, Hao
    Koriche, Frederic
    Gao, Yang
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT II, 2019, 11052 : 430 - 446
  • [32] Monitoring Rater Quality in Observational Systems: Issues Due to Unreliable Estimates of Rater Quality
    White, Mark
    Ronfeldt, Matt
    EDUCATIONAL ASSESSMENT, 2024, 29 (02) : 124 - 146
  • [33] METHODS OF INCREASING GLOBAL COMMUNICATIONS CAPACITY BY ADAPTIVE SELECTION OF CHANNELS
    MOXON, LA
    RADIO AND ELECTRONIC ENGINEER, 1969, 38 (05) : 305 - &
  • [34] Adaptive hybrid methods for Feature selection based on Aggregation of Information gain and Clustering methods
    Thangaiah, P. Ranjit Jeba
    Shriram, R.
    Vivekanandan, K.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2009, 9 (02) : 164 - 169
  • [35] A comparison of methods for monitoring individual performances in taste selection tests
    Calviño, A
    Garrido, D
    Drunday, F
    Tamasi, O
    JOURNAL OF SENSORY STUDIES, 2005, 20 (04) : 301 - 312
  • [36] A comparison of methods to generate adaptive reference ranges in longitudinal monitoring
    Roshan, Davood
    Ferguson, John
    Pedlar, Charles R.
    Simpkin, Andrew
    Wyns, William
    Sullivan, Frank
    Newell, John
    PLOS ONE, 2021, 16 (02):
  • [37] The Cinematic Essay as Adaptive Process
    Warner, Rick
    ADAPTATION-THE JOURNAL OF LITERATURE ON SCREEN STUDIES, 2013, 6 (01) : 1 - 24
  • [38] Comparing three methods for estimating rater objectivity
    Langendorfer, SJ
    JOURNAL OF SPORT & EXERCISE PSYCHOLOGY, 2001, 23 : S58 - S59
  • [39] Analyzing rater agreement: Manifest variable methods
    Engelhard, G
    APPLIED PSYCHOLOGICAL MEASUREMENT, 2006, 30 (02) : 154 - 156
  • [40] Variability in ESL Essay Rating Processes: The Role of the Rating Scale and Rater Experience
    Barkaoui, Khaled
    LANGUAGE ASSESSMENT QUARTERLY, 2010, 7 (01) : 54 - 74