Score resolution: An investigation of the reliability and validity of resolved scores

Cited by: 22
Authors
Johnson, RL [1]
Penny, J
Fisher, S
Kuhs, T
Affiliations
[1] Univ S Carolina, Coll Educ, Ctr Excellence Assessment Student Learning, Columbia, SC 29223 USA
[2] Castle Worldwide Inc, Morrisville, NC USA
DOI
10.1207/S15324818AME1604_3
Chinese Library Classification number
G40 [Education]
Discipline classification code
040101; 120403
Abstract
When raters assign different scores to a performance task, a method for resolving rating differences is required to report a single score to the examinee. Recent studies indicate that decisions about examinees, such as pass/fail decisions, differ across resolution methods. Previous studies also investigated the interrater reliability of operational scores formed through different resolution methods; however, reliability might have been overestimated because adjudication was conditional on raters' initial disagreement. This study used a replication design involving autonomous teams of raters to investigate the reliability associated with three forms of resolution: averaging the original raters' scores, averaging the original raters' scores with an adjudicator's score, and matching the adjudicator's score with the closest original score. This study also examined validity coefficients for resolved scores and two types of criterion scores. Findings include that (a) interrater reliability was slightly higher for the resolution method that averages the scores of the original raters and the adjudicator and (b) the lowest validity coefficients were associated most frequently with the method that matches the adjudicator's score with the closest original score.
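The three forms of resolution named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' procedure: the function names are hypothetical, the abstract does not say how the matched pair is combined in the third method (averaging is assumed here), and the tie rule when the adjudicator is equally close to both original scores is an arbitrary choice.

```python
from statistics import mean


def rater_mean(r1, r2):
    """Average the two original raters' scores."""
    return mean([r1, r2])


def three_way_mean(r1, r2, adjudicator):
    """Average the two original scores together with the adjudicator's score."""
    return mean([r1, r2, adjudicator])


def closest_match(r1, r2, adjudicator):
    """Pair the adjudicator's score with the closest original score.

    Assumption: the matched pair is averaged; the abstract does not specify
    how the two scores are combined.
    Assumption: on a tie, the first rater's score is used.
    """
    closest = min((r1, r2), key=lambda s: abs(s - adjudicator))
    return mean([closest, adjudicator])
```

For example, with original scores of 2 and 4 and an adjudicator score of 4, the three functions return 3, about 3.33, and 4, which shows how the reported score can differ across resolution methods.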
Pages: 299-322
Number of pages: 24
Related papers
50 in total
  • [1] Score resolution and the interrater reliability of holistic scores in rating essays
    Johnson, RL
    Penny, J
    Gordon, B
    [J]. WRITTEN COMMUNICATION, 2001, 18 (02) : 229 - 249
  • [2] Validity and reliability of the SPORTS score
    Blonna, Davide
    Castoldi, Filippo
    Delicio, Davide
    Bruzzone, Matteo
    Dettoni, Federico
    Bonasia, Davide Edoardo
    Rossi, Roberto
    [J]. KNEE SURGERY SPORTS TRAUMATOLOGY ARTHROSCOPY, 2012, 20 (02) : 356 - 360
  • [3] Reliability and validity of a steadiness score
    Clark, DO
    Callahan, CM
    Counsell, SR
    [J]. JOURNAL OF THE AMERICAN GERIATRICS SOCIETY, 2005, 53 (09) : 1582 - 1586
  • [4] Validity and reliability of the SPORTS score
    Davide Blonna
    Filippo Castoldi
    Davide Delicio
    Matteo Bruzzone
    Federico Dettoni
    Davide Edoardo Bonasia
    Roberto Rossi
    [J]. Knee Surgery, Sports Traumatology, Arthroscopy, 2012, 20 : 356 - 360
  • [5] Reliability and validity of a steadiness score
    Clark, DO
    Callahan, CM
    Counsell, SR
    [J]. JOURNAL OF GENERAL INTERNAL MEDICINE, 2005, 20 : 58 - 58
  • [6] Reliability, validity, and utility of tests and scores
    Hoffmann, Andreas
    [J]. EUROPEAN JOURNAL OF PREVENTIVE CARDIOLOGY, 2024, 31 (06) : 667 - 667
  • [7] Note on reliability and validity of change scores
    Williams, RH
    Zimmerman, DW
    Cummings, N
    [J]. PERCEPTUAL AND MOTOR SKILLS, 1996, 82 (03) : 785 - 786
  • [8] The reliability vs the validity of test scores
    Carr, HA
    [J]. PSYCHOLOGICAL REVIEW, 1938, 45 : 435 - 440
  • [9] The reliability and validity of weighted composite scores
    Kane, M
    Case, SM
    [J]. APPLIED MEASUREMENT IN EDUCATION, 2004, 17 (03) : 221 - 240
  • [10] The Penn Shoulder Score: Reliability and validity
    Leggin, BG
    Michener, LA
    Shaffer, MA
    Brenneman, SK
    Iannotti, JP
    Williams, GR
    [J]. JOURNAL OF ORTHOPAEDIC & SPORTS PHYSICAL THERAPY, 2006, 36 (03) : 138 - 151