Item response theory model highlighting rating scale of a rubric and rater-rubric interaction in objective structured clinical examination

Cited by: 0
Authors
Uto, Masaki [1]
Tsuruta, Jun [2]
Araki, Kouji [3]
Ueno, Maomi [1]
Affiliations
[1] Univ Electrocommun, Dept Comp & Network Engn, Chofu, Tokyo, Japan
[2] Tokyo Med & Dent Univ, Inst Educ, Bunkyo Ku, Tokyo, Japan
[3] Tokyo Med & Dent Univ, Grad Sch Med & Dent Sci, Educ Syst Dent, Bunkyo Ku, Tokyo, Japan
Source
PLOS ONE | 2024 / Vol. 19 / Issue 09
Funding
Japan Society for the Promotion of Science;
Keywords
GENERALIZABILITY THEORY; RASCH MODEL; PERFORMANCE; DRIFT; TIME;
DOI
10.1371/journal.pone.0309887
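Chinese Library Classification number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Subject classification code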
07 ; 0710 ; 09 ;
Abstract
Objective structured clinical examinations (OSCEs) are a widely used performance assessment for medical and dental students. A common limitation of OSCEs is that the evaluation results depend on the characteristics of raters and a scoring rubric. To overcome this limitation, item response theory (IRT) models such as the many-facet Rasch model have been proposed to estimate examinee abilities while taking into account the characteristics of raters and evaluation items in a rubric. However, conventional IRT models have two impractical assumptions: constant rater severity across all evaluation items in a rubric and an equal interval rating scale among evaluation items, which can decrease model fitting and ability measurement accuracy. To resolve this problem, we propose a new IRT model that introduces two parameters: (1) a rater-item interaction parameter representing the rater severity for each evaluation item and (2) an item-specific step-difficulty parameter representing the difference in rating scales among evaluation items. We demonstrate the effectiveness of the proposed model by applying it to actual data collected from a medical interview test conducted at Tokyo Medical and Dental University as part of a post-clinical clerkship OSCE. The experimental results showed that the proposed model was well-fitted to our OSCE data and measured ability accurately. Furthermore, it provided abundant information on rater and item characteristics that conventional models cannot, helping us to better understand rater and item properties.
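The two parameters named in the abstract can be illustrated with a minimal sketch. It assumes a many-facet Rasch-style (partial credit) form in which the rater-item interaction and the item-specific step difficulties both enter the category logits additively. The function names, the parameterization, and the numeric values are hypothetical, chosen for illustration only; this is not the exact model specified in the paper.

```python
import math


def category_probs(theta, item_difficulty, rater_item_severity, item_steps):
    """Rating-category probabilities for one examinee, item, and rater.

    Assumed form (illustrative, not the paper's specification):
    P(X = k) is proportional to
    exp(sum over m <= k of (theta - item_difficulty
                            - rater_item_severity - item_steps[m])).
    """
    # Category 0 is the reference category with cumulative logit 0.
    logits = [0.0]
    for step in item_steps:
        increment = theta - item_difficulty - rater_item_severity - step
        logits.append(logits[-1] + increment)
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(logits)
    weights = [math.exp(value - peak) for value in logits]
    total = sum(weights)
    return [w / total for w in weights]


def expected_score(probs):
    """Expected rating under the given category probabilities."""
    return sum(k * p for k, p in enumerate(probs))


# Hypothetical values: the same examinee and item, scored by a rater who is
# lenient on this item versus a rater who is severe on this item.
lenient = category_probs(0.5, 0.0, -0.5, [-1.0, 0.2, 1.1])
severe = category_probs(0.5, 0.0, 1.0, [-1.0, 0.2, 1.1])
print(expected_score(lenient), expected_score(severe))
```

Because the severity term is indexed by the rater-item pair and the step list belongs to a single item, the same rater can be severe on one evaluation item and lenient on another, and each item can have unequal spacing between adjacent rating categories.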
Pages: 23
Related papers
15 in total
  • [1] A Multidimensional Item Response Theory Model for Rubric-Based Writing Assessment
    Uto, Masaki
    ARTIFICIAL INTELLIGENCE IN EDUCATION (AIED 2021), PT I, 2021, 12748 : 420 - 432
  • [2] Temporal stability of objective structured clinical exams: a longitudinal study employing item response theory
    Baig, Lubna A.
    Violato, Claudio
    BMC MEDICAL EDUCATION, 2012, 12
  • [3] Temporal stability of objective structured clinical exams: a longitudinal study employing item response theory
    Lubna A Baig
    Claudio Violato
    BMC Medical Education, 12
  • [4] Development and psychometric testing of the 10-item satisfaction with Nursing Skill Examination: Objective Structured Clinical Assessment scale
    Hunt, Leanne
    Ramjan, Lucie M.
    Daly, Miranda
    Lewis, Peter
    O'Reilly, Rebecca
    Willis, Sue
    Salamonson, Yenna
    NURSE EDUCATION IN PRACTICE, 2020, 45
  • [5] Item Response Theory Reveals Variability of Functional Impairment within Clinical Dementia Rating Scale Stages
    Miller, Tyler M.
    Balsis, Steve
    Lowe, Deborah A.
    Benge, Jared F.
    Doody, Rachelle S.
    DEMENTIA AND GERIATRIC COGNITIVE DISORDERS, 2011, 32 (05) : 362 - 366
  • [6] Item response theory in early phase clinical trials: Utilization of a reference model to analyze the Montgomery-Asberg Depression Rating Scale
    Otto, Marije E.
    Bergmann, Kirsten R.
    de Kam, Marieke L.
    Recourt, Kasper
    Jacobs, Gabriel E.
    van Esdonk, Michiel J.
    CPT-PHARMACOMETRICS & SYSTEMS PHARMACOLOGY, 2023, 12 (10) : 1425 - 1436
  • [7] Evaluation of a skin self examination attitude scale using an item response theory model approach
    Djaja, Ngadiman
    Youl, Pip
    Aitken, Joanne
    Janda, Monika
    HEALTH AND QUALITY OF LIFE OUTCOMES, 2014, 12
  • [8] Evaluation of a skin self examination attitude scale using an item response theory model approach
    Ngadiman Djaja
    Pip Youl
    Joanne Aitken
    Monika Janda
    Health and Quality of Life Outcomes, 12
  • [9] A many-facet Rasch measurement model approach to investigating objective structured clinical examination item parameter drift
    Coetzee, Karen
    Monteiro, Sandra
    Amirthalingam, Luxshi
    JOURNAL OF EVALUATION IN CLINICAL PRACTICE, 2025, 31 (01)
  • [10] Impact of Task-based Checklist Scoring and Two Domains Global Rating Scale in Objective Structured Clinical Examination of Pharmacy Students
    Veettil, Sajesh Kalkandi
    Rajiah, Kingston
    INDIAN JOURNAL OF PHARMACEUTICAL EDUCATION AND RESEARCH, 2016, 50 (01) : 17 - 23