Semi-automatic coding of open-ended text responses in large-scale assessments

被引:6
|
作者
Andersen, Nico [1 ]
Zehner, Fabian [1 ,2 ]
Goldhammer, Frank [1 ,2 ]
机构
[1] DIPF, Leibniz Inst Res & Informat Educ, Rostocker Str 6, D-60323 Frankfurt, Germany
[2] Ctr Int Student Assessment ZIB eV, Frankfurt, Germany
关键词
clustering; eco; effort reduction; exploring coding assistant; semi-automatic coding; support human raters; AGREEMENT; TRENDS;
D O I
10.1111/jcal.12717
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Background In the context of large-scale educational assessments, the effort required to code open-ended text responses is considerably more expensive and time-consuming than the evaluation of multiple-choice responses because it requires trained personnel and long manual coding sessions. Aim Our semi-supervised coding method eco (exploring coding assistant) dynamically supports human raters by automatically coding a subset of the responses. Method We map normalized response texts into a semantic space and cluster response vectors based on their semantic similarity. Assuming that similar codes represent semantically similar responses, we propagate codes to responses in optimally homogeneous clusters. Cluster homogeneity is assessed by strategically querying informative responses and presenting them to a human rater. Following each manual coding, the method estimates the code distribution respecting a certainty interval and assumes a homogeneous distribution if certainty exceeds a predefined threshold. If a cluster is determined to certainly comprise homogeneous responses, all remaining responses are coded accordingly automatically. We evaluated the method in a simulation using different data sets. Results With an average miscoding of about 3%, the method reduced the manual coding effort by an average of about 52%. Conclusion Combining the advantages of automatic and manual coding produces considerable coding accuracy and reduces the required manual effort.
引用
收藏
页码:841 / 854
页数:14
相关论文
共 50 条
  • [21] Self-coding: A method to assess semantic validity and bias when coding open-ended responses
    Glazier, Rebecca A.
    Boydstun, Amber E.
    Feezell, Jessica T.
    RESEARCH & POLITICS, 2021, 8 (03)
  • [22] A Hybrid Text Summarization Technique of Student Open-Ended Responses to Online Educational Surveys
    Karousos, Nikos
    Vorvilas, George
    Pantazi, Despoina
    Verykios, Vassilios S.
    ELECTRONICS, 2024, 13 (18)
  • [23] Application of latent semantic analysis for open-ended responses in a large, epidemiologic study
    Travis D Leleu
    Isabel G Jacobson
    Cynthia A LeardMann
    Besa Smith
    Peter W Foltz
    Paul J Amoroso
    Marcia A Derr
    Margaret AK Ryan
    Tyler C Smith
    BMC Medical Research Methodology, 11
  • [24] Application of latent semantic analysis for open-ended responses in a large, epidemiologic study
    Leleu, Travis D.
    Jacobson, Isabel G.
    LeardMann, Cynthia A.
    Smith, Besa
    Foltz, Peter W.
    Amoroso, Paul J.
    Derr, Marcia A.
    Ryan, Margaret A. K.
    Smith, Tyler C.
    BMC MEDICAL RESEARCH METHODOLOGY, 2011, 11
  • [25] Semi-automatic porting of a large-scale CFD code to multi-graphics processing unit clusters
    Corrigan, Andrew
    Loehner, Rainald
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2012, 69 (11) : 1786 - 1796
  • [26] Updating of large-scale structural dynamic models by a semi-automatic two-level optimisation procedure
    Calvi, A
    Morris, AJ
    EUROPEAN CONFERENCE ON SPACECRAFT STRUCTURES, MATERIALS AND MECHANICAL TESTING, PROCEEDINGS, 2001, 468 : 167 - 174
  • [27] An Accelerated Method of a Generalized Transition Matrix Model Using Characteristic Basis Functions for Large-Scale Open-Ended Cavities
    Kim, Inhwan
    Im, Hyeong-Rae
    Hong, Ic-Pyo
    Lee, Hyunsoo
    Yook, Jong-Gwan
    IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2024, 72 (08) : 6813 - 6818
  • [28] Large-scale model experimental study on cyclic penetration process and vertical bearing characteristics of open-ended pipe piles
    Zhu, Huai-long
    Zhu, Bi-tang
    Luo, Ru-ping
    Xu, Chang-jie
    ROCK AND SOIL MECHANICS, 2024, 45 (11) : 3173 - 3184
  • [29] Technique for high axial shielding factor performance of large-scale, thin, open-ended, cylindrical Metglas magnetic shields
    Malkowski, S.
    Adhikari, R.
    Hona, B.
    Mattie, C.
    Woods, D.
    Yan, H.
    Plaster, B.
    REVIEW OF SCIENTIFIC INSTRUMENTS, 2011, 82 (07):
  • [30] MarioGPT: Open-Ended Text2Level Generation through Large Language Models
    Sudhakaran, Shyam
    Gonzalez-Duque, Miguel
    Freiberger, Matthias
    Glanois, Claire
    Najarro, Elias
    Risi, Sebastian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,