Semi-automatic coding of open-ended text responses in large-scale assessments

Cited by: 6
Authors
Andersen, Nico [1]
Zehner, Fabian [1, 2]
Goldhammer, Frank [1, 2]
Affiliations
[1] DIPF, Leibniz Inst Res & Informat Educ, Rostocker Str 6, D-60323 Frankfurt, Germany
[2] Ctr Int Student Assessment ZIB eV, Frankfurt, Germany
Keywords
clustering; eco; effort reduction; exploring coding assistant; semi-automatic coding; support human raters; AGREEMENT; TRENDS
DOI
10.1111/jcal.12717
Chinese Library Classification number
G40 [Education]
Discipline classification code
040101; 120403
Abstract
Background: In the context of large-scale educational assessments, the effort required to code open-ended text responses is considerably more expensive and time-consuming than the evaluation of multiple-choice responses because it requires trained personnel and long manual coding sessions.
Aim: Our semi-supervised coding method eco (exploring coding assistant) dynamically supports human raters by automatically coding a subset of the responses.
Method: We map normalized response texts into a semantic space and cluster response vectors based on their semantic similarity. Assuming that similar codes represent semantically similar responses, we propagate codes to responses in optimally homogeneous clusters. Cluster homogeneity is assessed by strategically querying informative responses and presenting them to a human rater. Following each manual coding, the method estimates the code distribution respecting a certainty interval and assumes a homogeneous distribution if certainty exceeds a predefined threshold. If a cluster is determined to certainly comprise homogeneous responses, all remaining responses are coded accordingly automatically. We evaluated the method in a simulation using different data sets.
Results: With an average miscoding of about 3%, the method reduced the manual coding effort by an average of about 52%.
Conclusion: Combining the advantages of automatic and manual coding produces considerable coding accuracy and reduces the required manual effort.
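The query-and-propagate step described under Method can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not name the certainty estimator or the query strategy, so the sketch assumes a Wilson score lower bound on the share of the majority code and queries responses in the order given. The names `wilson_lower_bound`, `code_cluster`, and `ask_rater`, as well as the threshold of 0.8, are hypothetical. Embedding and clustering are left out; the function takes the response IDs of one already-formed cluster.

```python
import math
from collections import Counter


def wilson_lower_bound(successes, n, z=1.96):
    """Lower end of the Wilson score interval for a binomial proportion.

    Assumption: stands in for the abstract's unspecified certainty estimate.
    """
    if n == 0:
        return 0.0
    p = successes / n
    denom = 1 + z * z / n
    centre = p + z * z / (2 * n)
    margin = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n))
    return (centre - margin) / denom


def code_cluster(response_ids, ask_rater, threshold=0.8, z=1.96):
    """Code one cluster: query the rater until the cluster looks homogeneous.

    response_ids: IDs of the responses in one cluster, in query order
                  (assumption: a fixed order replaces strategic querying).
    ask_rater:    callable returning the human code for a response ID.
    Returns (codes, n_manual): a dict of ID -> code and the number of
    responses that were coded manually.
    """
    codes = {}
    counts = Counter()
    for i, rid in enumerate(response_ids):
        codes[rid] = ask_rater(rid)          # manual coding step
        counts[codes[rid]] += 1
        majority_code, k = counts.most_common(1)[0]
        # Propagate once the lower bound on the majority share is high enough.
        if wilson_lower_bound(k, i + 1, z) >= threshold:
            for rest in response_ids[i + 1:]:
                codes[rest] = majority_code  # automatic coding step
            return codes, i + 1
    # Certainty never reached the threshold: everything was coded manually.
    return codes, len(response_ids)


if __name__ == "__main__":
    # Toy cluster of 40 responses that a rater would all code "correct".
    truth = {f"r{i}": "correct" for i in range(40)}
    codes, n_manual = code_cluster(list(truth), truth.get)
    print(n_manual, "of", len(truth), "responses coded manually")
```

Under these assumptions, a fully homogeneous cluster of 40 responses is propagated after 16 manual codings, while a cluster whose codes alternate never reaches the threshold and is coded entirely by hand. A stricter threshold trades more manual effort for a lower risk of miscoding.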
Pages: 841-854
Number of pages: 14