Semi-automatic coding of open-ended text responses in large-scale assessments

被引:6
|
作者
Andersen, Nico [1 ]
Zehner, Fabian [1 ,2 ]
Goldhammer, Frank [1 ,2 ]
机构
[1] DIPF, Leibniz Inst Res & Informat Educ, Rostocker Str 6, D-60323 Frankfurt, Germany
[2] Ctr Int Student Assessment ZIB eV, Frankfurt, Germany
关键词
clustering; eco; effort reduction; exploring coding assistant; semi-automatic coding; support human raters; AGREEMENT; TRENDS;
D O I
10.1111/jcal.12717
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Background In the context of large-scale educational assessments, the effort required to code open-ended text responses is considerably more expensive and time-consuming than the evaluation of multiple-choice responses because it requires trained personnel and long manual coding sessions. Aim Our semi-supervised coding method eco (exploring coding assistant) dynamically supports human raters by automatically coding a subset of the responses. Method We map normalized response texts into a semantic space and cluster response vectors based on their semantic similarity. Assuming that similar codes represent semantically similar responses, we propagate codes to responses in optimally homogeneous clusters. Cluster homogeneity is assessed by strategically querying informative responses and presenting them to a human rater. Following each manual coding, the method estimates the code distribution respecting a certainty interval and assumes a homogeneous distribution if certainty exceeds a predefined threshold. If a cluster is determined to certainly comprise homogeneous responses, all remaining responses are coded accordingly automatically. We evaluated the method in a simulation using different data sets. Results With an average miscoding of about 3%, the method reduced the manual coding effort by an average of about 52%. Conclusion Combining the advantages of automatic and manual coding produces considerable coding accuracy and reduces the required manual effort.
引用
收藏
页码:841 / 854
页数:14
相关论文
共 50 条
  • [31] JTrans: An open-source software for semi-automatic text-to-speech alignment
    LORIA INRIA, UMR 7503, France
    Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH, (1823-1826):
  • [32] JTrans: an open-source software for semi-automatic text-to-speech alignment
    Cerisara, C.
    Mella, O.
    Fohr, D.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1799 - 1802
  • [33] Automatic Coding of Open-ended Questions into Multiple Classes: Whether and How to Use Double Coded Data
    He, Zhoushanyue
    Schonlau, Matthias
    SURVEY RESEARCH METHODS, 2020, 14 (03): : 267 - 278
  • [34] Large-scale ecological red line planning in urban agglomerations using a semi-automatic intelligent zoning method
    Lin, Jinyao
    Li, Xia
    SUSTAINABLE CITIES AND SOCIETY, 2019, 46
  • [35] Open-sourced semi-automatic program for ultrasound assessments of femoral trochlea cartilage health
    White, McKenzie S.
    Palmieri-Smith, Riann M.
    Lepley, Lindsey K.
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING, 2024, 27 (04) : 531 - 537
  • [36] A supporting system for coding of the answers from an open-ended question - An automatic coding system for SSM occupational data by case frame
    Takahashi, K
    SOCIOLOGICAL THEORY AND METHODS, 2000, 15 (01) : 149 - 164
  • [37] Coding Text Answers to Open-ended Questions: Human Coders and Statistical Learning Algorithms Make Similar Mistakes
    He, Zhoushanyue
    Schonlau, Matthias
    METHODS DATA ANALYSES, 2021, 15 (01): : 103 - 119
  • [38] Semi-automatic acoustic model generation from large unsynchronized audio and text chunks
    Alessandrini, Michele
    Biagetti, Giorgio
    Curzi, Alessandro
    Turchetti, Claudio
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1692 - 1695
  • [39] Petri net models for the semi-automatic construction of large scale biological networks
    Chen, Ming
    Hariharaputran, Sridhar
    Hofestaedt, Ralf
    Kormeier, Benjamin
    Spangardt, Sarah
    NATURAL COMPUTING, 2011, 10 (03) : 1077 - 1097
  • [40] Automatic label curation from large-scale text corpus
    Avasthi, Sandhya
    Chauhan, Ritu
    ENGINEERING RESEARCH EXPRESS, 2024, 6 (01):