Semi-automatic coding of open-ended text responses in large-scale assessments

Cited by: 6

Authors:

Andersen, Nico [1]

Zehner, Fabian [1,2]

Goldhammer, Frank [1,2]

Affiliations:

[1] DIPF, Leibniz Inst Res & Informat Educ, Rostocker Str 6, D-60323 Frankfurt, Germany

[2] Ctr Int Student Assessment ZIB eV, Frankfurt, Germany

Source:

JOURNAL OF COMPUTER ASSISTED LEARNING | 2023 / Volume 39 / Issue 03

Keywords:

clustering; eco; effort reduction; exploring coding assistant; semi-automatic coding; support human raters; AGREEMENT; TRENDS;

DOI:

10.1111/jcal.12717

Chinese Library Classification (CLC) number:

G40 [Education];

Discipline classification code:

040101 ; 120403 ;

Abstract:

Background: In the context of large-scale educational assessments, the effort required to code open-ended text responses is considerably more expensive and time-consuming than the evaluation of multiple-choice responses because it requires trained personnel and long manual coding sessions.

Aim: Our semi-supervised coding method eco (exploring coding assistant) dynamically supports human raters by automatically coding a subset of the responses.

Method: We map normalized response texts into a semantic space and cluster response vectors based on their semantic similarity. Assuming that similar codes represent semantically similar responses, we propagate codes to responses in optimally homogeneous clusters. Cluster homogeneity is assessed by strategically querying informative responses and presenting them to a human rater. Following each manual coding, the method estimates the code distribution respecting a certainty interval and assumes a homogeneous distribution if certainty exceeds a predefined threshold. If a cluster is determined to certainly comprise homogeneous responses, all remaining responses are coded accordingly automatically. We evaluated the method in a simulation using different data sets.

Results: With an average miscoding of about 3%, the method reduced the manual coding effort by an average of about 52%.

Conclusion: Combining the advantages of automatic and manual coding produces considerable coding accuracy and reduces the required manual effort.
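The query-then-propagate step described in the Method part of the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes responses have already been embedded and grouped into a cluster, it queries responses in list order rather than "strategically", and it uses the lower bound of a Wilson score interval as a stand-in for the abstract's unspecified "certainty interval". The names `wilson_lower_bound`, `code_cluster`, `ask_rater`, and the default `threshold` are hypothetical.

```python
import math
from collections import Counter


def wilson_lower_bound(successes: int, n: int, z: float = 1.96) -> float:
    """Lower end of the Wilson score interval for a binomial proportion.

    Assumption: used here only as one possible "certainty" estimate.
    """
    if n == 0:
        return 0.0
    p = successes / n
    denom = 1 + z * z / n
    centre = p + z * z / (2 * n)
    margin = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n))
    return (centre - margin) / denom


def code_cluster(responses, ask_rater, threshold=0.8, z=1.96):
    """Code one cluster, asking the human rater as rarely as possible.

    responses: list of response texts belonging to a single cluster.
    ask_rater: callable that returns the manual code for a response
               (hypothetical stand-in for the human rater).
    Returns (codes, n_manual): one code per response, in input order,
    and the number of responses that were coded manually.
    """
    codes = []
    counts = Counter()
    for n_manual, response in enumerate(responses, start=1):
        code = ask_rater(response)
        codes.append(code)
        counts[code] += 1
        majority, hits = counts.most_common(1)[0]
        # Treat the cluster as homogeneous once the lower bound on the
        # majority code's share reaches the threshold, then propagate.
        if wilson_lower_bound(hits, n_manual, z) >= threshold:
            codes.extend([majority] * (len(responses) - n_manual))
            return codes, n_manual
    # Threshold never reached: every response was coded manually.
    return codes, len(responses)
```

Under these assumptions, a cluster in which every queried response receives the same code has a lower bound of n / (n + z^2) after n manual codes, so with z = 1.96 and a threshold of 0.8 propagation starts after 16 manual codes. A cluster whose codes stay mixed never reaches the threshold and is coded entirely by hand, which is the behaviour the abstract describes for non-homogeneous clusters.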


Pages: 841-854

Page count: 14
