Automated Analysis of Algorithm Descriptions Quality, Through Large Language Models

被引：0

作者：

Sterbini, Andrea ^{[1
]}

Temperini, Marco ^{[2
]}

机构：

[1] Sapienza Univ Rome, Dept Comp Sci, Rome, Italy

[2] Sapienza Univ Rome, Dept Comp Control & Management Engn, Rome, Italy

来源：

GENERATIVE INTELLIGENCE AND INTELLIGENT TUTORING SYSTEMS, PT I, ITS 2024 | 2024年 / 14798卷

关键词：

Large Language Models; LLM-based Text Similarity; Peer Assessment; Automated Assessment;

D O I：

10.1007/978-3-031-63028-6_20

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we propose a method to classify the students' textual descriptions of algorithms. This work is based on a wealth of data (programming tasks, related algorithm descriptions, and Peer Assessment data), coming from 6 years of use of the system Q2A, in a "Fundamentals of Computer Programming" course, given at first year in our university's Computer Science curriculum. The descriptions are submitted, as part of the answer to a computer programming task, through Q2A, and are subject to (formative) Peer Assessment. The proposed classification method aims to support the teacher on the analysis of the quite numerous students' descriptions, in ours as well as in other similar systems. We 1) process the students' submissions, by topic automated extraction (BERTopic) and by separate Large Language Models, 2) compute their degree of suitability as "algorithm description", in a scale from BAD to GOOD, and 3) compare the obtained classification with those coming from the teacher's direct assessment (expert: one of the authors), and from the Peer Assessment. The automated classification does correlate with both the expert classification and the grades given by the peers to the "clarity" of the descriptions. This result is encouraging in view of the production of a Q2A subsystem allowing the teacher to analyse the students' submissions guided by an automated classification, and ultimately support fully automated grading.

引用

页码：258 / 271

页数：14

共 50 条

[31] Revisiting Automated Topic Model Evaluation with Large Language Models
Stammbach, Dominik
Zouhar, Vilem
Hoyle, Alexander
Sachan, Mrinmaya
Ash, Elliott
2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 9348 - 9357
[32] Automated Paper Screening for Clinical Reviews Using Large Language Models: Data Analysis Study
Guo, Eddie
Gupta, Mehul
Deng, Jiawen
Park, Ye-Jean
Paget, Michael
Naugler, Christopher
JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26
[33] Are Large Language Models Reliable Argument Quality Annotators?
Mirzakhmedova, Nailia
Gohsen, Marcel
Chang, Chia Hao
Stein, Benno
ROBUST ARGUMENTATION MACHINES, RATIO 2024, 2024, 14638 : 129 - 146
[34] QoEXplainer: Mediating Explainable Quality of Experience Models with Large Language Models
Wehner, Nikolas
Feldhus, Nils
Seufert, Michael
Moeller, Sebastian
Hossfeld, Tobias
2024 16TH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE, QOMEX 2024, 2024, : 72 - 75
[35] Aligning Large Language Models through Synthetic Feedback
Kim, Sungdong
Bae, Sanghwan
Shin, Jamin
Kang, Soyoung
Kwak, Donghyun
Yoo, Kang Min
Seo, Minjoon
2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 13677 - 13700
[36] Toward Keyword Generation through Large Language Models
Lee, Wanhae
Chun, Minki
Jeong, Hyeonhak
Jung, Hyunggu
COMPANION PROCEEDINGS OF 2023 28TH ANNUAL CONFERENCE ON INTELLIGENT USER INTERFACES, IUI 2023 COMPANION, 2023, : 37 - 40
[37] The effectiveness of large language models with RAG for auto-annotating trait and phenotype descriptions
Kainer, David
BIOLOGY METHODS & PROTOCOLS, 2025, 10 (01):
[38] Automated analysis of natural language properties for UML models
Konrad, S
Cheng, BHC
SATELLITE EVENTS AT THE MODELS 2005 CONFERENCE, 2006, 3844 : 48 - 57
[39] FinSoSent: Advancing Financial Market Sentiment Analysis through Pretrained Large Language Models
Delgadillo, Josiel
Kinyua, Johnson
Mutigwe, Charles
BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (08)
[40] Trend Analysis of Large Language Models through a Developer Community: A Focus on Stack Overflow
Son, Jungha
Kim, Boyoung
INFORMATION, 2023, 14 (11)

← 1 2 3 4 5 →