Automated Analysis of Algorithm Descriptions Quality, Through Large Language Models

被引:0
|
作者
Sterbini, Andrea [1 ]
Temperini, Marco [2 ]
机构
[1] Sapienza Univ Rome, Dept Comp Sci, Rome, Italy
[2] Sapienza Univ Rome, Dept Comp Control & Management Engn, Rome, Italy
来源
GENERATIVE INTELLIGENCE AND INTELLIGENT TUTORING SYSTEMS, PT I, ITS 2024 | 2024年 / 14798卷
关键词
Large Language Models; LLM-based Text Similarity; Peer Assessment; Automated Assessment;
D O I
10.1007/978-3-031-63028-6_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a method to classify the students' textual descriptions of algorithms. This work is based on a wealth of data (programming tasks, related algorithm descriptions, and Peer Assessment data), coming from 6 years of use of the system Q2A, in a "Fundamentals of Computer Programming" course, given at first year in our university's Computer Science curriculum. The descriptions are submitted, as part of the answer to a computer programming task, through Q2A, and are subject to (formative) Peer Assessment. The proposed classification method aims to support the teacher on the analysis of the quite numerous students' descriptions, in ours as well as in other similar systems. We 1) process the students' submissions, by topic automated extraction (BERTopic) and by separate Large Language Models, 2) compute their degree of suitability as "algorithm description", in a scale from BAD to GOOD, and 3) compare the obtained classification with those coming from the teacher's direct assessment (expert: one of the authors), and from the Peer Assessment. The automated classification does correlate with both the expert classification and the grades given by the peers to the "clarity" of the descriptions. This result is encouraging in view of the production of a Q2A subsystem allowing the teacher to analyse the students' submissions guided by an automated classification, and ultimately support fully automated grading.
引用
收藏
页码:258 / 271
页数:14
相关论文
共 50 条
  • [31] Revisiting Automated Topic Model Evaluation with Large Language Models
    Stammbach, Dominik
    Zouhar, Vilem
    Hoyle, Alexander
    Sachan, Mrinmaya
    Ash, Elliott
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 9348 - 9357
  • [32] Automated Paper Screening for Clinical Reviews Using Large Language Models: Data Analysis Study
    Guo, Eddie
    Gupta, Mehul
    Deng, Jiawen
    Park, Ye-Jean
    Paget, Michael
    Naugler, Christopher
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26
  • [33] Are Large Language Models Reliable Argument Quality Annotators?
    Mirzakhmedova, Nailia
    Gohsen, Marcel
    Chang, Chia Hao
    Stein, Benno
    ROBUST ARGUMENTATION MACHINES, RATIO 2024, 2024, 14638 : 129 - 146
  • [34] QoEXplainer: Mediating Explainable Quality of Experience Models with Large Language Models
    Wehner, Nikolas
    Feldhus, Nils
    Seufert, Michael
    Moeller, Sebastian
    Hossfeld, Tobias
    2024 16TH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE, QOMEX 2024, 2024, : 72 - 75
  • [35] Aligning Large Language Models through Synthetic Feedback
    Kim, Sungdong
    Bae, Sanghwan
    Shin, Jamin
    Kang, Soyoung
    Kwak, Donghyun
    Yoo, Kang Min
    Seo, Minjoon
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 13677 - 13700
  • [36] Toward Keyword Generation through Large Language Models
    Lee, Wanhae
    Chun, Minki
    Jeong, Hyeonhak
    Jung, Hyunggu
    COMPANION PROCEEDINGS OF 2023 28TH ANNUAL CONFERENCE ON INTELLIGENT USER INTERFACES, IUI 2023 COMPANION, 2023, : 37 - 40
  • [37] The effectiveness of large language models with RAG for auto-annotating trait and phenotype descriptions
    Kainer, David
    BIOLOGY METHODS & PROTOCOLS, 2025, 10 (01):
  • [38] Automated analysis of natural language properties for UML models
    Konrad, S
    Cheng, BHC
    SATELLITE EVENTS AT THE MODELS 2005 CONFERENCE, 2006, 3844 : 48 - 57
  • [39] FinSoSent: Advancing Financial Market Sentiment Analysis through Pretrained Large Language Models
    Delgadillo, Josiel
    Kinyua, Johnson
    Mutigwe, Charles
    BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (08)
  • [40] Trend Analysis of Large Language Models through a Developer Community: A Focus on Stack Overflow
    Son, Jungha
    Kim, Boyoung
    INFORMATION, 2023, 14 (11)