CSEPrompts: A Benchmark of Introductory Computer Science Prompts

被引:0
|
作者
Raihan, Nishat [1 ]
Goswami, Dhiman [1 ]
Puspo, Sadiya Sayara Chowdhury [1 ]
Newman, Christian [2 ]
Ranasinghe, Tharindu [3 ]
Zampieri, Marcos [1 ]
机构
[1] George Mason Univ, Fairfax, VA 22030 USA
[2] Rochester Inst Technol, Rochester, NY USA
[3] Aston Univ, Birmingham, AL USA
关键词
Benchmark Dataset; Code LLM; Prompting;
D O I
10.1007/978-3-031-62700-2_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in AI, machine learning, and NLP have led to the development of a new generation of Large Language Models (LLMs) that are trained on massive amounts of data and often have trillions of parameters. Commercial applications (e.g., ChatGPT) have made this technology available to the general public, thus making it possible to use LLMs to produce high-quality texts for academic and professional purposes. Schools and universities are aware of the increasing use of AI-generated content by students and they have been researching the impact of this new technology and its potential misuse. Educational programs in Computer Science (CS) and related fields are particularly affected because LLMs are also capable of generating programming code in various programming languages. To help understand the potential impact of publicly available LLMs in CS education, we introduce CSEPrompts (https://github.com/mraihan-gmu/CSEPrompts), a framework with hundreds of programming exercise prompts and multiple-choice questions retrieved from introductory CS and programming courses. We also provide experimental results on CSEPrompts to evaluate the performance of several LLMs with respect to generating Python code and answering basic computer science and programming questions.
引用
收藏
页码:45 / 54
页数:10
相关论文
共 50 条
  • [1] The computer as a materials science benchmark
    Campbell, DJ
    Lorenz, JK
    Ellis, AB
    Kuech, TF
    Lisensky, GC
    Whittingham, MS
    JOURNAL OF CHEMICAL EDUCATION, 1998, 75 (03) : 297 - 312
  • [2] The Computer as a Materials Science Benchmark
    Campbell, D. J.
    Lorenz, J. K.
    Ellis, A. B.
    Kuech, T. F.
    Journal of Chemical Education, 75 (03):
  • [3] Inspire with Introductory Computer Science Response
    Guzdial, Mark
    COMMUNICATIONS OF THE ACM, 2009, 52 (07) : 8 - 8
  • [4] Supporting reflection in introductory computer science
    Fekete, A
    Kay, J
    Kingston, J
    Wimalaratne, K
    SIGCSE 2000: PROCEEDINGS OF THE THIRTY-FIRST SIGCSE TECHNICAL SYMPOSIUM ON COMPUTER SCIENCE EDUCATION, 2000, 32 (01): : 144 - 148
  • [5] Enhancement of the introductory computer science curriculum
    Jones, RP
    Ruehr, KF
    Salter, RM
    FRONTIERS IN EDUCATION FIE'96 - 26TH ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1-3: TECHNOLOGY-BASED RE-ENGINEERING ENGINEERING EDUCATION, 1996, : 117 - 120
  • [6] Service Learning in Introductory Computer Science
    Egan, Mary Anne L.
    Johnson, Mathew
    ITICSE 2010: PROCEEDINGS OF THE 2010 ACM SIGCSE ANNUAL CONFERENCE ON INNOVATION AND TECHNOLOGY IN COMPUTER SCIENCE EDUCATION, 2010, : 8 - 12
  • [7] Integrating a Science Perspective into an Introductory Computer Science Course
    Magee, John J.
    Han, Li
    2013 IEEE INTEGRATED STEM EDUCATION CONFERENCE (ISEC), 2013,
  • [8] New directions in the introductory computer science curriculum
    Tucker, Allen B.
    Wegner, Peter
    SIGCSE Bulletin (Association for Computing Machinery, Special Interest Group on Computer Science Education), 1994, 26 (01): : 11 - 15
  • [9] INTRODUCTORY ROBOTICS FOR COMPUTER-SCIENCE STUDENTS
    MCKEEVER, D
    UNIVERSITY COMPUTING, 1988, 10 (04): : 194 - 199
  • [10] Integrated introductory course for computer science and engineering
    Parrish, Allen
    Borie, Richard
    Cordes, David
    Dixon, Brandon
    Jackson, Jeff
    Pimmel, Russ
    Proceedings - Frontiers in Education Conference, 1999, 1 : 3 - 12