A System to Grade Computer Programming Skills using Machine Learning

被引:42
|
作者
Srikant, Shashank [1 ]
Aggarwal, Varun [1 ]
机构
[1] Aspiring Minds, Boston, MA 02108 USA
关键词
Recruitment; Automatic grading; MOOC; Feature engineering; Supervised learning; One-class learning;
D O I
10.1145/2623330.2623377
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The automatic evaluation of computer programs is a nascent area of research with a potential for large-scale impact. Extant program assessment systems score mostly based on the number of test-cases passed, providing no insight into the competency of the programmer. In this paper, we present a system to grade computer programs automatically. In addition to grading a program on its programming practices and complexity, the key kernel of the system is a machine-learning based algorithm which determines closeness of the logic of the given program to a correct program. This algorithm uses a set of highly-informative features, derived from the abstract representations of a given program, that capture the program's functionality. These features are then used to learn a model to grade the programs, which are built against evaluations done by experts. We show that the regression models provide much better grading than the ubiquitous test-case-pass based grading and rivals the grading accuracy of other open-response problems such as essay grading . We also show that our novel features add significant value over and above basic keyword/expression count features. In addition to this, we propose a novel way of posing computer-program grading as a one-class modeling problem and report encouraging preliminary results. We show the value of the system through a case study in a real-world industrial deployment. To the best of the authors' knowledge, this is the first time a system using machine learning has been developed and used for grading programs. The work is timely with regard to the recent boom in Massively Online Open Courseware (MOOCs), which promises to produce a significant amount of hand-graded digitized data.
引用
收藏
页码:1887 / 1896
页数:10
相关论文
共 50 条
  • [31] Machine Learning of Bayesian Networks Using Constraint Programming
    van Beek, Peter
    Hoffmann, Hella-Franziska
    PRINCIPLES AND PRACTICE OF CONSTRAINT PROGRAMMING, CP 2015, 2015, 9255 : 429 - 445
  • [32] Programming quantum annealing computers using machine learning
    Behrman, E. C.
    Steck, J. E.
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 288 - 293
  • [33] Prediction of Frailty Grade Using Machine Learning Models
    Erdas, Cagatay Berke
    Olcer, Didem
    2022 MEDICAL TECHNOLOGIES CONGRESS (TIPTEKNO'22), 2022,
  • [34] Impacts of Using Creative Thinking Skills and Open Data on Programming Design in a Computer-supported Collaborative Learning Environment
    Chang, Jia-Wei
    Wang, Tzone-I
    Lee, Ming-Che
    Su, Chien-Yuan
    Chang, Po-Chin
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT), 2016, : 396 - 400
  • [35] An interactive e-learning system for improving web programming skills
    Elgamal, A. F.
    Abas, H. A.
    Baladoh, E. -S. M.
    EDUCATION AND INFORMATION TECHNOLOGIES, 2013, 18 (01) : 29 - 46
  • [36] Development of Computer Games for Training Programming Skills
    Shabalina, Olga
    Vorobkalov, Pavel
    Kataev, Alexander
    Davtian, Alexander
    Blanchfield, Peter
    PROCEEDINGS OF THE 6TH EUROPEAN CONFERENCE ON GAMES BASED LEARNING, 2012, : 460 - 470
  • [37] PolyGloT: A personalized and gamified eTutoring system for learning modelling and programming skills
    Bucchiarone, Antonio
    Martorella, Tommaso
    Frageri, Davide
    Colombo, Diego
    SCIENCE OF COMPUTER PROGRAMMING, 2024, 231
  • [38] Establishment of a Daqu Grade Classification Model Based on Computer Vision and Machine Learning
    Zhao, Mengke
    Han, Chaoyue
    Xue, Tinghui
    Ren, Chao
    Nie, Xiao
    Jing, Xu
    Hao, Haiyong
    Liu, Qifang
    Jia, Liyan
    FOODS, 2025, 14 (04)
  • [39] Teaching Computer Programming with Mathematics for Generating Digital Videos and Machine Learning Optimization
    Pattichis, Marios
    Lee, Hakeoung Hannah
    Celedon-Pattichis, Sylvia
    LopezLeiva, Carlos
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2023, PT I, 2023, 14184 : 196 - 205
  • [40] MACHINE LEARNING BASED MODELLING FOR ESTIMATION OF THE FUNDAMENTAL TIME PERIOD OF PRECAST CONCRETE STRUCTURES USING COMPUTER PROGRAMMING
    Dahiya, Nitin
    Saini, Babita
    Chalak, H. D.
    CIVIL ENGINEERING JOURNAL-STAVEBNI OBZOR, 2021, 30 (02): : 547 - 559