Grading Documentation with Machine Learning

被引:0
|
作者
Messer, Marcus [1 ]
Shi, Miaojing [2 ]
Brown, Neil C. C. [1 ]
Kolling, Michael [1 ]
机构
[1] Kings Coll London, Dept Informat, London, England
[2] Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China
关键词
Automated Grading; Assessment; Computer Science Education; Machine Learning; Large Language Models; Documentation; Programming Education;
D O I
10.1007/978-3-031-64302-6_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Professional developers, and especially students learning to program, often write poor documentation. While automated assessment for programming is becoming more common in educational settings, often using unit tests for code functionality and static analysis for code quality, documentation assessment is typically limited to detecting the presence and the correct formatting of a docstring based on a specified style guide. We aim to investigate how machine learning can be utilised to aid in automating the assessment of documentation quality. We classify a large set of publicly available human-annotated relevance scores between a natural language string and a code string, using traditional approaches, such as Logistic Regression and Random Forest, fine-tuned large language models, such as BERT and GPT, and Low-Rank Adaptation of large language models. Our most accurate mode was a fine-tuned CodeBERT model, resulting in a test accuracy of 89%.
引用
收藏
页码:105 / 117
页数:13
相关论文
共 50 条
  • [41] Automatic Gleason grading of prostate cancer using SLIM and machine learning
    Nguyen, Tan H.
    Sridharan, Shamira
    Marcias, Virgilia
    Balla, Andre K.
    Do, Minh N.
    Popescu, Gabriel
    [J]. QUANTITATIVE PHASE IMAGING II, 2016, 9718
  • [42] Disease Classification and Grading of Orange using Machine Learning and Fuzzy Logic
    Behera, Santi Kumari
    Jena, Lipsarani
    Rath, Amiya Kumar
    Sethy, Prabira Kumar
    [J]. PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 678 - 682
  • [43] Machine Learning Approach for Automatic Short Answer Grading: A Systematic Review
    Galhardi, Lucas Busatta
    Brancher, Jacques Duilio
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2018, 2018, 11238 : 380 - 391
  • [44] A review on machine learning techniques for the assessment of image grading in breast mammogram
    Khalil ur Rehman
    Jianqiang Li
    Yan Pei
    Anaa Yasin
    [J]. International Journal of Machine Learning and Cybernetics, 2022, 13 : 2609 - 2635
  • [45] Machine Learning for Cataract Classification/Grading on Ophthalmic Imaging Modalities: A Survey
    Xiao-Qing Zhang
    Yan Hu
    Zun-Jie Xiao
    Jian-Sheng Fang
    Risa Higashita
    Jiang Liu
    [J]. Machine Intelligence Research, 2022, 19 (03) : 184 - 208
  • [46] A Machine Learning Classification Algorithm for Vocabulary Grading in Chinese Language Teaching
    Zhang, Yinbing
    Song, Jihua
    Peng, Weiming
    Guo, Dongdong
    Song, Tianbao
    [J]. TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2021, 28 (03): : 845 - 855
  • [47] AUTOMATED HISTOPATHOLOGIC DIAGNOSIS AND GLEASON GRADING OF PROSTATE BIOPSIES WITH MACHINE LEARNING
    Kott, Ohad
    Linsley, Drew
    Amin, Ali
    Karagounis, Andreas
    Golijanin, Dragan
    Serre, Thomas
    Gershman, Boris
    [J]. JOURNAL OF UROLOGY, 2019, 201 (04): : E215 - E215
  • [48] Machine Learning for Cataract Classification/Grading on Ophthalmic Imaging Modalities: A Survey
    Xiao-Qing Zhang
    Yan Hu
    Zun-Jie Xiao
    Jian-Sheng Fang
    Risa Higashita
    Jiang Liu
    [J]. Machine Intelligence Research, 2022, 19 : 184 - 208
  • [49] Grading and sorting technique of dragon fruits using machine learning algorithms
    Patil, Pallavi U.
    Lande, Sudhir B.
    Nagalkar, Vinay J.
    Nikam, Sonal B.
    Wakchaure, G. C.
    [J]. JOURNAL OF AGRICULTURE AND FOOD RESEARCH, 2021, 4
  • [50] Development of a Dynamic Diagnosis Grading System for Infertility Using Machine Learning
    Liao, ShuJie
    Pan, Wei
    Dai, Wan-qiang
    Jin, Lei
    Huang, Ge
    Wang, Renjie
    Hu, Cheng
    Pan, Wulin
    Tu, Haiting
    [J]. JAMA NETWORK OPEN, 2020, 3 (11) : E2023654