Automatic Code Documentation Generation Using GPT-3

Cited by: 29
Authors
Khan, Junaed Younus [1]
Uddin, Gias [1]
Affiliations
[1] Univ Calgary, DISA Lab, Calgary, AB, Canada
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
code documentation; GPT-3; Machine Learning;
DOI
10.1145/3551349.3559548
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Source code documentation is an important artifact for efficient software development. Code documentation could greatly benefit from automation, since manual documentation is often laborious and resource- and time-intensive. In this paper, we employed Codex for automatic code documentation generation. Codex is a GPT-3-based model pre-trained on both natural and programming languages. We find that Codex outperforms existing techniques even with basic settings such as one-shot learning (i.e., providing only one example for training). Codex achieves an overall BLEU score of 20.6 across six different programming languages (an 11.2% improvement over earlier state-of-the-art techniques). Thus, Codex shows promise and warrants in-depth future studies of automatic code documentation generation to support diverse development tasks.
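As an illustration of the one-shot setting mentioned in the abstract, the sketch below assembles a prompt from a single worked (code, documentation) pair followed by the target code. It is a minimal sketch only: the function name `build_one_shot_prompt`, the "Code:" / "Documentation:" labels, and the sample snippets are assumptions made for illustration, not the prompt format used in the paper, and no model is called.

```python
def build_one_shot_prompt(example_code: str, example_doc: str, target_code: str) -> str:
    """Build a one-shot prompt: one worked example, then the code to document.

    The "Code:" / "Documentation:" labels are a hypothetical layout chosen
    for this sketch; the paper's actual prompt format may differ.
    """
    parts = [
        "Code:", example_code.strip(),
        "Documentation:", example_doc.strip(),
        "",  # blank line separating the example from the target
        "Code:", target_code.strip(),
        "Documentation:",  # left open for the model to complete
    ]
    return "\n".join(parts)


# Hypothetical sample inputs, invented for this illustration.
prompt = build_one_shot_prompt(
    "def add(a, b):\n    return a + b",
    "Return the sum of a and b.",
    "def is_even(n):\n    return n % 2 == 0",
)
print(prompt)
```

The resulting string would be sent to a code-capable language model, and the text it generates after the final "Documentation:" label would be taken as the documentation for the target code.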
Pages: 6