Automatic Code Documentation Generation Using GPT-3

被引：29

作者：

Khan, Junaed Younus ^{[1
]}

Uddin, Gias ^{[1
]}

机构：

[1] Univ Calgary, DISA Lab, Calgary, AB, Canada

来源：

PROCEEDINGS OF THE 37TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE 2022 | 2022年

基金：

加拿大自然科学与工程研究理事会;

关键词：

code documentation; GPT-3; Machine Learning;

D O I：

10.1145/3551349.3559548

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Source code documentation is an important artifact for efficient software development. Code documentation could greatly benefit from automation since manual documentation is often labouring, resource and time-intensive. In this paper, we employed Codex for automatic code documentation creation. Codex is a GPT-3 based model pre-trained on both natural and programming languages. We find that Codex outperforms existing techniques even with basic settings like one-shot learning (i.e., providing only one example for training). Codex achieves an overall BLEU score of 20.6 for six different programming languages (11.2% improvement over earlier state-of-the-art techniques). Thus, Codex shows promise and warrants in-depth future studies for automatic code documentation generation to support diverse development tasks.

引用

页数：6

共 50 条

[31] Fine-tuning GPT-3 for legal rule classification
Liga, Davide
Robaldo, Livio
COMPUTER LAW & SECURITY REVIEW, 2023, 51
[32] Evaluating GPT-3 Generated Explanations for Hateful Content Moderation
Wang, Han
Hee, Ming Shan
Awal, Md Rabiul
Choo, Kenny TsuWei
Lee, Roy Ka-Wei
PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 6255 - 6263
[33] Investigating the Potential of GPT-3 in Providing Feedback for Programming Assessments
Balse, Rishabh
Valaboju, Bharath
Singhal, Shreya
Warriem, Jayakrishnan Madathil
Prasad, Prajish
PROCEEDINGS OF THE 2023 CONFERENCE ON INNOVATION AND TECHNOLOGY IN COMPUTER SCIENCE EDUCATION, ITICSE 2023, VOL 1, 2023, : 292 - 298
[34] The diagnostic and triage accuracy of the GPT-3 artificial model: an observational
Levine, David M.
Tuwani, Rudraksh
Kompa, Benjamin
Varma, Amita
Finlayson, Samuel G.
Mehrotra, Ateev
Beam, Andrew
LANCET DIGITAL HEALTH, 2024, 6 (08): : e555 - e561
[35] AUTOMATIC ROMANIAN TEXT GENERATION USING GPT-2
Buzea, Marius Cristian
Trausan-Matu, Stefan
Rebedea, Traian
UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2022, 84 (04): : 15 - 30
[36] Do Natural Language Processing models understand simulations? Application of GPT-3 to translate simulation source code to English
Jackson, Ilya
Rolf, Benjamin
IFAC PAPERSONLINE, 2023, 56 (02): : 221 - 226
[37] Visconde: Multi-document QA with GPT-3 and Neural Reranking
Pereira, Jayr
Fidalgo, Robson
Lotufo, Roberto
Nogueira, Rodrigo
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT II, 2023, 13981 : 534 - 543
[38] GPT-3 vs Object Oriented Programming Assignments: An Experience Report
Cipriano, Bruno Pereira
Alves, Pedro
PROCEEDINGS OF THE 2023 CONFERENCE ON INNOVATION AND TECHNOLOGY IN COMPUTER SCIENCE EDUCATION, ITICSE 2023, VOL 1, 2023, : 61 - 67
[39] Student Voices on GPT-3, Writing Assignments, and the Future College Classroom
Kim, Bada
Robins, Sarah
Huang, Jihui
TEACHING PHILOSOPHY, 2024, 47 (02) : 213 - 231
[40] The Tables Have Turned: GPT-3 Distinguishing Passwords from Honeywords
Chakraborty, Nilesh
Yamout, Youssef
Zulkernine, Mohammad
2023 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY, CNS, 2023,

← 1 2 3 4 5 →