Action Word Prediction for Neural Source Code Summarization

被引:15
|
作者
Haque, Sakib [1 ]
Bansal, Aakash [1 ]
Wu, Lingfei [2 ]
McMillan, Collin [1 ]
机构
[1] Univ Notre Dame, Dept Comp Sci, Notre Dame, IN 46556 USA
[2] IBM Res, Yorktown Hts, NY USA
关键词
neural networks; source code summarization; automatic documentation generation; AI in SE; GENERATION;
D O I
10.1109/SANER50967.2021.00038
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Source code summarization is the task of creating short, natural language descriptions of source code. Code summarization is the backbone of much software documentation such as JavaDocs, in which very brief comments such as "adds the customer object" help programmers quickly understand a snippet of code. In recent years, automatic code summarization has become a high value target of research, with approaches based on neural networks making rapid progress. However, as we will show in this paper, the production of good summaries relies on the production of the action word in those summaries: the meaning of the example above would be completely changed if "removes" were substituted for "adds." In this paper, we advocate for a special emphasis on action word prediction as an important stepping stone problem towards better code summarization - current techniques try to predict the action word along with the whole summary, and yet action word prediction on its own is quite difficult. We show the value of the problem for code summaries, explore the performance of current baselines, and provide recommendations for future research.
引用
下载
收藏
页码:330 / 341
页数:12
相关论文
共 50 条
  • [21] Recommendations for Datasets for Source Code Summarization
    LeClair, Alex
    McMillan, Collin
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 3931 - 3937
  • [22] Pyramid Attention For Source Code Summarization
    Chai, Lei
    Li, Ming
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [23] Distilled GPT for source code summarization
    Su, Chia-Yi
    McMillan, Collin
    AUTOMATED SOFTWARE ENGINEERING, 2024, 31 (01)
  • [24] Recommendations for Datasets for Source Code Summarization
    LeClair, Alex
    McMillan, Collin
    arXiv, 2019,
  • [25] A Survey of Automatic Source Code Summarization
    Zhang, Chunyan
    Wang, Junchao
    Zhou, Qinglei
    Xu, Ting
    Tang, Ke
    Gui, Hairen
    Liu, Fudong
    SYMMETRY-BASEL, 2022, 14 (03):
  • [26] A Neural-Network based Code Summarization Approach by Using Source Code and its Call Dependencies
    Liu, Bohong
    Wang, Tao
    Zhang, Xunhui
    Fan, Qiang
    Yin, Gang
    Deng, Jinsheng
    11TH ASIA-PACIFIC SYMPOSIUM ON INTERNETWARE (INTERNETWARE 2019), 2019,
  • [27] Graph Neural Network for Source Code Defect Prediction
    Sikic, Lucija
    Kurdija, Adrian Satja
    Vladimir, Klemo
    Silic, Marin
    IEEE Access, 2022, 10 : 10402 - 10415
  • [28] Graph Neural Network for Source Code Defect Prediction
    Sikic, Lucija
    Kurdija, Adrian Satja
    Vladimir, Klemo
    Silic, Marin
    IEEE ACCESS, 2022, 10 : 10402 - 10415
  • [29] Towards Modeling Human Attention from Eye Movements for Neural Source Code Summarization
    Bansal A.
    Sharif B.
    McMillan C.
    Proceedings of the ACM on Human-Computer Interaction, 2023, 7 (ETRA)
  • [30] Code Structure-Guided Transformer for Source Code Summarization
    Gao, Shuzheng
    Gao, Cuiyun
    He, Yulan
    Zeng, Jichuan
    Nie, Lunyiu
    Xia, Xin
    Lyu, Michael
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (01)