Action Word Prediction for Neural Source Code Summarization

Cited by: 15
Authors
Haque, Sakib [1]
Bansal, Aakash [1]
Wu, Lingfei [2]
McMillan, Collin [1]
Affiliations
[1] Univ Notre Dame, Dept Comp Sci, Notre Dame, IN 46556 USA
[2] IBM Res, Yorktown Hts, NY USA
Keywords
neural networks; source code summarization; automatic documentation generation; AI in SE; GENERATION;
DOI
10.1109/SANER50967.2021.00038
Chinese Library Classification (CLC) number
TP31 [Computer Software];
Discipline classification code
081202; 0835
Abstract
Source code summarization is the task of creating short, natural language descriptions of source code. Code summarization is the backbone of much software documentation such as JavaDocs, in which very brief comments such as "adds the customer object" help programmers quickly understand a snippet of code. In recent years, automatic code summarization has become a high value target of research, with approaches based on neural networks making rapid progress. However, as we will show in this paper, the production of good summaries relies on the production of the action word in those summaries: the meaning of the example above would be completely changed if "removes" were substituted for "adds." In this paper, we advocate for a special emphasis on action word prediction as an important stepping stone problem towards better code summarization - current techniques try to predict the action word along with the whole summary, and yet action word prediction on its own is quite difficult. We show the value of the problem for code summaries, explore the performance of current baselines, and provide recommendations for future research.
Pages: 330-341
Number of pages: 12
Related papers
50 in total
  • [1] Action word prediction for neural source code summarization
    Haque, Sakib
    Bansal, Aakash
    Wu, Lingfei
    McMillan, Collin
    [J]. arXiv, 2021,
  • [2] Enhancing code summarization with action word prediction
    Li, Mingchen
    Yu, Huiqun
    Fan, Guisheng
    Zhou, Ziyi
    Huang, Zijie
    [J]. NEUROCOMPUTING, 2024, 563
  • [3] Enhancing code summarization with action word prediction
    Li, Mingchen
    Yu, Huiqun
    Fan, Guisheng
    Zhou, Ziyi
    Huang, Zijie
    [J]. Neurocomputing, 2024, 563
  • [4] A Neural Framework for Retrieval and Summarization of Source Code
    Chen, Qingying
    Zhou, Minghui
    [J]. PROCEEDINGS OF THE 2018 33RD IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE '18), 2018, : 826 - 831
  • [5] Semantic similarity loss for neural source code summarization
    Su, Chia-Yi
    McMillan, Collin
    [J]. JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2024,
  • [6] Ensemble Models for Neural Source Code Summarization of Subroutines
    LeClair, Alexander
    Bansal, Aakash
    McMillan, Collin
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2021), 2021, : 286 - 297
  • [7] Retrieval-based Neural Source Code Summarization
    Zhang, Jian
    Wang, Xu
    Zhang, Hongyu
    Sun, Hailong
    Liu, Xudong
    [J]. 2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2020), 2020, : 1385 - 1397
  • [8] Ensemble models for neural source code summarization of subroutines
    LeClair, Alexander
    Bansal, Aakash
    McMillan, Collin
    [J]. arXiv, 2021,
  • [9] Label Smoothing Improves Neural Source Code Summarization
    Haque, Sakib
    Bansal, Aakash
    McMillan, Collin
    [J]. 2023 IEEE/ACM 31ST INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION, ICPC, 2023, : 101 - 112
  • [10] Bi-LSTM-Based Neural Source Code Summarization
    Aljumah, Sarah
    Berriche, Lamia
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (24):