Project-Level Encoding for Neural Source Code Summarization of Subroutines

被引:19
|
作者
Bansal, Aakash [1 ]
Haque, Sakib [1 ]
McMillan, Collin [1 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
关键词
source code summarization; automatic documentation generation; neural networks; PROGRAM COMPREHENSION;
D O I
10.1109/ICPC52881.2021.00032
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Source code summarization of a subroutine is the task of writing a short, natural language description of that subroutine. The description usually serves in documentation aimed at programmers, where even brief phrase (e.g. "compresses data to a zip file") can help readers rapidly comprehend what a subroutine does without resorting to reading the code itself. Techniques based on neural networks (and encoder-decoder model designs in particular) have established themselves as the state-of-the-art. Yet a problem widely recognized with these models is that they assume the information needed to create a summary is present within the code being summarized itself - an assumption which is at odds with program comprehension literature. Thus a current research frontier lies in the question of encoding source code context into neural models of summarization. In this paper, we present a project-level encoder to improve models of code summarization. By project-level, we mean that we create a vectorized representation of selected code files in a software project, and use that representation to augment the encoder of state-of-the-art neural code summarization techniques. We demonstrate how our encoder improves several existing models, and provide guidelines for maximizing improvement while controlling time and resource costs in model size.
引用
收藏
页码:253 / 264
页数:12
相关论文
共 50 条
  • [31] INTEGRATED MODEL FOR PROJECT-LEVEL MANAGEMENT OF FLEXIBLE PAVEMENTS
    RADA, GR
    PERL, J
    WITCZAK, MW
    JOURNAL OF TRANSPORTATION ENGINEERING-ASCE, 1986, 112 (04): : 381 - 399
  • [32] Towards Modeling Human Attention from Eye Movements for Neural Source Code Summarization
    Bansal A.
    Sharif B.
    McMillan C.
    Proceedings of the ACM on Human-Computer Interaction, 2023, 7 (ETRA)
  • [33] Survey of screening procedures for project-level conformity analyses
    Houk, J
    Claggett, M
    ENERGY AND ENVIRONMENTAL CONCERNS 2004, 2004, (1880): : 50 - 58
  • [34] Code Structure-Guided Transformer for Source Code Summarization
    Gao, Shuzheng
    Gao, Cuiyun
    He, Yulan
    Zeng, Jichuan
    Nie, Lunyiu
    Xia, Xin
    Lyu, Michael
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (01)
  • [35] Florida DOT Project-Level Bridge Management Models
    Thompson, Paul D.
    Sobanjo, John O.
    Kerr, Richard
    JOURNAL OF BRIDGE ENGINEERING, 2003, 8 (06) : 345 - 352
  • [36] Contextual Information Enhanced Source Code Summarization
    Hu T.-X.
    Xie R.
    Ye W.
    Zhang S.-K.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (04): : 1695 - 1710
  • [37] Keyword-Based Source Code Summarization
    Zhang S.
    Xie R.
    Ye W.
    Hen L.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (09): : 1987 - 2000
  • [38] An data augmentation method for source code summarization
    Song, Zixuan
    Zeng, Hui
    Shang, Xiuwei
    Li, Guanxi
    Li, Hui
    Guo, Shikai
    NEUROCOMPUTING, 2023, 549
  • [39] Supporting software documentation with source code summarization
    Al-Msie'deen, Ra'Fat
    Blasi, Anas H.
    INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2019, 6 (01): : 59 - 67
  • [40] Entity Based Source Code Summarization (EBSCS)
    Babu, Chitti K.
    Kavitha, C.
    SankarRam, N.
    2016 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2016,