Automatic Source Code Summarization with Extended Tree-LSTM

被引:0
|
作者
Shido, Yusuke [1 ]
Kobayashi, Yasuaki [1 ]
Yamamoto, Akihiro [1 ]
Miyamoto, Atsushi [2 ]
Matsumura, Tadayuki [2 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Kyoto, Japan
[2] Hitachi Ltd, Ctr Exploratory Res, Saitama, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural machine translation models are used to automatically generate a document from given source code since this can be regarded as a machine translation task. Source code summarization is one of the components for automatic document generation, which generates a summary in natural language from given source code. This suggests that techniques used in neural machine translation, such as Long Short-Term Memory (LSTM), can be used for source code summarization. However, there is a considerable difference between source code and natural language: Source code is essentially structured, having loops and conditional branching, etc. Therefore, there is some obstacle to apply known machine translation models to source code. Abstract syntax trees (ASTs) capture these structural properties and play an important role in recent machine learning studies on source code. Tree-LSTM is proposed as a generalization of LSTMs for tree-structured data. However, there is a critical issue when applying it to ASTs: It cannot handle a tree that contains nodes having an arbitrary number of children and their order simultaneously, which ASTs generally have such nodes. To address this issue, we propose an extension of Tree-LSTM, which we call Multi-way Tree-LSTM and apply it for source code summarization. As a result of computational experiments, our proposal achieved better results when compared with several state-of-the-art techniques.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Improving Tree-LSTM with Tree Attention
    Ahmed, Mahtab
    Samee, Muhammad Rifayat
    Mercer, Robert E.
    [J]. 2019 13TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2019, : 247 - 254
  • [2] A Survey of Automatic Source Code Summarization
    Zhang, Chunyan
    Wang, Junchao
    Zhou, Qinglei
    Xu, Ting
    Tang, Ke
    Gui, Hairen
    Liu, Fudong
    [J]. SYMMETRY-BASEL, 2022, 14 (03):
  • [3] Bi-LSTM-Based Neural Source Code Summarization
    Aljumah, Sarah
    Berriche, Lamia
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (24):
  • [4] Automatic source code summarization with graph attention networks
    Zhou, Yu
    Shen, Juanjuan
    Zhang, Xiaoqing
    Yang, Wenhua
    Han, Tingting
    Chen, Taolue
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2022, 188
  • [5] Automatic Documentation Generation via Source Code Summarization
    McBurney, Paul W.
    [J]. 2015 IEEE/ACM 37TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, VOL 2, 2015, : 903 - 906
  • [6] Reinforcement Learning with Tree-LSTM for Join Order Selection
    Yu, Xiang
    Li, Guoliang
    Chai, Chengliang
    Tang, Nan
    [J]. 2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 1297 - 1308
  • [7] Automatic Source Code Summarization of Context for Java']Java Methods
    McBurney, Paul W.
    McMillan, Collin
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2016, 42 (02) : 103 - 119
  • [8] On Automatic Summarization of What and Why Information in Source Code Changes
    Shen, Jinfeng
    Sun, Xiaobing
    Li, Bin
    Yang, Hui
    Hu, Jiajun
    [J]. PROCEEDINGS 2016 IEEE 40TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS, VOL 1, 2016, : 103 - 112
  • [9] Neural machine translation with Gumbel Tree-LSTM based encoder
    Su, Chao
    Huang, Heyan
    Shi, Shumin
    Jian, Ping
    Shi, Xuewen
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 71
  • [10] 改进Tree-LSTM网络的情感分析方法
    邹东尧
    王斌
    王丽萍
    [J]. 现代电子技术, 2022, (09) : 66 - 71