Survey on Neural Network-based Automatic Source Code Summarization Technologies

Authors
Song X.-T. [1]
Sun H.-L. [2, 3, 4]
Affiliations
[1] College of Software, Taiyuan University of Technology, Taiyuan
[2] State Key Laboratory of Software Development Environment (Beihang University), Beijing
[3] School of Software, Beihang University, Beijing
[4] Beijing Advanced Innovation Center for Big Data and Brain Computing (Beihang University), Beijing
Source
Ruan Jian Xue Bao/Journal of Software | 2022, Vol. 33, No. 1
Funding
National Natural Science Foundation of China
Keywords
Deep learning; Intelligent software engineering; Neural network; Program comprehension; Source code summarization;
DOI
10.13328/j.cnki.jos.006337
Abstract
Source code summaries help software developers comprehend programs faster and better, and help maintenance developers complete their tasks efficiently. Because writing summaries by hand is costly and inefficient, researchers have sought to generate them automatically. In recent years, neural network-based techniques have become the mainstream approach to automatic source code summarization and a hot research topic in intelligent software engineering. This paper first describes the concept of a source code summary and defines automatic source code summarization, traces the development history of the field, and reviews the methods and metrics used to evaluate the quality of generated summaries. It then analyzes the general framework and main challenges of neural network-based automatic code summarization algorithms. Next, it classifies representative algorithms and discusses the design principles, characteristics, and limitations of each category. Finally, it discusses future trends in neural network-based source code summarization techniques. © Copyright 2022, Institute of Software, the Chinese Academy of Sciences. All rights reserved.
Pages: 55-77
Page count: 22