Bug localization based on syntactical and semantic information of source code

被引:0
|
作者
YAN Xuefeng [1 ,2 ]
CHENG Shasha [1 ]
GUO Liqin [3 ]
机构
[1] College of Computer Science Technology, Nanjing University of Aeronautics and Astronautics
[2] Collaborative Innovation Center of Novel Software Technology and Industrialization
[3] State Key Laboratory of Intelligent Manufacturing System Technology, Beijing Institute of Electronic System Engineering
基金
国家重点研发计划;
关键词
bug report; abstract syntax tree; code representation; software bug localization;
D O I
暂无
中图分类号
TP311.5 [软件工程];
学科分类号
081202 ; 0835 ;
摘要
The existing software bug localization models treat the source file as natural language, which leads to the loss of syntactical and structure information of the source file. A bug localization model based on syntactical and semantic information of source code is proposed. Firstly, abstract syntax tree(AST) is divided based on node category to obtain statement sequence. The statement tree is encoded into vectors to capture lexical and syntactical knowledge at the statement level.Secondly, the source code is transformed into vector representation by the sequence naturalness of the statement. Therefore,the problem of gradient vanishing and explosion caused by a large AST size is obviated when using AST to the represent source code. Finally, the correlation between bug reports and source files are comprehensively analyzed from three aspects of syntax, semantics and text to locate the buggy code. Experiments show that compared with other standard models, the proposed model improves the performance of bug localization, and it has good advantages in mean reciprocal rank(MRR), mean average precision(MAP) and Top N Rank.
引用
收藏
页码:236 / 246
页数:11
相关论文
共 50 条
  • [31] Bug detection in source code in process monitoring
    Liu, Ying
    Zhang, Lingling
    Zhang, Yuehua
    Shi, Yong
    Journal of Computational Information Systems, 2012, 8 (02): : 591 - 601
  • [32] Structured information in bug report descriptions—influence on IR-based bug localization and developers
    Michael Rath
    Patrick Mäder
    Software Quality Journal, 2019, 27 : 1315 - 1337
  • [33] A SOURCE CODE CONTROL-SYSTEM BASED ON SEMANTIC NETS
    INCE, DC
    SOFTWARE-PRACTICE & EXPERIENCE, 1984, 14 (12): : 1159 - 1168
  • [34] Precise and Scalable Querying of Syntactical Source Code Patterns Using Sample Code Snippets and a Database
    Panchenko, Oleksandr
    Karstens, Jan
    Plattner, Hasso
    Zeier, Alexander
    2011 IEEE 19TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC), 2011, : 41 - 50
  • [35] SOURCE CODE CONTROL SYSTEM BASED ON SEMANTIC NETS.
    Ince, D.C.
    Software - Practice and Experience, 1984, 14 (12) : 1159 - 1168
  • [36] How Does Execution Information Help with Information-Retrieval Based Bug Localization?
    Dao, Tung
    Zhang, Lingming
    Meng, Na
    2017 IEEE/ACM 25TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC), 2017, : 241 - 250
  • [37] The flowing nature matters: feature learning from the control flow graph of source code for bug localization
    Ma, Yi-Fan
    Li, Ming
    MACHINE LEARNING, 2022, 111 (03) : 853 - 870
  • [38] Information Retrieval Based Bug Localization: Research Problem, Progress, and Challenges
    Guo Z.-Q.
    Zhou H.-C.
    Liu S.-R.
    Li Y.-H.
    Chen L.
    Zhou Y.-M.
    Xu B.-W.
    Li, Yan-Hui (yanhuili@nju.edu.cn); Zhou, Yu-Ming (zhouyuming@nju.edu.cn), 1600, Chinese Academy of Sciences (31): : 2826 - 2854
  • [39] The flowing nature matters: feature learning from the control flow graph of source code for bug localization
    Yi-Fan Ma
    Ming Li
    Machine Learning, 2022, 111 : 853 - 870
  • [40] Research Progress on Software Bug Localization Technology Based on Information Retrieval
    Zhang Y.
    Liu J.-K.
    Xia X.
    Wu M.-H.
    Yan H.
    Xia, Xin (Xin.Xia@monash.edu), 1600, Chinese Academy of Sciences (31): : 2432 - 2452