Bug localization based on syntactical and semantic information of source code

Cited by: 0
Authors
YAN Xuefeng [1, 2]
CHENG Shasha [1]
GUO Liqin [3]
Affiliations
[1] College of Computer Science Technology, Nanjing University of Aeronautics and Astronautics
[2] Collaborative Innovation Center of Novel Software Technology and Industrialization
[3] State Key Laboratory of Intelligent Manufacturing System Technology, Beijing Institute of Electronic System Engineering
Funding
National Key Research and Development Program of China
Keywords
bug report; abstract syntax tree; code representation; software bug localization
DOI
Not available
Chinese Library Classification number
TP311.5 [Software Engineering]
Discipline classification codes
081202; 0835
Abstract
Existing software bug localization models treat the source file as natural language, which loses the syntactic and structural information of the source file. A bug localization model based on the syntactic and semantic information of source code is proposed. First, the abstract syntax tree (AST) is divided by node category to obtain a statement sequence. Each statement tree is encoded into a vector to capture lexical and syntactic knowledge at the statement level. Second, the source code is transformed into a vector representation by exploiting the sequence naturalness of the statements. This avoids the vanishing and exploding gradient problems caused by a large AST when the AST is used to represent source code. Finally, the correlation between bug reports and source files is analyzed comprehensively from three aspects (syntax, semantics and text) to locate the buggy code. Experiments show that, compared with other standard models, the proposed model improves bug localization performance as measured by mean reciprocal rank (MRR), mean average precision (MAP) and Top N Rank.
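The three evaluation measures named in the abstract can be illustrated with a small sketch. This is not the authors' evaluation code: the function names, the file names and the two toy bug reports are hypothetical. It also assumes each ranking covers every candidate source file, so average precision is normalized by the number of truly buggy files.

```python
from typing import Dict, List, Set, Tuple


def reciprocal_rank(ranked: List[str], buggy: Set[str]) -> float:
    """Return 1 / rank of the first truly buggy file, or 0.0 if none appears."""
    for rank, path in enumerate(ranked, start=1):
        if path in buggy:
            return 1.0 / rank
    return 0.0


def average_precision(ranked: List[str], buggy: Set[str]) -> float:
    """Average the precision values taken at the rank of each buggy file."""
    hits = 0
    precision_sum = 0.0
    for rank, path in enumerate(ranked, start=1):
        if path in buggy:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / len(buggy) if buggy else 0.0


def top_n_hit(ranked: List[str], buggy: Set[str], n: int) -> bool:
    """True if at least one buggy file is among the first n ranked files."""
    return any(path in buggy for path in ranked[:n])


def evaluate(results: List[Tuple[List[str], Set[str]]], n: int) -> Dict[str, float]:
    """Average the per-report scores over all bug reports."""
    count = len(results)
    return {
        "MRR": sum(reciprocal_rank(r, b) for r, b in results) / count,
        "MAP": sum(average_precision(r, b) for r, b in results) / count,
        "TopN": sum(top_n_hit(r, b, n) for r, b in results) / count,
    }


# Hypothetical data: (ranked source files, set of truly buggy files) per bug report.
toy_results = [
    (["A.java", "B.java", "C.java"], {"B.java"}),
    (["X.java", "Y.java", "Z.java"], {"X.java", "Z.java"}),
]
scores = evaluate(toy_results, n=1)
print(scores)
```

For the toy data, the first report contributes a reciprocal rank of 1/2 and the second a reciprocal rank of 1, so MRR is 0.75. Only the second report has a buggy file at rank 1, so Top 1 is 0.5.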
Pages: 236-246
Page count: 11