Adaptive Ranking Relevant Source Files for Bug Reports Using Genetic Algorithm

Authors
Thi Mai Anh Bui [1]
Nhat Hai Nguyen [1]
Affiliations
[1] Hanoi Univ Sci & Technol, Sch Informat & Commun Technol, Hanoi, Vietnam
Keywords
Bug localization; Genetic algorithm; bug report; semantic features; lexical features; LOCALIZATION;
DOI
10.3233/FAIA210042
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Precisely locating buggy files for a given bug report is a cumbersome and time-consuming task, particularly in a large-scale project with thousands of source files and bug reports. An efficient bug localization module is desirable to improve productivity in the software maintenance phase. Many previous approaches rank source files according to their relevance to a given bug report based on simple lexical matching scores. However, lexical mismatches between the natural language used in bug reports and the technical terms in source code can reduce the accuracy of a bug localization system. Incorporating domain knowledge through features such as semantic similarity, the fixing frequency of a source file, the code change history, and similar bug reports is crucial to locating buggy files efficiently. In this paper, we propose a bug localization model, BugLocGA, that leverages both lexical and semantic information and explores the relation between a bug report and a source file through domain features. Given a bug report, we calculate a ranking score for each source file as a weighted sum of all features, where the weights are trained by a genetic algorithm that aims to maximize the performance of the bug localization model on two evaluation metrics: mean reciprocal rank (MRR) and mean average precision (MAP). Empirical results on several widely used open-source software projects show that our model outperforms some state-of-the-art approaches by effectively recommending the relevant files where the bug should be fixed.
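The scoring and training idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the fitness function (MRR plus MAP), the selection, crossover and mutation operators, the two-feature toy data, and all function names are assumptions made only for illustration.

```python
import random

def rank_files(features, weights):
    """Rank files by the weighted sum of their feature scores, best first."""
    scores = {f: sum(w * x for w, x in zip(weights, xs))
              for f, xs in features.items()}
    # Ties are broken by file name so the ranking is deterministic.
    return sorted(scores, key=lambda f: (-scores[f], f))

def reciprocal_rank(ranking, relevant):
    """1 / position of the first relevant file, or 0 if none is ranked."""
    for i, f in enumerate(ranking, start=1):
        if f in relevant:
            return 1.0 / i
    return 0.0

def average_precision(ranking, relevant):
    """Mean of precision@k over the positions k holding a relevant file."""
    hits, total = 0, 0.0
    for i, f in enumerate(ranking, start=1):
        if f in relevant:
            hits += 1
            total += hits / i
    return total / len(relevant) if relevant else 0.0

def fitness(weights, reports):
    """Assumed fitness: MRR + MAP over (features, relevant_files) pairs."""
    ranked = [(rank_files(feats, weights), rel) for feats, rel in reports]
    mrr = sum(reciprocal_rank(r, rel) for r, rel in ranked) / len(ranked)
    mean_ap = sum(average_precision(r, rel) for r, rel in ranked) / len(ranked)
    return mrr + mean_ap

def evolve_weights(reports, n_features, pop_size=20, generations=30, seed=0):
    """Toy genetic algorithm: truncation selection, one-point crossover,
    Gaussian mutation of one gene. All operators are illustrative choices."""
    rng = random.Random(seed)
    pop = [[rng.random() for _ in range(n_features)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda w: fitness(w, reports), reverse=True)
        parents = pop[: pop_size // 2]  # the best half survives unchanged
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_features) if n_features > 1 else 0
            child = a[:cut] + b[cut:]
            i = rng.randrange(n_features)
            child[i] = min(1.0, max(0.0, child[i] + rng.gauss(0, 0.1)))
            children.append(child)
        pop = parents + children
    return max(pop, key=lambda w: fitness(w, reports))

# Hypothetical data: each file has [lexical_score, semantic_score].
reports = [
    ({"A.java": [0.9, 0.1], "B.java": [0.2, 0.8], "C.java": [0.1, 0.1]},
     {"B.java"}),
    ({"A.java": [0.3, 0.7], "B.java": [0.8, 0.2], "C.java": [0.5, 0.1]},
     {"A.java"}),
]
best = evolve_weights(reports, n_features=2)
print(best, fitness(best, reports))
```

In this toy data the second feature is the one that separates buggy files from the rest, so the search should settle on weights that favor it. A real setup would use the domain features named in the abstract and hold out bug reports for evaluation.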
Pages: 430-443
Number of pages: 14
Related papers
50 in total
  • [1] Locating Relevant Source Files for Bug Reports using Textual Analysis
    Gharibi, Reza
    Rasekh, Amir Hossein
    Sadreddini, Mohammad Hadi
    2017 18TH CSI INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING CONFERENCE (CSSE), 2017, : 67 - 72
  • [2] Leveraging textual properties of bug reports to localize relevant source files
    Gharibi, Reza
    Rasekh, Amir Hossein
    Sadreddini, Mohammad Hadi
    Fakhrahmad, Seyed Mostafa
    INFORMATION PROCESSING & MANAGEMENT, 2018, 54 (06) : 1058 - 1076
  • [3] Learning to Rank Relevant Files for Bug Reports using Domain Knowledge
    Ye, Xin
    Bunescu, Razvan
    Liu, Chang
    22ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (FSE 2014), 2014, : 689 - 699
  • [4] Learning to rank relevant files for bug reports using domain knowledge
    Ye, Xin
    Bunescu, Razvan
    Liu, Chang
    Proceedings of the ACM SIGSOFT Symposium on the Foundations of Software Engineering, 2014, 16-21-November-2014 : 689 - 699
  • [5] Using a Distributed Representation of Words in Localizing Relevant Files for Bug Reports
    Uneno, Yukiya
    Mizuno, Osamu
    Choi, Eun-Hye
    2016 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2016), 2016, : 183 - 190
  • [6] Mapping Bug Reports to Relevant Files: A Ranking Model, a Fine-Grained Benchmark, and Feature Evaluation
    Ye, Xin
    Bunescu, Razvan
    Liu, Chang
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2016, 42 (04) : 379 - 402
  • [7] Mapping Bug Reports to Relevant Source Code Files Based on the Vector Space Model and Word Embedding
    Liu, Guangliang
    Lu, Yang
    Shi, Ke
    Chang, Jingfei
    Wei, Xing
    IEEE ACCESS, 2019, 7 : 78870 - 78881
  • [8] Learning to rank faulty source files for dependent bug reports
    Safdari, Nasir
    Alrubaye, Hussein
    Aljedaani, Wajdi
    Baez, Bladimir Baez
    DiStasi, Andrew
    Mkaouer, Mohamed Wiem
    BIG DATA: LEARNING, ANALYTICS, AND APPLICATIONS, 2019, 10989
  • [9] Cost-Aware Clustering of Bug Reports by Using a Genetic Algorithm
    Lee, Jaekwon
    Kim, Dongsun
    Jung, Woosung
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2019, 35 (01) : 175 - 200
  • [10] Tracking Buggy Files: New Efficient Adaptive Bug Localization Algorithm
    Fejzer, Mikolaj
    Narebski, Jakub
    Przymus, Piotr
    Stencel, Krzysztof
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2021, 48 (07) : 2557 - 2569