Locating Relevant Source Files for Bug Reports using Textual Analysis

被引:0
|
作者
Gharibi, Reza [1 ]
Rasekh, Amir Hossein [1 ]
Sadreddini, Mohammad Hadi [1 ]
机构
[1] Shiraz Univ, Dept Comp Sci & Engn & IT, Shiraz, Iran
关键词
bug localization; information retrieval; bug report; classification; textual analysis; LOCALIZATION; RETRIEVAL; CODE;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Bug reports are an important part of software project's life cycle since they help improve the software's quality. However, in well-known systems, the huge number of bug reports make it difficult for the developer team to efficiently locate the bug and then assign it to be fixed. To solve this issue, various bug localization techniques have been proposed to rank all the source files of a project with respect to how likely they are to contain a bug. This makes the source files' search space smaller and helps developers to find relevant source files quicker. In this paper, we propose a three component bug localization approach which leverages different textual properties of bug reports and source files as well as the relations between previously fixed bug reports and a newly received one. Our approach uses information retrieval, textual matching, and multi-label classification to improve the performance of bug localization. We evaluate our approach on two open source software projects (i.e. SWT and ZXing) to examine its performance. Experimental results show that our approach can rank appropriate source files for more than 80% of bugs in top 10 for these projects and also improve the MRR and MAP values compared to two existing bug localization tools, BugLocator and BLUiR.
引用
收藏
页码:67 / 72
页数:6
相关论文
共 50 条
  • [1] Leveraging textual properties of bug reports to localize relevant source files
    Gharibi, Reza
    Rasekh, Amir Hossein
    Sadreddini, Mohammad Hadi
    Fakhrahmad, Seyed Mostafa
    INFORMATION PROCESSING & MANAGEMENT, 2018, 54 (06) : 1058 - 1076
  • [2] Adaptive Ranking Relevant Source Files for Bug Reports Using Genetic Algorithm
    Thi Mai Anh Bui
    Nhat Hai Nguyen
    NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES, 2021, 337 : 430 - 443
  • [3] Learning to Rank Relevant Files for Bug Reports using Domain Knowledge
    Ye, Xin
    Bunescu, Razvan
    Liu, Chang
    22ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (FSE 2014), 2014, : 689 - 699
  • [4] Learning to rank relevant files for bug reports using domain knowledge
    Ye, Xin
    Bunescu, Razvan
    Liu, Chang
    Proceedings of the ACM SIGSOFT Symposium on the Foundations of Software Engineering, 2014, 16-21-November-2014 : 689 - 699
  • [5] Using a Distributed Representation of Words in Localizing Relevant Files for Bug Reports
    Uneno, Yukiya
    Mizuno, Osamu
    Choi, Eun-Hye
    2016 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2016), 2016, : 183 - 190
  • [6] Textual Analysis of Security Bug Reports
    Peeples, Cody R.
    Rotella, Pete
    McLaughlin, Mark-David
    2017 IEEE INTERNATIONAL SYMPOSIUM ON TECHNOLOGIES FOR HOMELAND SECURITY (HST), 2017,
  • [7] Convolutional Neural Networks-Based Locating Relevant Buggy Code Files for Bug Reports Affected by Data Imbalance
    Liu, Guangliang
    Lu, Yang
    Shi, Ke
    Chang, Jingfei
    Wei, Xing
    IEEE ACCESS, 2019, 7 : 131304 - 131316
  • [8] Mapping Bug Reports to Relevant Source Code Files Based on the Vector Space Model and Word Embedding
    Liu, Guangliang
    Lu, Yang
    Shi, Ke
    Chang, Jingfei
    Wei, Xing
    IEEE ACCESS, 2019, 7 : 78870 - 78881
  • [9] Learning to rank faulty source files for dependent bug reports
    Safdari, Nasir
    Alrubaye, Hussein
    Aljedaani, Wajdi
    Baez, Bladimir Baez
    DiStasi, Andrew
    Mkaouer, Mohamed Wiem
    BIG DATA: LEARNING, ANALYTICS, AND APPLICATIONS, 2019, 10989
  • [10] Using textual bug reports to predict the fault category of software bugs
    Hirsch, Thomas
    Hofer, Birgit
    ARRAY, 2022, 15