Predicting Effectiveness of IR-Based Bug Localization Techniques

被引:16
|
作者
Le, Tien-Duy B. [1 ]
Thung, Ferdian [1 ]
Lo, David [1 ]
机构
[1] Singapore Management Univ, Sch Informat Syst, Singapore, Singapore
关键词
D O I
10.1109/ISSRE.2014.39
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recently, many information retrieval (IR) based bug localization approaches have been proposed in the literature. These approaches use information retrieval techniques to process a textual bug report and a collection of source code files to find buggy files. They output a ranked list of files sorted by their likelihood to contain the bug. Recent approaches can achieve reasonable accuracy, however, even a state-of-the-art bug localization tool outputs many ranked lists where buggy files appear very low in the lists. This potentially causes developers to distrust bug localization tools. Parnin and Orso recently conduct a user study and highlight that developers do not find an automated debugging tool useful if they do not find the root cause of a bug early in a ranked list. To address this problem, we build an oracle that can automatically predict whether a ranked list produced by an IR-based bug localization tool is likely to be effective or not. We consider a ranked list to be effective if a buggy file appears in the top-N position of the list. If a ranked list is unlikely to be effective, developers do not need to waste time in checking the recommended files one by one. In such cases, it is better for developers to use traditional debugging methods or request for further information to localize bugs. To build this oracle, our approach extracts features that can be divided into four categories: score features, textual features, topic model features, and metadata features. We build a separate prediction model for each category, and combine them to create a composite prediction model which is used as the oracle. We name our proposed approach APRILE, which stands for Automated PRediction of IR-based Bug Localization's Effectiveness. We have evaluated APRILE to predict the effectiveness of three state-of-the-art IR-based bug localization tools on more than three thousands bug reports from AspectJ, Eclipse, and SWT. APRILE can achieve an average precision, recall, and F-measure of at least 70.36%, 66.94%, and 68.03%, respectively. Furthermore, APRILE outperforms a baseline approach by 84.48%, 17.74%, and 31.56% for the AspectJ, Eclipse, and SWT bug reports, respectively.
引用
收藏
页码:335 / 345
页数:11
相关论文
共 50 条
  • [1] Influence of Structured Information in Bug Report Descriptions on IR-based Bug Localization
    Rath, Michael
    Maeder, Patrick
    [J]. 44TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA 2018), 2018, : 26 - 32
  • [2] An empirical study of the effectiveness of IR-based bug localization for large-scale industrial projects
    Li, Wei
    Li, Qingan
    Ming, Yunlong
    Dai, Weijiao
    Ying, Shi
    Yuan, Mengting
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2022, 27 (02)
  • [3] An empirical study of the effectiveness of IR-based bug localization for large-scale industrial projects
    Wei Li
    Qingan Li
    Yunlong Ming
    Weijiao Dai
    Shi Ying
    Mengting Yuan
    [J]. Empirical Software Engineering, 2022, 27
  • [4] Structured information in bug report descriptions—influence on IR-based bug localization and developers
    Michael Rath
    Patrick Mäder
    [J]. Software Quality Journal, 2019, 27 : 1315 - 1337
  • [5] A Novel Approach to Automatic Query Reformulation for IR-based Bug Localization
    Kim, Misoo
    Lee, Eunseok
    [J]. SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 1752 - 1759
  • [6] Structured information in bug report descriptions-influence on IR-based bug localization and developers
    Rath, Michael
    Maeder, Patrick
    [J]. SOFTWARE QUALITY JOURNAL, 2019, 27 (03) : 1315 - 1337
  • [7] The forgotten role of search queries in IR-based bug localization: an empirical study
    Mohammad Masudur Rahman
    Foutse Khomh
    Shamima Yeasmin
    Chanchal K. Roy
    [J]. Empirical Software Engineering, 2021, 26
  • [8] Improving IR-Based Bug Localization with Context-Aware Query Reformulation
    Rahman, Mohammad Masudur
    Roy, Chanchal K.
    [J]. ESEC/FSE'18: PROCEEDINGS OF THE 2018 26TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, 2018, : 621 - 632
  • [9] A Novel Automatic Query Expansion with Word Embedding for IR-based Bug Localization
    Kim, Misoo
    Kim, Youngkyoung
    Lee, Eunseok
    [J]. 2021 IEEE 32ND INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE 2021), 2021, : 276 - 287
  • [10] A Large-Scale Comparative Evaluation of IR-Based Tools for Bug Localization
    Akbar, Shayan A.
    Kak, Avinash C.
    [J]. 2020 IEEE/ACM 17TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES, MSR, 2020, : 21 - 31