An Empirical Study of IR-based Bug Localization for Deep Learning-based Software

被引:1
|
作者
Kim, Misoo [1 ]
Kim, Youngkyoung [2 ]
Lee, Eunseok [3 ]
机构
[1] Sungkyunkwan Univ, Inst Software Convergence, Suwon, South Korea
[2] Sungkyunkwan Univ, Dept Elect & Comp Engn, Suwon, South Korea
[3] Sungkyunkwan Univ, Coll Comp & Informat, Suwon, South Korea
基金
新加坡国家研究基金会;
关键词
Empirical study; Deep learning-related software; Information retrieval-based bug localization; !text type='Python']Python[!/text] bugs; CLASSIFIER CONFIGURATION; IMPACT;
D O I
10.1109/ICST53961.2022.00024
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
As the impact of deep-learning-based software (DLSW) increases, automatic debugging techniques for guaranteeing DLSW quality are becoming increasingly important. Information-retrieval-based bug localization (IRBL) techniques can aid in debugging by automatically localizing buggy entities (tiles and functions). The low-cost advantage of IRBL can alleviate the difficulty of identifying bug locations due to the complexity of DLSW. However, there are significant differences between DI SW and traditional software, and these differences lead to differences in search space and query quality for IRBL. That is, IRBL performance must be validated in DLSW. We empirically validated IRBL performance for DLSW from the following four perspectives: 1) similarity model, 2) query generation, 3) ranking model for buggy file localization, and 4) ranking model for buggy function localization. Based on four research questions and a large-scale experiment using 2,365 bug reports from 136 DLSW projects, we confirmed the salient characteristics of DLSW from the perspective of IRBL and derived four recommendations for practical IRBL usage in DLSW from the empirical results. Regarding IRBL performance, we validated that IRBL performance midi the combination of bug-related features outperformed that of using only file similarity by 15% and IRBL ranked buggy files and functions on average of 1.6th and 2.9th, respectively. Our study is valuable as a baseline for IRBL researchers and as a guideline for DLSW developers who wish to apply IRBL to ensure DLSW quality.
引用
收藏
页码:128 / 139
页数:12
相关论文
共 50 条
  • [1] The forgotten role of search queries in IR-based bug localization: an empirical study
    Mohammad Masudur Rahman
    Foutse Khomh
    Shamima Yeasmin
    Chanchal K. Roy
    [J]. Empirical Software Engineering, 2021, 26
  • [2] The forgotten role of search queries in IR-based bug localization: an empirical study
    Rahman, Mohammad Masudur
    Khomh, Foutse
    Yeasmin, Shamima
    Roy, Chanchal K.
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2021, 26 (06)
  • [3] Deep learning-based software bug classification
    Meher, Jyoti Prakash
    Biswas, Sourav
    Mall, Rajib
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2024, 166
  • [4] An empirical study of the effectiveness of IR-based bug localization for large-scale industrial projects
    Wei Li
    Qingan Li
    Yunlong Ming
    Weijiao Dai
    Shi Ying
    Mengting Yuan
    [J]. Empirical Software Engineering, 2022, 27
  • [5] An empirical study of the effectiveness of IR-based bug localization for large-scale industrial projects
    Li, Wei
    Li, Qingan
    Ming, Yunlong
    Dai, Weijiao
    Ying, Shi
    Yuan, Mengting
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2022, 27 (02)
  • [6] Predicting Effectiveness of IR-Based Bug Localization Techniques
    Le, Tien-Duy B.
    Thung, Ferdian
    Lo, David
    [J]. 2014 IEEE 25TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE), 2014, : 335 - 345
  • [7] Just enough semantics: An information theoretic approach for IR-based software bug localization
    Khatiwada, Saket
    Tushev, Miroslav
    Mahmoud, Anas
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2018, 93 : 45 - 57
  • [8] Study of Information Retrieval and Machine Learning-Based Software Bug Localization Models
    Tamanna
    Sangwan, Om Prakash
    [J]. ADVANCES IN COMPUTING AND INTELLIGENT SYSTEMS, ICACM 2019, 2020, : 503 - 510
  • [9] Influence of Structured Information in Bug Report Descriptions on IR-based Bug Localization
    Rath, Michael
    Maeder, Patrick
    [J]. 44TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA 2018), 2018, : 26 - 32
  • [10] Structured information in bug report descriptions—influence on IR-based bug localization and developers
    Michael Rath
    Patrick Mäder
    [J]. Software Quality Journal, 2019, 27 : 1315 - 1337