Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique

被引:23
|
作者
Kukkar, Ashima [1 ]
Mohana, Rajni [1 ]
Kumar, Yugal [1 ]
Nayyar, Anand [2 ,3 ]
Bilal, Muhammad [4 ]
Kwak, Kyung-Sup [5 ]
机构
[1] Jaypee Univ Informat Technol, Dept Comp Sci, Wakanghat 173234, India
[2] Duy Tan Univ, Grad Sch, Da Nang 550000, Vietnam
[3] Duy Tan Univ, Fac Informat Technol, Da Nang 550000, Vietnam
[4] Hankuk Univ Foreign Studies, Comp & Elect Syst Engn Dept, Seoul 17035, South Korea
[5] Inha Univ, Informat & Commun Engn, Incheon 22212, South Korea
来源
IEEE ACCESS | 2020年 / 8卷 / 08期
基金
新加坡国家研究基金会;
关键词
Computer bugs; Feature extraction; Deep learning; Software; Computational modeling; Natural languages; Manuals; Duplicate bug report detection; Siamese networks; natural language processing; deep learning; bug tracking system; software maintenance; software development; convolutional neural network; software engineering; TRENDS;
D O I
10.1109/ACCESS.2020.3033045
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Duplicate bug report detection is a process of finding a duplicate bug report in the bug tracking system. This process is essential to avoid unnecessary work and rediscovery. In typical bug tracking systems, more than thousands of duplicate bug reports are reported every day. In turn, human cost, effort and time are increased. This makes it an important problem in the software management process. The solution is to automate the duplicate bug report detection system for reducing the manual effort, thus the productivity of triager's and developer's is increased. It also speeds up the process of software management as a result software maintenance cost is also reduced. However, existing systems are not quite accurate yet, in spite of these systems used various machine learning approaches. In this work, an automatic bug report detection and classification model is proposed using deep learning technique. The proposed system has three modules i.e. Preprocessing, Deep Learning Model and Duplicate Bug report Detection and Classification. Further, the proposed model used Convolutional Neural Network based deep learning model to extract relevant feature. These relevant features are used to determine the similar features of bug reports. Hence, the bug reports similarity is computers through these similar features. The performance of the proposed system is evaluated on six publicly available datasets using six performance metrics. It is noticed that the proposed system outperforms the existing systems by achieving an accuracy rate in the range of 85% to 99 % and recall@k rate in between 79%-94%. Moreover, the effectiveness of the proposed system is also measured on the cross training datasets of same and different domain. The proposed system achieves a good high accuracy rate for same domain data sets and low accuracy rate for different domain datasets.
引用
收藏
页码:200749 / 200763
页数:15
相关论文
共 50 条
  • [1] A Novel Technique for Duplicate Detection and Classification of Bug Reports
    Zhang, Tao
    Lee, Byungjeong
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (07): : 1756 - 1768
  • [2] Does Deep Learning improve the performance of duplicate bug report detection? An empirical study?
    Jiang, Yuan
    Su, Xiaohong
    Treude, Christoph
    Shang, Chao
    Wang, Tiantian
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 198
  • [3] A Systematic Study of Duplicate Bug Report Detection
    Gupta, Som
    Gupta, Sanjai Kumar
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (01) : 578 - 589
  • [4] Duplicate Bug Report Detection Using Clustering
    Gopalan, Raj P.
    Krishna, Aneesh
    [J]. 2014 23RD AUSTRALASIAN SOFTWARE ENGINEERING CONFERENCE (ASWEC), 2013, : 104 - 109
  • [5] Reformulating Queries for Duplicate Bug Report Detection
    Chaparro, Oscar
    Florez, Juan Manuel
    Singh, Unnati
    Marcus, Andrian
    [J]. 2019 IEEE 26TH INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING (SANER), 2019, : 218 - 229
  • [6] An HMM-based approach for automatic detection and classification of duplicate bug reports
    Ebrahimi, Neda
    Trabelsi, Abdelaziz
    Islam, Md Shariful
    Hamou-Lhadj, Abdelwahab
    Khanmohammadi, Kobra
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2019, 113 : 98 - 109
  • [7] Adaptive Deep Learning Model for Software Bug Detection and Classification
    Sivapurnima S.
    Manjula D.
    [J]. Computer Systems Science and Engineering, 2023, 45 (02): : 1233 - 1248
  • [8] Duplicate Bug Report Detection: How Far Are We?
    Zhang, Ting
    Han, Donggyun
    Vinayakarao, Venkatesh
    Irsan, Ivana Clairine
    Xu, Bowen
    Thung, Ferdian
    Lo, David
    Jiang, Lingxiao
    [J]. ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (04)
  • [9] Deep learning-based software bug classification
    Meher, Jyoti Prakash
    Biswas, Sourav
    Mall, Rajib
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2024, 166
  • [10] A deep reinforcement learning technique for bug detection in video games
    Rani G.
    Pandey U.
    Wagde A.A.
    Dhaka V.S.
    [J]. International Journal of Information Technology, 2023, 15 (1) : 355 - 367