DENATURE: duplicate detection and type identification in open source bug repositories

被引:1
|
作者
Chauhan, Ruby [1 ]
Sharma, Shakshi [2 ]
Goyal, Anjali [3 ]
机构
[1] NorthCap Univ, Sect 23 A, Gurugram 122017, Haryana, India
[2] Univ Tartu, Tartu, Estonia
[3] Sharda Univ, Sch Engn & Technol, Dept Comp Sci & Engn, Greater Noida, India
关键词
Bug tracking system; Bug reports; Duplicate detection; Bug type identification; Similarity measures; Classification; Information retrieval techniques; CLASSIFICATION; MODEL;
D O I
10.1007/s13198-023-01855-x
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Software projects reckon on the bug tracking systems to guide software maintenance activities. The critical information about the nature of the crash is carried by the bug reports which are submitted to bug repositories. This information is in free form text format and is submitted by users or developers. A large amount of bug reports gets collected in bug repositories. Out of these submitted bugs, many reports are mere identical of the already existing bugs. Furthermore, not all non-duplicate bugs are reproducible in nature. This paper introduces DENATURE, a two step framework for detecting duplication and identifying bug type. The proposed framework will help to minimize time and developer's effort utilized in resolution of bug reports which will further improvise overall software quality. Information retrieval techniques are used for finding duplicate bugs and machine learning classification techniques are used for identifying the type of bug report. Through experiments, we found that the proposed framework obtained prediction accuracy up to 88.81%.
引用
收藏
页码:S275 / S292
页数:18
相关论文
共 50 条
  • [21] Bug characteristics in open source software
    Lin Tan
    Chen Liu
    Zhenmin Li
    Xuanhui Wang
    Yuanyuan Zhou
    Chengxiang Zhai
    Empirical Software Engineering, 2014, 19 : 1665 - 1705
  • [22] Incremental Relational Topic Model for Duplicate Bug Report Detection
    Nguyen, Anh Tuan
    Nguyen, Tien N.
    2022 29TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, APSEC, 2022, : 99 - 108
  • [23] Bug characteristics in open source software
    Tan, Lin
    Liu, Chen
    Li, Zhenmin
    Wang, Xuanhui
    Zhou, Yuanyuan
    Zhai, Chengxiang
    EMPIRICAL SOFTWARE ENGINEERING, 2014, 19 (06) : 1665 - 1705
  • [24] Duplicate Bug Report detection using Named Entity Recognition
    Zheng, Wei
    Li, Yunfan
    Wu, Xiaoxue
    Cheng, Jingyuan
    KNOWLEDGE-BASED SYSTEMS, 2024, 284
  • [25] SPBC: A self-paced learning model for bug classification from historical repositories of open-source software
    Mohsin, Hufsa
    Shi, Chongyang
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 167
  • [26] Open source portal to distributed image repositories
    Tao, WC
    Ratib, O
    Kho, HT
    Hsu, YC
    Wang, C
    Lee, C
    McCoy, JM
    MEDICAL IMAGING 2004: PACS AND IMAGING INFORMATICS, 2004, 5 (25): : 185 - 194
  • [27] A comparative study of the performance of IR models on duplicate bug detection
    Kaushik, Nilam
    Tahvildari, Ladan
    2012 16TH EUROPEAN CONFERENCE ON SOFTWARE MAINTENANCE AND REENGINEERING (CSMR), 2012, : 159 - 168
  • [28] DURFEX: A Feature Extraction Technique for Efficient Detection of Duplicate Bug Reports
    Sabor, Korosh Koochekian
    Hamou-Lhadj, Abdelwahab
    Larsson, Alf
    2017 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS), 2017, : 240 - 250
  • [29] Towards Understanding the Impacts of Textual Dissimilarity on Duplicate Bug Report Detection
    Jahan, Sigma
    Rahman, Mohammad Masudur
    2023 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING, SANER, 2023, : 25 - 36
  • [30] A Contextual Approach towards More Accurate Duplicate Bug Report Detection
    Alipour, Anahita
    Hindle, Abram
    Stroulia, Eleni
    2013 10TH IEEE WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR), 2013, : 183 - 192