New Methodology for Contextual Features Usage in Duplicate Bug Reports Detection

被引:0
|
作者
Neysiani, Behzad Soleimani [1 ]
Babamir, Seyed Morteza [1 ]
机构
[1] Univ Kashan, Fac Comp & Elect Engn, Dept Software Engn, Kashan, Esfahan, Iran
关键词
Information Retrieval; Natural Language Processing; Duplicate Detection; Bug Reports; Topic; Feature Expansion;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Duplicate bug report detection is one of the major problems in software triage systems like Bugzilla to deal with end user requests. User request contains some categorical and especially textual fields which need feature extraction for duplicate detection. Contextual and topical features are acquired using calculating cosine similarity between term frequency or inverse document frequency or BM25F technique from a pair of bug reports against some topics. This research proposes the individual Manhattan distance similarity approach instead of cosine distance similarity for every topic in contextual features to expand the feature dimension which can increase the accuracy of the duplicate bug report detection process. The four famous datasets of bug reports have used for evaluation of the proposed method including Android, Eclipse, Mozilla, and Open Office which the experimental results indicate performance improvement for four contextual features including general, cryptography, network, and Java topics.
引用
收藏
页码:178 / 183
页数:6
相关论文
共 50 条
  • [21] Duplicate Bug Report Detection Using Clustering
    Gopalan, Raj P.
    Krishna, Aneesh
    2014 23RD AUSTRALASIAN SOFTWARE ENGINEERING CONFERENCE (ASWEC), 2013, : 104 - 109
  • [22] Detecting Duplicate Bug Reports with Software Engineering Domain Knowledge
    Aggarwal, Karan
    Rutgers, Tanner
    Timbers, Finbarr
    Hindle, Abram
    Greiner, Russ
    Stroulia, Eleni
    2015 22ND INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER), 2015, : 211 - 220
  • [23] Detecting duplicate bug reports with software engineering domain knowledge
    Aggarwal, Karan
    Timbers, Finbarr
    Rutgers, Tanner
    Hindle, Abram
    Stroulia, Eleni
    Greiner, Russell
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2017, 29 (03)
  • [24] Duplicate Bug Report Detection: How Far Are We?
    Zhang, Ting
    Han, Donggyun
    Vinayakarao, Venkatesh
    Irsan, Ivana Clairine
    Xu, Bowen
    Thung, Ferdian
    Lo, David
    Jiang, Lingxiao
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (04)
  • [25] Coping with Duplicate Bug Reports in Free/Open Source Software Projects
    Davidson, Jennifer L.
    Mohan, Nitin
    Jensen, Carlos
    2011 IEEE SYMPOSIUM ON VISUAL LANGUAGES AND HUMAN-CENTRIC COMPUTING (VL/HCC 2011), 2011, : 101 - 108
  • [26] An Approach to Detecting Duplicate Bug Reports using Natural Language and Execution Information
    Wang, Xiaoyin
    Zhang, Lu
    Xie, Tao
    Anvik, John
    Sun, Jiasu
    ICSE'08 PROCEEDINGS OF THE THIRTIETH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, 2008, : 461 - 470
  • [27] Incremental Relational Topic Model for Duplicate Bug Report Detection
    Nguyen, Anh Tuan
    Nguyen, Tien N.
    2022 29TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, APSEC, 2022, : 99 - 108
  • [28] Duplicate Bug Report detection using Named Entity Recognition
    Zheng, Wei
    Li, Yunfan
    Wu, Xiaoxue
    Cheng, Jingyuan
    KNOWLEDGE-BASED SYSTEMS, 2024, 284
  • [29] A comparative study of the performance of IR models on duplicate bug detection
    Kaushik, Nilam
    Tahvildari, Ladan
    2012 16TH EUROPEAN CONFERENCE ON SOFTWARE MAINTENANCE AND REENGINEERING (CSMR), 2012, : 159 - 168
  • [30] DENATURE: duplicate detection and type identification in open source bug repositories
    Chauhan, Ruby
    Sharma, Shakshi
    Goyal, Anjali
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2023, 14 (SUPPL 1) : S275 - S292