New Methodology for Contextual Features Usage in Duplicate Bug Reports Detection

被引:0
|
作者
Neysiani, Behzad Soleimani [1 ]
Babamir, Seyed Morteza [1 ]
机构
[1] Univ Kashan, Fac Comp & Elect Engn, Dept Software Engn, Kashan, Esfahan, Iran
关键词
Information Retrieval; Natural Language Processing; Duplicate Detection; Bug Reports; Topic; Feature Expansion;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Duplicate bug report detection is one of the major problems in software triage systems like Bugzilla to deal with end user requests. User request contains some categorical and especially textual fields which need feature extraction for duplicate detection. Contextual and topical features are acquired using calculating cosine similarity between term frequency or inverse document frequency or BM25F technique from a pair of bug reports against some topics. This research proposes the individual Manhattan distance similarity approach instead of cosine distance similarity for every topic in contextual features to expand the feature dimension which can increase the accuracy of the duplicate bug report detection process. The four famous datasets of bug reports have used for evaluation of the proposed method including Android, Eclipse, Mozilla, and Open Office which the experimental results indicate performance improvement for four contextual features including general, cryptography, network, and Java topics.
引用
收藏
页码:178 / 183
页数:6
相关论文
共 50 条
  • [41] Exploring the Role of Automation in Duplicate Bug Report Detection: An Industrial Case Study
    Gotharsson, Malte
    Stahre, Karl
    Gay, Gregory
    Neto, Francisco Gomes de Oliveira
    PROCEEDINGS OF THE 2024 IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATION OF SOFTWARE TEST, AST 2024, 2024, : 193 - 203
  • [42] Understanding Key Features of High-impact Bug Reports
    Karim, Md. Rejaul
    Ihara, Akinori
    Yang, Xin
    Iida, Hajimu
    Matsumoto, Kenichi
    2017 8TH IEEE INTERNATIONAL WORKSHOP ON EMPIRICAL SOFTWARE ENGINEERING IN PRACTICE (IWESEP), 2017, : 53 - 58
  • [43] Detection of duplicate defect reports using Natural Language Processing
    Runeson, Per
    Alexandersson, Magnus
    Nyholm, Oskar
    ICSE 2007: 29TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, PROCEEDINGS, 2007, : 499 - +
  • [44] A New Method of Security Bug Reports Analysis
    Xu, Yunwu
    Li, Yan
    IT PROFESSIONAL, 2024, 26 (02) : 49 - 56
  • [45] Semantic GUI Scene Learning and Video Alignment for Detecting Duplicate Video-based Bug Reports
    William & Mary, Williamsburg
    VA, United States
    不详
    FL, United States
    arXiv,
  • [46] Efficient feature extraction model for validation performance improvement of duplicate bug report detection in software bug triage systems
    Neysiani, Behzad Soleimani
    Babamir, Seyed Morteza
    Aritsugi, Masayoshi
    INFORMATION AND SOFTWARE TECHNOLOGY, 2020, 126
  • [47] Does Deep Learning improve the performance of duplicate bug report detection? An empirical study?
    Jiang, Yuan
    Su, Xiaohong
    Treude, Christoph
    Shang, Chao
    Wang, Tiantian
    JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 198
  • [48] Duplicate Bug Report Detection Using an Attention-Based Neural Language Model
    Ben Messaoud, Montassar
    Miladi, Asma
    Jenhani, Ilyes
    Mkaouer, Mohamed Wiem
    Ghadhab, Lobna
    IEEE TRANSACTIONS ON RELIABILITY, 2023, 72 (02) : 846 - 858
  • [49] Duplicate Bug Report Detection Using Dual-Channel Convolutional Neural Networks
    He, Jianjun
    Xu, Ling
    Yan, Meng
    Xia, Xin
    Lei, Yan
    2020 IEEE/ACM 28TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION, ICPC, 2020, : 117 - 127
  • [50] Automatically Identifying Security Bug Reports via Multitype Features Analysis
    Zou, Deqing
    Deng, Zhijun
    Li, Zhen
    Jin, Hai
    INFORMATION SECURITY AND PRIVACY, 2018, 10946 : 619 - 633