Detecting Duplicate Bug Reports with Software Engineering Domain Knowledge

被引:0
|
作者
Aggarwal, Karan [1 ]
Rutgers, Tanner [1 ]
Timbers, Finbarr [1 ]
Hindle, Abram [1 ]
Greiner, Russ [1 ]
Stroulia, Eleni [1 ]
机构
[1] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada
关键词
duplicate bug reports; information retrieval; software engineering textbooks; machine learning; software literature; documentation;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In previous work by Alipour et al., a methodology was proposed for detecting duplicate bug reports by comparing the textual content of bug reports to subject-specific contextual material, namely lists of software-engineering terms, such as non-functional requirements and architecture keywords. When a bug report contains a word in these word-list contexts, the bug report is considered to be associated with that context and this information tends to improve bug-deduplication methods. In this paper, we propose a method to partially automate the extraction of contextual word lists from software-engineering literature. Evaluating this software-literature context method on real-world bug reports produces useful results that indicate this semi-automated method has the potential to substantially decrease the manual effort used in contextual bug deduplication while suffering only a minor loss in accuracy.
引用
收藏
页码:211 / 220
页数:10
相关论文
共 50 条
  • [21] New Methodology for Contextual Features Usage in Duplicate Bug Reports Detection
    Neysiani, Behzad Soleimani
    Babamir, Seyed Morteza
    2019 5TH INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), 2019, : 178 - 183
  • [22] A Replication Package for It Takes Two to TANGO: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports
    Cooper, Nathan
    Bernal-Cardenas, Carlos
    Chaparro, Oscar
    Moran, Kevin
    Poshyvanyk, Denys
    2021 IEEE/ACM 43RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS (ICSE-COMPANION 2021), 2021, : 160 - 161
  • [23] SOFTWARE MODULE CLASSIFICATION FOR COMMERCIAL BUG REPORTS
    Ozturk, Ceyhun E.
    Yilmaz, Eyup Halit
    Koksal, Omer
    Koc, Aykut
    2023 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW, 2023,
  • [24] Modeling Domain Knowledge in Support of Requirements Analysis in Software Engineering
    Li, Zhi
    Hall, Jon G.
    Rapanotti, Lucia
    2010 INTERNATIONAL CONFERENCE ON COMMUNICATION AND VEHICULAR TECHNOLOGY (ICCVT 2010), VOL II, 2010, : 270 - 273
  • [25] Identifying and Detecting Inaccurate Stack Traces in Bug Reports
    Bheree, Meher Kiran
    Anvik, John
    2024 7TH INTERNATIONAL CONFERENCE ON SOFTWARE AND SYSTEM ENGINEERING, ICOSSE 2024, 2024, : 9 - 14
  • [26] An HMM-based approach for automatic detection and classification of duplicate bug reports
    Ebrahimi, Neda
    Trabelsi, Abdelaziz
    Islam, Md Shariful
    Hamou-Lhadj, Abdelwahab
    Khanmohammadi, Kobra
    INFORMATION AND SOFTWARE TECHNOLOGY, 2019, 113 : 98 - 109
  • [27] Software engineering and knowledge engineering
    Juristo, N
    Acuña, ST
    EXPERT SYSTEMS WITH APPLICATIONS, 2002, 23 (04) : 345 - 347
  • [28] Towards Word Embeddings for Improved Duplicate Bug Report Retrieval in Software Repositories
    Budhiraja, Amar
    Dutta, Kartik
    Shrivastava, Manish
    Reddy, Raghu
    PROCEEDINGS OF THE 2018 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'18), 2018, : 167 - 170
  • [29] Analyzing Bug Reports by Topic Mining in Software Evolution
    Nguyen, Uy
    Cheng, Kowk Sun
    Cho, Samuel Sungmin
    Song, Myoungkyu
    2021 IEEE 45TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2021), 2021, : 1645 - 1652
  • [30] Invalid bug reports complicate the software aging situation
    Wu, Xiaoxue
    Zheng, Wei
    Pu, Minchao
    Chen, Jie
    Mu, Dejun
    SOFTWARE QUALITY JOURNAL, 2020, 28 (01) : 195 - 220