Still Confusing for Bug-Component Triaging? Deep Feature Learning and Ensemble Setting to Rescue

Cited by: 1

Authors:

Su, Yanqi [1]

Han, Zheming [1]

Gao, Zhipeng [2]

Xing, Zhenchang [1,3]

Lu, Qinghua [3]

Xu, Xiwei [3]

Affiliations:

[1] Australian Natl Univ, Canberra, ACT 0200, Australia

[2] Zhejiang Univ China, Hangzhou, Peoples R China

[3] CSIRO, Data61, Geelong, Vic, Australia

Source:

2023 IEEE/ACM 31ST INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION, ICPC | 2023

Funding:

National Science Foundation (USA);

Keywords:

Bug Triaging; Deep Learning; Text Classification; SEVERITY; ACCURATE;

DOI:

10.1109/ICPC58990.2023.00046

Chinese Library Classification (CLC) number:

TP31 [Computer Software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

To speed up the bug-fixing process, it is essential to triage bugs into the right components as soon as possible. Given the large number of bugs filed every day, a reliable and effective bug-component triaging tool is needed to assist with this task. LR-BKG is the state-of-the-art toolkit for doing this. However, its suboptimal performance in recommending the right component at the first position (low Top-1 accuracy) limits its usage in practice. We thoroughly investigate the limitations of LR-BKG and find that the gap between the manual feature design of LR-BKG and the characteristics of bug reports causes this suboptimal performance. To fill this gap, we propose an approach, DEEPTRIAG, which uses large-scale pre-trained models to extract deep features automatically from bug reports (including the bug summary and description). DEEPTRIAG transforms bug-component triaging into a multi-class classification task (CodeBERT-Classifier) and a generation task (CodeT5-Generator). We then ensemble the prediction results of the two to further improve the performance of bug-component triaging. Extensive experimental results demonstrate the superior performance of DEEPTRIAG over LR-BKG on bug-component triaging. In particular, the overall Top-1 accuracy is improved from 56.2% to 68.3% on the Mozilla dataset and from 51.3% to 64.1% on the Eclipse dataset, which verifies the effectiveness and generalization of our approach in improving the practical usability of bug-component triaging.
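
The abstract mentions ensembling two predictors and evaluating with Top-1 accuracy but does not say how the predictions are combined. The following is a minimal Python sketch under stated assumptions: the weighted-sum combination, the function names `ensemble_rank` and `top_k_accuracy`, the `weight` parameter, and the component names are all hypothetical illustrations and are not taken from the paper.

```python
from collections import defaultdict


def ensemble_rank(classifier_scores, generator_scores, weight=0.5):
    """Rank components by a weighted sum of two score dictionaries.

    ASSUMPTION: a weighted sum is used purely for illustration; the
    abstract does not describe the actual ensemble rule.
    """
    combined = defaultdict(float)
    for component, score in classifier_scores.items():
        combined[component] += weight * score
    for component, score in generator_scores.items():
        combined[component] += (1.0 - weight) * score
    # Highest combined score first; ties are broken alphabetically.
    return sorted(combined, key=lambda c: (-combined[c], c))


def top_k_accuracy(ranked_lists, true_components, k=1):
    """Fraction of bug reports whose true component is in the first k ranks."""
    hits = sum(
        1
        for ranked, truth in zip(ranked_lists, true_components)
        if truth in ranked[:k]
    )
    return hits / len(true_components)


# Hypothetical scores for a single bug report (component names are made up).
clf_scores = {"Networking": 0.5, "DOM": 0.4, "Layout": 0.1}
gen_scores = {"DOM": 0.7, "Networking": 0.2, "Graphics": 0.1}
ranking = ensemble_rank(clf_scores, gen_scores)
```

In this toy case the classifier alone would rank "Networking" first, while the combined ranking places "DOM" first, which illustrates how an ensemble can change the Top-1 recommendation.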


Pages: 316-327

Page count: 12
