Still Confusing for Bug-Component Triaging? Deep Feature Learning and Ensemble Setting to Rescue

被引:1
|
作者
Su, Yanqi [1 ]
Han, Zheming [1 ]
Gao, Zhipeng [2 ]
Xing, Zhenchang [1 ,3 ]
Lu, Qinghua [3 ]
Xu, Xiwei [3 ]
机构
[1] Australian Natl Univ, Canberra, ACT 0200, Australia
[2] Zhejiang Univ China, Hangzhou, Peoples R China
[3] CSIRO, Data61, Geelong, Vic, Australia
基金
美国国家科学基金会;
关键词
Bug Triaging; Deep Learning; Text Classification; SEVERITY; ACCURATE;
D O I
10.1109/ICPC58990.2023.00046
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
To speed up the bug-fixing process, it is essential to triage bugs into the right components as soon as possible. Given the large number of bugs filed everyday, a reliable and effective bug-component triaging tool is needed to assist this task. LR-BKG is the state-of-the-art toolkit for doing this. However, the suboptimal performance for recommending the right component at the first position (low Top-1 accuracy) limits its usage in practice. We thoroughly investigate the limitations of LR-BKG and find out the gap between the manual feature design of LR-BKG and the characteristics of bug reports causes such suboptimal performance. Therefore, we propose an approach, DEEPTRIAG, which uses the large scale pre-trained models to extract deep features automatically from bug reports (including bug summary and description), to fill this gap. DEEPTRIAG transforms bug-component triaging into a multi-classification task (CodeBERT-Classifier) and a generation task (CodeT5-Generator). Then, we ensemble the prediction results from them to improve the performance of bug-component triaging further. Extensive experimental results demonstrate the superior performance of DEEPTRIAG on bug-component triaging over LR-BKG. In particular, the overall Top-1 accuracy is improved from 56.2% to 68.3% on Mozilla dataset and from 51.3% to 64.1% on Eclipse dataset, which verifies the effectiveness and generalization of our approach on improving the practical usage for bug-component triaging.
引用
收藏
页码:316 / 327
页数:12
相关论文
共 50 条
  • [31] Deep Kernel Principal Component Analysis for multi-level feature learning
    Tonin, Francesco
    Tao, Qinghua
    Patrinos, Panagiotis
    Suykens, Johan A.K.
    [J]. Neural Networks, 2024, 170 : 578 - 595
  • [32] Deep Kernel Principal Component Analysis for multi-level feature learning
    Tonin, Francesco
    Tao, Qinghua
    Patrinos, Panagiotis
    Suykens, Johan A. K.
    [J]. NEURAL NETWORKS, 2024, 170 : 578 - 595
  • [33] Machine learning-based lung and colon cancer detection using deep feature extraction and ensemble learning
    Talukder, Md Alamin
    Islam, Md Manowarul
    Uddin, Md Ashraf
    Akhter, Arnisha
    Hasan, Khondokar Fida
    Moni, Mohammad Ali
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 205
  • [34] Approach for Detecting Attacks on IoT Networks Based on Ensemble Feature Selection and Deep Learning Models
    Rihan, Shaza Dawood Ahmed
    Anbar, Mohammed
    Alabsi, Basim Ahmad
    [J]. SENSORS, 2023, 23 (17)
  • [35] Spatial Feature Fusion for Biomedical Image Classification based on Ensemble Deep CNN and Transfer Learning
    Patel, Sanskruti
    Patel, Rachana
    Ganatra, Nilay
    Patel, Atul
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (05) : 153 - 159
  • [36] DETECTION OF ANDROID MALWARE USING DEEP LEARNING ENSEMBLE WITH CHEETAH-OPTIMIZED FEATURE SELECTION
    Almotairi, Sultan
    Khan, Mohd Abdul Rahim
    Alharbi, Olayan
    Alzaid, Zaid
    Hausawi, Yasser M.
    Almutairi, Jaber
    [J]. ADVANCES AND APPLICATIONS IN DISCRETE MATHEMATICS, 2024, 41 (05): : 357 - 392
  • [37] FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear Modulation
    Turkoglu, Mehmet Ozgur
    Becker, Alexander
    Guenduez, Hueseyin Anil
    Rezaei, Mina
    Bischl, Bernd
    Daudt, Rodrigo Caye
    D'Aronco, Stefano
    Wegner, Jan Dirk
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [38] BIO-INSPIRED ENSEMBLE FEATURE SELECTION (BIEFS) AND ENSEMBLE MULTIPLE DEEP LEARNING (EMDL) CLASSIFIER FOR BREAST CANCER DIAGNOSIS
    Priya, R. S. Padma
    Vadivu, P. Senthil
    [J]. JOURNAL OF PHARMACEUTICAL NEGATIVE RESULTS, 2022, 13 : 483 - 499
  • [39] Hyperspectral Image Classification Algorithm Based on Principal Component Texture Feature Deep Learning
    Xu Yifang
    [J]. JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2020, 10 (09) : 2027 - 2031
  • [40] Feature Selection Based on Principal Component Regression for Underwater Source Localization by Deep Learning
    Zhu, Xiaoyu
    Dong, Hefeng
    Salvo Rossi, Pierluigi
    Landro, Martin
    [J]. REMOTE SENSING, 2021, 13 (08)