Large language model ChatGPT versus small deep learning models for self-admitted technical debt detection: Why not together?

被引:0
|
作者
Li, Jun [1 ]
Li, Lixian [2 ]
Liu, Jin [1 ]
Yu, Xiao [3 ,4 ]
Liu, Xiao [5 ]
Keung, Jacky Wai [6 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[2] China Satellite Network Explorat Co Ltd, Chongqing, Peoples R China
[3] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan, Peoples R China
[4] Wuhan Univ Technol, Chongqing Res Inst, Chongqing, Peoples R China
[5] Deakin Univ, Sch Informat Technol, Geelong, Australia
[6] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
ChatGPT; fusion; performance and interpretability; self-admitted technical debt; small deep learning models; CLASSIFICATION;
D O I
10.1002/spe.3360
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Given the increasing complexity and volume of Self-Admitted Technical Debts (SATDs), how to efficiently detect them becomes critical in software engineering practice for improving code quality and project efficiency. Although current deep learning methods have achieved good performance in detecting SATDs in code comments, they lack explanation. Large language models such as ChatGPT are increasingly being applied to text classification tasks due to their ability to provide explanations for classification results, but it is unclear how effective ChatGPT is for SATD classification. As the first in-depth study of ChatGPT for SATD detection, we evaluate ChatGPT's effectiveness, compare it with small deep learning models, and find that ChatGPT performs better on Recall, while small models perform better on Precision. Furthermore, to enhance the performance of these approaches, we propose a novel fusion approach named FSATD which combines ChatGPT with small models for SATD detection so as to provide reliable explanations. Through extensive experiments on 62,276 comments from 10 open-source projects, we show that FSATD outperforms existing methods in performance of F1-score in cross-project scenarios. Additionally, FSATD allows for flexible adjustment of fusion strategies, adapting to different requirements of various application scenarios, and can achieve the best Precision, Recall, or F1-score.
引用
收藏
页数:26
相关论文
共 35 条
  • [1] Deep Learning-Based Self-Admitted Technical Debt Detection Empirical Research
    Qu, Yubin
    Bao, Tie
    Yuan, Meng
    Li, Long
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2023, 24 (04): : 975 - 987
  • [2] Self-admitted technical debt in R: detection and causes
    Rishab Sharma
    Ramin Shahbazi
    Fatemeh H. Fard
    Zadia Codabux
    Melina Vidoni
    [J]. Automated Software Engineering, 2022, 29
  • [3] Self-admitted technical debt in R: detection and causes
    Sharma, Rishab
    Shahbazi, Ramin
    Fard, Fatemeh H.
    Codabux, Zadia
    Vidoni, Melina
    [J]. AUTOMATED SOFTWARE ENGINEERING, 2022, 29 (02)
  • [4] Data Balancing Improves Self-Admitted Technical Debt Detection
    Sridharan, Murali
    Mantyla, Mika
    Rantala, Leevi
    Claes, Maelick
    [J]. 2021 IEEE/ACM 18TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2021), 2021, : 358 - 368
  • [5] Automatically Learning Patterns for Self-Admitted Technical Debt Removal
    Zampetti, Fiorella
    Serebrenik, Alexander
    Di Penta, Massimiliano
    [J]. PROCEEDINGS OF THE 2020 IEEE 27TH INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER '20), 2020, : 355 - 366
  • [6] Deep neural network ensembles for detecting self-admitted technical debt
    Yin, Ming
    Zhu, Kuiyu
    Xiao, Hongli
    Zhu, Dan
    Jiang, Jijiao
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (01) : 93 - 105
  • [7] A Large-Scale Empirical Study on Self-Admitted Technical Debt
    Bavota, Gabriele
    Russo, Barbara
    [J]. 13TH WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2016), 2016, : 315 - 326
  • [8] Using Natural Language Processing to Automatically Detect Self-Admitted Technical Debt
    Maldonado, Everton da Silva
    Shihab, Emad
    Tsantalis, Nikolaos
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2017, 43 (11) : 1044 - 1062
  • [9] Self-admitted technical debt detection by learning its comprehensive semantics via graph neural networks
    Li, Hui
    Qu, Yang
    Liu, Yong
    Chen, Rong
    Ai, Jun
    Guo, Shikai
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2022, 52 (10): : 2152 - 2176
  • [10] 23 Shades of Self-Admitted Technical Debt: An Empirical Study on Machine Learning Software
    OBrien, David
    Biswas, Sumon
    Imtiaz, Sayem
    Abdalkareem, Rabe
    Shihab, Emad
    Rajan, Hridesh
    [J]. PROCEEDINGS OF THE 30TH ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2022, 2022, : 734 - 746