Large language model ChatGPT versus small deep learning models for self-admitted technical debt detection: Why not together?

Cited by: 0

Authors:

Li, Jun [1]

Li, Lixian [2]

Liu, Jin [1]

Yu, Xiao [3,4]

Liu, Xiao [5]

Keung, Jacky Wai [6]

Affiliations:

[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China

[2] China Satellite Network Explorat Co Ltd, Chongqing, Peoples R China

[3] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan, Peoples R China

[4] Wuhan Univ Technol, Chongqing Res Inst, Chongqing, Peoples R China

[5] Deakin Univ, Sch Informat Technol, Geelong, Australia

[6] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China

Source:

SOFTWARE-PRACTICE & EXPERIENCE | 2024

Funding:

National Natural Science Foundation of China;

Keywords:

ChatGPT; fusion; performance and interpretability; self-admitted technical debt; small deep learning models; CLASSIFICATION;

DOI:

10.1002/spe.3360

Chinese Library Classification (CLC) number:

TP31 [Computer Software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

Given the increasing complexity and volume of Self-Admitted Technical Debts (SATDs), how to efficiently detect them becomes critical in software engineering practice for improving code quality and project efficiency. Although current deep learning methods have achieved good performance in detecting SATDs in code comments, they lack explanation. Large language models such as ChatGPT are increasingly being applied to text classification tasks due to their ability to provide explanations for classification results, but it is unclear how effective ChatGPT is for SATD classification. As the first in-depth study of ChatGPT for SATD detection, we evaluate ChatGPT's effectiveness, compare it with small deep learning models, and find that ChatGPT performs better on Recall, while small models perform better on Precision. Furthermore, to enhance the performance of these approaches, we propose a novel fusion approach named FSATD which combines ChatGPT with small models for SATD detection so as to provide reliable explanations. Through extensive experiments on 62,276 comments from 10 open-source projects, we show that FSATD outperforms existing methods in performance of F1-score in cross-project scenarios. Additionally, FSATD allows for flexible adjustment of fusion strategies, adapting to different requirements of various application scenarios, and can achieve the best Precision, Recall, or F1-score.


Pages: 26
