Automatic software code repair using deep learning techniques

Cited by: 0

Authors:

Abdollahpour, Mohammad Mahdi [1]

Ashtiani, Mehrdad [1]

Bakhshi, Fatemeh [1]

Affiliation:

[1] Iran Univ Sci & Technol, Sch Comp Engn, Tehran, Iran

Source:

SOFTWARE QUALITY JOURNAL | 2024 / Vol. 32 / Issue 02

Keywords:

Code repair; Deep learning; Bug; Software defect; Bert; Transfer learning

DOI:

10.1007/s11219-023-09653-1

Chinese Library Classification (CLC) number:

TP31 [Computer software]

Subject classification codes:

081202; 0835

Abstract:

In the multi-hundred-billion-dollar software development industry, debugging is an expensive task for developers, so much effort has gone into automating it. Over the last decade, researchers have repaired code according to predefined rules, which are effective only for limited types of bugs. Extensive experiments have shown that deep learning models are useful for automating code repair, much as these approaches have produced revolutionary results in various other fields. Because the problem is so difficult, recent works focus on repairing bugs that appear on a single line of code. This type of bug has been reported to occur at an average rate of 1 in every 1600 lines of code in a software project, which is significant. The current research follows these approaches and introduces a novel automatic code-repair system. We employ transfer learning to reuse a pre-trained model for the problem. The proposed system is based on the encoder-decoder architecture. The encoder uses a new pre-trained BERT model named JavaBert, which is then fine-tuned. The decoder is a transformer with an autoregressive structure. The ManySStuBs4J [1] dataset is used for evaluation. The results show that the proposed system achieves higher accuracy and BLEU (bilingual evaluation understudy) scores than CodeBert and the baseline model. The baseline is a simple model that serves as a reference point, as is common in machine learning studies, and CodeBert is one of the models most similar to the proposed one. The BLEU improvement is between 0.04 and 0.16%, the accuracy improvement is between 0.64 and 5.81%, the recall improvement is between 1.08 and 9.2%, and the F-score improvement is between 3.27 and 6.18%.
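
The abstract reports accuracy and BLEU for predicted one-line fixes. As a rough illustration of how such scores can be computed, the sketch below compares a predicted line against a reference fix. It is not the authors' evaluation code: the function names are hypothetical, whitespace tokenization stands in for whatever tokenizer the paper uses, and the BLEU variant shown (unsmoothed, single reference, up to 4-grams) is an assumption.

```python
import math
from collections import Counter


def tokenize(line):
    # Assumption: plain whitespace tokenization; a real system would
    # use the tokenizer that matches its model.
    return line.split()


def exact_match_accuracy(predictions, references):
    """Fraction of predicted lines whose tokens equal the reference fix."""
    matches = sum(
        tokenize(p) == tokenize(r) for p, r in zip(predictions, references)
    )
    return matches / len(references)


def ngram_counts(tokens, n):
    """Count every contiguous n-gram in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(prediction, reference, max_n=4):
    """Unsmoothed sentence-level BLEU against a single reference."""
    pred, ref = tokenize(prediction), tokenize(reference)
    if not pred:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        pred_counts = ngram_counts(pred, n)
        ref_counts = ngram_counts(ref, n)
        total = sum(pred_counts.values())
        # Clip each predicted n-gram count by its count in the reference.
        clipped = sum(min(c, ref_counts[g]) for g, c in pred_counts.items())
        if total == 0 or clipped == 0:
            return 0.0  # without smoothing, any zero precision gives BLEU 0
        log_precisions.append(math.log(clipped / total))
    # Brevity penalty: penalize predictions shorter than the reference.
    if len(pred) >= len(ref):
        brevity_penalty = 1.0
    else:
        brevity_penalty = math.exp(1 - len(ref) / len(pred))
    return brevity_penalty * math.exp(sum(log_precisions) / max_n)


# Hypothetical one-line bug fixes: the second prediction uses the wrong operator.
references = ["return a + b ;", "if ( x > 0 ) return y ;"]
predictions = ["return a + b ;", "if ( x >= 0 ) return y ;"]

accuracy = exact_match_accuracy(predictions, references)
bleu_scores = [sentence_bleu(p, r) for p, r in zip(predictions, references)]
```

Exact match is strict: a single wrong token makes the whole line count as a miss, whereas BLEU still gives partial credit for the overlapping n-grams, which is one reason the two metrics are usually reported together.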


Pages: 361-390

Page count: 30

