Automatic software code repair using deep learning techniques

被引:0
|
作者
Abdollahpour, Mohammad Mahdi [1 ]
Ashtiani, Mehrdad [1 ]
Bakhshi, Fatemeh [1 ]
机构
[1] Iran Univ Sci & Technol, Sch Comp Engn, Tehran, Iran
关键词
Code repair; Deep learning; Bug; Software defect; Bert; Transfer learning;
D O I
10.1007/s11219-023-09653-1
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In the multi-hundred-billion-dollar industry of software development, the debugging process is an expensive task for developers. So, much effort has been put into debugging automation. In the last decade, researchers have repaired codes according to predefined rules which are only effective in limited types of bugs. Through a lot of experiments, researchers have found that deep learning models are useful in code repair automation similar to the revolutionary results these approaches have produced in various other fields. To solve such a difficult problem, recent works focus on debugging bugs that appear on one line of code. It has been reported that this type of bug occurs at an average rate of 1 out of every 1600 lines of code in a software project, which is significant. The current research follows these approaches and introduces a novel automatic code-repair system. We have employed the transfer learning technique to reuse a pre-trained model on the problem. The proposed system is designed based on the encoder-decoder architecture. In the encoder, a new pre-trained Bert model named JavaBert is used. Then, the model was fine tuned. The decoder is a transformer with an autoregressive structure. ManySStuBs4J [1] dataset is used for evaluation purposes. The results of the evaluations show that the proposed system has higher accuracy and BLEU criteria than CodeBert and the baseline model. Baseline is a simple model that acts as a reference in machine learning studies and CodeBert is one of the most similar models to the proposed model. The bilingual evaluation understudy score (BLEU) improvement is between 0.04 and 0.16%, the accuracy improvement is between 0.64 and 5.81%, the recall improvement is between 1.08 and 9.2%, and the F-score improvement is between 3.27 and 6.18%.
引用
收藏
页码:361 / 390
页数:30
相关论文
共 50 条
  • [21] Using machine learning techniques for DSP software performance prediction at source code level
    Liu, Weihua
    Hu, Erh-Wen
    Su, Bogong
    Wang, Jian
    [J]. CONNECTION SCIENCE, 2021, 33 (01) : 26 - 41
  • [22] Predicting Code Smells and Analysis of Predictions: Using Machine Learning Techniques and Software Metrics
    Mhawish, Mohammad Y.
    Gupta, Manjari
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2020, 35 (06) : 1428 - 1445
  • [23] A deep learning-based approach for software vulnerability detection using code metrics
    Subhan, Fazli
    Wu, Xiaoxue
    Bo, Lili
    Sun, Xiaobing
    Rahman, Muhammad
    [J]. IET SOFTWARE, 2022, 16 (05) : 516 - 526
  • [24] AN EVALUATION OF DEEP LEARNING TECHNIQUES FOR QR CODE DETECTION
    Blanger, Leonardo
    Hirata, Nina S. T.
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1625 - 1629
  • [25] Automatic Classification of Cardiac Arrhythmias Using Deep Learning Techniques: A Systematic Review
    Vasquez-Iturralde, Fernando
    Flores-Calero, Marco Javier
    Grijalva, Felipe
    Rosales-Acosta, Andres
    [J]. IEEE ACCESS, 2024, 12 : 118467 - 118492
  • [26] Automatic Detection of Traffic Accidents from Video Using Deep Learning Techniques
    Robles-Serrano, Sergio
    Sanchez-Torres, German
    Branch-Bedoya, John
    [J]. COMPUTERS, 2021, 10 (11)
  • [27] Automatic biometry of fetal brain MRIs using deep and machine learning techniques
    She, Jiayan
    Huang, Haiying
    Ye, Zhijun
    Huang, Wei
    Sun, Yan
    Liu, Chuan
    Yang, Weilin
    Wang, Jiaxi
    Ye, Pengfei
    Zhang, Lei
    Ning, Gang
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [28] Automatic Brain White Matter Hypertinsities Segmentation using Deep Learning Techniques
    Viteri, Jose A.
    Loayza, Francis R.
    Pelaez, Enrique
    Layedra, Fabricio
    [J]. HEALTHINF: PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES - VOL. 5: HEALTHINF, 2021, : 244 - 252
  • [29] Automatic Cloud Detection and Removal in Satellite Imagery Using Deep Learning Techniques
    Li, Jingyi
    Lv, Yinbao
    Yan, Xu
    Weng, Hongjian
    Li, Duo
    Shi, Nan
    [J]. TRAITEMENT DU SIGNAL, 2024, 41 (02) : 857 - 865
  • [30] Automatic biometry of fetal brain MRIs using deep and machine learning techniques
    Jiayan She
    Haiying Huang
    Zhijun Ye
    Wei Huang
    Yan Sun
    Chuan Liu
    Weilin Yang
    Jiaxi Wang
    Pengfei Ye
    Lei Zhang
    Gang Ning
    [J]. Scientific Reports, 13