Application of Deep Learning in Software Defect Prediction: Systematic Literature Review and Meta-analysis

被引:20
|
作者
Zain, Zuhaira Muhammad [1 ]
Sakri, Sapiah [1 ]
Ismail, Nurul Halimatul Asmak [2 ]
机构
[1] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Informat Syst Dept, Riyadh, Saudi Arabia
[2] Princess Nourah Bint Abdulrahman Univ, Appl Coll, Dept Comp Sci & Informat Technol, Riyadh, Saudi Arabia
关键词
Deep Learning; Software Defect Prediction; Systematic Literature Review; Meta-Analysis; MEAN SQUARED ERROR; QUALITY;
D O I
10.1016/j.infsof.2023.107175
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Context: Despite recent attention given to Software Defect Prediction (SDP), the lack of any systematic effort to assess existing empirical evidence on the application of Deep Learning (DL) in SDP indicates that it is still relatively under-researched.Objective: To synthesize literature on SDP using DL, pertaining to measurements, models, techniques, datasets, and achievements; to obtain a full understanding of current SDP-related methodologies using DL; and to compare the DL models' performances with those of Machine Learning (ML) models in classifying software defects.Method: We completed a thorough review of the literature in this domain. To answer the research issues, results from primary investigations were synthesized. The preliminary findings for DL vs. ML in SDP were verified by using meta-analysis (MA).Result: We discovered 63 primary studies that passed the systematic literature review quality evaluation. However, only 19 primary studies passed the MA quality evaluation. The five most popular performance mea-surements employed in SDP were f-measure, recall, accuracy, precision, and Area Under the Curve (AUC). The top five DL techniques used in building SDP models were Convolutional Neural Network (CNN), Deep Neural Network (DNN), Long Short-Term Memory (LSTM), Deep Belief Network (DBN), and Stacked Denoising Autoencoder (SDAE). PROMISE and NASA datasets were found to be used more frequently to train and test DL models in SDP. The MA results show that DL was favored over ML in terms of study and dataset across accuracy, f-measure, and AUC.Conclusion: The application of DL in SDP remains a challenge, but it has the potential to achieve better predictive performance when the performance-influencing parameters are optimized. We provide a reference point for future research which could be used to improve research quality in this domain.
引用
收藏
页数:28
相关论文
共 50 条
  • [21] The effectiveness of universal design for learning: A systematic review of the literature and meta-analysis
    Almeqdad, Qais I.
    Alodat, Ali M.
    Alquraan, Mahmoud F.
    Mohaidat, Mohammad A.
    Al-Makhzoomy, Alaa K.
    COGENT EDUCATION, 2023, 10 (01):
  • [22] Deep learning for cephalometric landmark detection: systematic review and meta-analysis
    Schwendicke, Falk
    Chaurasia, Akhilanand
    Arsiwala, Lubaina
    Lee, Jae-Hong
    Elhennawy, Karim
    Jost-Brinkmann, Paul-Georg
    Demarco, Flavio
    Krois, Joachim
    CLINICAL ORAL INVESTIGATIONS, 2021, 25 (07) : 4299 - 4309
  • [23] Deep learning for cephalometric landmark detection: systematic review and meta-analysis
    Falk Schwendicke
    Akhilanand Chaurasia
    Lubaina Arsiwala
    Jae-Hong Lee
    Karim Elhennawy
    Paul-Georg Jost-Brinkmann
    Flavio Demarco
    Joachim Krois
    Clinical Oral Investigations, 2021, 25 : 4299 - 4309
  • [24] Deep learning for temporomandibular joint arthropathies: A systematic review and meta-analysis
    Rokhshad, Rata
    Mohammad-Rahimi, Hossein
    Sohrabniya, Fatemeh
    Jafari, Bahare
    Shobeiri, Parnian
    Tsolakis, Ioannis A.
    Ourang, Seyed AmirHossein
    Sultan, Ahmed S.
    Khawaja, Shehryar Nasir
    Bavarian, Roxanne
    Palomo, Juan Martin
    JOURNAL OF ORAL REHABILITATION, 2024, 51 (08) : 1632 - 1644
  • [25] Deep learning for wireless capsule endoscopy: a systematic review and meta-analysis
    Soffer, Shelly
    Klang, Eyal
    Shimon, Orit
    Nachmias, Noy
    Eliakim, Rami
    Ben-Horin, Shomron
    Kopylov, Uri
    Barash, Yiftach
    GASTROINTESTINAL ENDOSCOPY, 2020, 92 (04) : 831 - +
  • [26] Machine learning for prediction of viral hepatitis: A systematic review and meta-analysis
    Moulaei, Khadijeh
    Sharifi, Hamid
    Bahaadinbeigy, Kambiz
    Haghdoost, Ali Akbar
    Nasiri, Naser
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 179
  • [27] Machine/Deep Learning for Software Engineering: A Systematic Literature Review
    Wang, Simin
    Huang, Liguo
    Gao, Amiao
    Ge, Jidong
    Zhang, Tengfei
    Feng, Haitao
    Satyarth, Ishna
    Li, Ming
    Zhang, He
    Ng, Vincent
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (03) : 1188 - 1231
  • [28] Systematic literature review: machine learning for software fault prediction
    Navarro Cedeno, Gabriel Omar
    Cortes Moya, Katherine
    Somarribas Dormond, Ahmed
    Gonzalez-Torres, Antonio
    Rojas-Hernandez, Yenory
    2023 IEEE 41ST CENTRAL AMERICA AND PANAMA CONVENTION, CONCAPAN XLI, 2023, : 134 - 139
  • [29] A SYSTEMATIC REVIEW OF THE NETWORK META-ANALYSIS LITERATURE
    Chambers, J. D.
    Pyo, J.
    Winn, A.
    Neumann, P. J.
    VALUE IN HEALTH, 2013, 16 (03) : A47 - A48
  • [30] Software fault prediction using data mining, machine learning and deep learning techniques: A systematic literature review
    Batool, Iqra
    Khan, Tamim Ahmed
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 100