Machine learning based methods for software fault prediction: A survey

被引:56
|
作者
Pandey, Sushant Kumar [1 ]
Mishra, Ravi Bhushan [1 ]
Tripathi, Anil Kumar [1 ]
机构
[1] Indian Inst Technol BHU, Dept Comp Sci & Engn, Varanasi, Uttar Pradesh, India
关键词
Machine learning; Fault proneness; Statistical techniques; Fault prediction; Systematic literature review; DEFECT PREDICTION; EMPIRICAL-ANALYSIS; FEATURE-SELECTION; MODEL; QUALITY; METRICS; CLASSIFICATION; PRONENESS; FRAMEWORK; REGRESSION;
D O I
10.1016/j.eswa.2021.114595
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Several prediction approaches are contained in the arena of software engineering such as prediction of effort, security, quality, fault, cost, and re-usability. All these prediction approaches are still in the rudimentary phase. Experiments and research are conducting to build a robust model. Software Fault Prediction (SFP) is the process to develop the model which can be utilized by software practitioners to detect faulty classes/module before the testing phase. Prediction of defective modules before the testing phase will help the software development team leader to allocate resources more optimally and it reduces the testing effort. In this article, we present a Systematic Literature Review (SLR) of various studies from 1990 to June 2019 towards applying machine learning and statistical method over software fault prediction. We have cited 208 research articles, in which we studied 154 relevant articles. We investigated the competence of machine learning in existing datasets and research projects. To the best of our knowledge, the existing SLR considered only a few parameters over SFP?s performance, and they partially examined the various threats and challenges of SFP techniques. In this article, we aggregated those parameters and analyzed them accordingly, and we also illustrate the different challenges in the SFP domain. We also compared the performance between machine learning and statistical techniques based on SFP models. Our empirical study and analysis demonstrate that the prediction ability of machine learning techniques for classifying class/module as fault/non-fault prone is better than classical statistical models. The performance of machine learning-based SFP methods over fault susceptibility is better than conventional statistical purposes. The empirical evidence of our survey reports that the machine learning techniques have the capability, which can be used to identify fault proneness, and able to form well-generalized result. We have also investigated a few challenges in fault prediction discipline, i.e., quality of data, over-fitting of models, and class imbalance problem. We have also summarized 154 articles in a tabular form for quick identification.
引用
收藏
页数:35
相关论文
共 50 条
  • [31] A survey on fault diagnosis of rotating machinery based on machine learning
    Wang, Qi
    Huang, Rui
    Xiong, Jianbin
    Yang, Jianxiang
    Dong, Xiangjun
    Wu, Yipeng
    Wu, Yinbo
    Lu, Tiantian
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (10)
  • [32] Survey on Software Vulnerability Analysis method based on Machine Learning
    Gong Jie
    Kuang Xiao-hui
    Liu Qiang
    2016 IEEE FIRST INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC 2016), 2016, : 642 - 647
  • [33] Scientific programming using optimized machine learning techniques for software fault prediction to improve software quality
    Shafiq, Muhammad
    Alghamedy, Fatemah H.
    Jamal, Nasir
    Kamal, Tahir
    Daradkeh, Yousef Ibrahim
    Shabaz, Mohammad
    IET SOFTWARE, 2023, 17 (04) : 694 - 704
  • [34] Fault prediction of railway track circuit based on machine learning
    Zhang, Xin
    Ru, Yan
    INTERNATIONAL JOURNAL OF SENSOR NETWORKS, 2024, 45 (04) : 216 - 228
  • [35] Intelligent energy meter fault prediction based on machine learning
    Li Helong
    Yu Haibo
    Yuan Jinshuai
    2019 15TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2019), 2019, : 296 - 300
  • [36] AI-Based on Machine Learning Methods for Urban Real Estate Prediction: A Systematic Survey
    Tekouabou, Stephane C. K.
    Gherghina, Stefan Cristian
    Kameni, Eric Desire
    Filali, Youssef
    Gartoumi, Khalil Idrissi
    ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2024, 31 (02) : 1079 - 1095
  • [37] AI-Based on Machine Learning Methods for Urban Real Estate Prediction: A Systematic Survey
    Stéphane C. K. Tekouabou
    Ştefan Cristian Gherghina
    Eric Désiré Kameni
    Youssef Filali
    Khalil Idrissi Gartoumi
    Archives of Computational Methods in Engineering, 2024, 31 : 1079 - 1095
  • [38] Comprehensive Survey of different Machine Learning Algorithms used for Software Defect Prediction
    Sharadhi, A. K.
    Gururaj, Vybhavi
    Umadi, Keerti R.
    Kumar, Mushkan
    Shankar, Sahana P.
    Varadam, Deepak
    2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, : 425 - 430
  • [39] Machine learning based success prediction for crowdsourcing software projects
    Illahi, Inam
    Liu, Hui
    Umer, Qasim
    Niu, Nan
    JOURNAL OF SYSTEMS AND SOFTWARE, 2021, 178
  • [40] Software defect prediction based on weighted extreme learning machine
    Gai, Jinjing
    Zheng, Shang
    Yu, Hualong
    Yang, Hongji
    MULTIAGENT AND GRID SYSTEMS, 2020, 16 (01) : 67 - 82