Performance evaluation of software defect prediction with NASA dataset using machine learning techniques

被引:4
|
作者
Siddiqui T. [1 ]
Mustaqeem M. [1 ]
机构
[1] Department of Computer Science, Aligarh Muslim University (AMU), UP, Aligarh
关键词
Machine learning; Software defects prediction;
D O I
10.1007/s41870-023-01528-9
中图分类号
学科分类号
摘要
The software industry’s growth and increasing complexity have made software maintenance more challenging, with Software Defects (SD) being a significant contributor to quality degradation leading to resource wastage in terms of effort, time, and finances. If the SD is not considered in the early stages of software development, it occurs in any stage of the Software Development Life Cycle (SDLC). The proposed study focuses on improving software quality through Software Defect Prediction (SDP) using machine learning (ML) and data balancing techniques. To mitigate the problem of imbalanced datasets, which often lead to model overfitting, the authors employ the Synthetic Minority Oversampling Technique (SMOTE) combined with ML approaches. The assessment encompasses various ML techniques, including Random Forest, SVM, KNN, and LDA, on the balanced CM1 dataset of the NASA promise repository and evaluated performance using accuracy, precision, recall, F1-score, and AUC-ROC. Random Forest emerges as a standout performer, with an accuracy of 98.09% and an F1-score of 97.25%. SVM and KNN also demonstrate high accuracy rates of 97.71% and 97.56%, respectively, while LDA shows balanced performance with an accuracy of 96.04% and an F1-score of 95.93%. Notably, our study achieves significant performance improvements compared with the prior state-of-the-art. The author provides a roadmap to achieve improved performance and predictive capabilities in SDP, highlighting the novel contribution of the proposed study. These findings hold great potential for the software industry, offering solutions to enhance software quality and streamline development. © 2023, The Author(s), under exclusive licence to Bharati Vidyapeeth's Institute of Computer Applications and Management.
引用
收藏
页码:4131 / 4139
页数:8
相关论文
共 50 条
  • [1] Performance Analysis of Machine Learning Techniques on Software Defect Prediction using NASA Datasets
    Iqbal, Ahmed
    Aftab, Shabib
    Ali, Umair
    Nawaz, Zahid
    Sana, Laraib
    Ahmad, Munir
    Husen, Arif
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (05) : 300 - 308
  • [2] Software Defect Prediction on Unlabelled Dataset with Machine Learning Techniques
    Ronchieri, Elisabetta
    Canaparo, Marco
    Belgiovine, Mauro
    Salomoni, Davide
    2019 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE (NSS/MIC), 2019,
  • [3] Software Defect Prediction Analysis Using Machine Learning Techniques
    Khalid, Aimen
    Badshah, Gran
    Ayub, Nasir
    Shiraz, Muhammad
    Ghouse, Mohamed
    SUSTAINABILITY, 2023, 15 (06)
  • [4] Towards Effective Software Defect Prediction Using Machine Learning Techniques
    Akshat Pandey
    Akshay Jadhav
    SN Computer Science, 5 (8)
  • [5] Performance Evaluation of Some Machine Learning Algorithms in NASA Defect Prediction Data Sets
    Aydin, Zeynep Behrin Guven
    Samli, Ruya
    2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2020, : 295 - 297
  • [6] On Software Defect Prediction Using Machine Learning
    Ren, Jinsheng
    Qin, Ke
    Ma, Ying
    Luo, Guangchun
    JOURNAL OF APPLIED MATHEMATICS, 2014,
  • [7] An empirical framework for defect prediction using machine learning techniques with Android software
    Malhotra, Ruchika
    APPLIED SOFT COMPUTING, 2016, 49 : 1034 - 1050
  • [8] Software Defect Prediction for Healthcare Big Data: An Empirical Evaluation of Machine Learning Techniques
    Khan, Bilal
    Naseem, Rashid
    Shah, Muhammad Arif
    Wakil, Karzan
    Khan, Atif
    Uddin, M. Irfan
    Mahmoud, Marwan
    JOURNAL OF HEALTHCARE ENGINEERING, 2021, 2021 (2021)
  • [9] Software defect identification using machine learning techniques
    Ceylan, Evren
    Kudubay, F. Onur
    Bener, Ayse B.
    32ND EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA) - PROCEEDINGS, 2006, : 240 - +
  • [10] Software Defect Prediction Using Supervised Machine Learning Techniques: A Systematic Literature Review
    Matloob, Faseeha
    Aftab, Shabib
    Ahmad, Munir
    Khan, Muhammad Adnan
    Fatima, Areej
    Iqbal, Muhammad
    Alruwaili, Wesam Mohsen
    Elmitwally, Nouh Sabri
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2021, 29 (02): : 403 - 421