Performance evaluation of software defect prediction with NASA dataset using machine learning techniques

被引:4
|
作者
Siddiqui T. [1 ]
Mustaqeem M. [1 ]
机构
[1] Department of Computer Science, Aligarh Muslim University (AMU), UP, Aligarh
关键词
Machine learning; Software defects prediction;
D O I
10.1007/s41870-023-01528-9
中图分类号
学科分类号
摘要
The software industry’s growth and increasing complexity have made software maintenance more challenging, with Software Defects (SD) being a significant contributor to quality degradation leading to resource wastage in terms of effort, time, and finances. If the SD is not considered in the early stages of software development, it occurs in any stage of the Software Development Life Cycle (SDLC). The proposed study focuses on improving software quality through Software Defect Prediction (SDP) using machine learning (ML) and data balancing techniques. To mitigate the problem of imbalanced datasets, which often lead to model overfitting, the authors employ the Synthetic Minority Oversampling Technique (SMOTE) combined with ML approaches. The assessment encompasses various ML techniques, including Random Forest, SVM, KNN, and LDA, on the balanced CM1 dataset of the NASA promise repository and evaluated performance using accuracy, precision, recall, F1-score, and AUC-ROC. Random Forest emerges as a standout performer, with an accuracy of 98.09% and an F1-score of 97.25%. SVM and KNN also demonstrate high accuracy rates of 97.71% and 97.56%, respectively, while LDA shows balanced performance with an accuracy of 96.04% and an F1-score of 95.93%. Notably, our study achieves significant performance improvements compared with the prior state-of-the-art. The author provides a roadmap to achieve improved performance and predictive capabilities in SDP, highlighting the novel contribution of the proposed study. These findings hold great potential for the software industry, offering solutions to enhance software quality and streamline development. © 2023, The Author(s), under exclusive licence to Bharati Vidyapeeth's Institute of Computer Applications and Management.
引用
收藏
页码:4131 / 4139
页数:8
相关论文
共 50 条
  • [21] PyTraceBugs: A Large Python']Python Code Dataset for Supervised Machine Learning in Software Defect Prediction
    Akimova, Elena N.
    Bersenev, Alexander Yu
    Deikov, Artem A.
    Kobylkin, Konstantin S.
    Konygin, Anton, V
    Mezentsev, Ilya P.
    Misilov, Vladimir E.
    2021 28TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2021), 2021, : 141 - 151
  • [22] Software defect prediction: A study on software metrics using statistical and machine learning methods
    Canaparo, Marco
    Ronchierr, Elisabetta
    Bertaccini, Gianluca
    INTERNATIONAL SYMPOSIUM ON GRIDS & CLOUDS 2022, 2022,
  • [23] Chip Performance Prediction Using Machine Learning Techniques
    Su, Min-Yan
    Lin, Wei-Chen
    Kuo, Yen-Ting
    Li, Chien-Mo
    Fang, Eric Jia-Wei
    Hsueh, Sung S-Y
    2021 INTERNATIONAL SYMPOSIUM ON VLSI DESIGN, AUTOMATION AND TEST (VLSI-DAT), 2021,
  • [24] Prediction of Employee Performance using Machine Learning Techniques
    Lather, Anu Singh
    Malhotra, Ruchika
    Saloni, Priya
    Singh, Prabhjot
    Mittal, Sarthak
    PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION SCIENCE AND SYSTEM, AISS 2019, 2019,
  • [25] Validating Unsupervised Machine Learning Techniques for Software Defect Prediction With Generic Metamorphic Testing
    Chan, Pak Yuen Patrick
    Keung, Jacky
    IEEE ACCESS, 2024, 12 : 165155 - 165172
  • [27] Improved prediction of software defects using ensemble machine learning techniques
    Mehta, Sweta
    Patnaik, K. Sridhar
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (16): : 10551 - 10562
  • [28] Improved prediction of software defects using ensemble machine learning techniques
    Sweta Mehta
    K. Sridhar Patnaik
    Neural Computing and Applications, 2021, 33 : 10551 - 10562
  • [29] An empirical study of software reliability prediction using machine learning techniques
    Kumar, Pradeep
    Singh, Yogesh
    International Journal of System Assurance Engineering and Management, 2012, 3 (03) : 194 - 208
  • [30] Machine Learning Empowered Software Defect Prediction System
    Daoud, Mohammad Sh.
    Aftab, Shabib
    Ahmad, Munir
    Khan, Muhammad Adnan
    Iqbal, Ahmed
    Abbas, Sagheer
    Iqbal, Muhammad
    Ihnaini, Baha
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 31 (02): : 1287 - 1300