Optimized ensemble machine learning model for software bugs prediction

Cited by: 1
Authors
Johnson, Femi [1]
Oluwatobi, Olayiwola [2]
Folorunso, Olusegun [1]
Ojumu, Alomaja Victor [3]
Quadri, Alatishe [2]
Affiliations
[1] Fed Univ Agr, Dept Comp Sci, Abeokuta, Nigeria
[2] Olabisi Onabanjo Univ, Dept Comp Sci, Ago Iwoye, Nigeria
[3] Yaba Coll Technol, Dept Comp Technol, Lagos, Nigeria
Keywords
Software; Bugs detection; Vectorization; Logistic regression; Extra tree classifier; FAULT-PRONENESS; NEURAL-NETWORK; MODULES; NUMBER; PHASE;
DOI
10.1007/s11334-022-00506-x
Chinese Library Classification number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Users increasingly expect software to be checked for accuracy and efficiency before they adopt it. As a result, twenty-first-century programmers consciously aim to develop less buggy, highly efficient, and robust software with a higher degree of accuracy. Occasionally, bugs that go undetected in large software, owing to code complexity and other associated parametric attributes, cause hardware to malfunction. In this paper, an ensemble model of Logistic Regression and Extra Tree classifier algorithms is deployed on parametric software attributes for the accurate classification and prediction of software bugs. The implementation was performed on different platforms (WEKA, MATLAB and PyCharm) to determine the rate of memory utilization, optimize prediction time, maximize the model's efficiency and compare accuracy rankings among similar machine learning models. A publicly available software defects dataset from the National Aeronautics and Space Administration (NASA), containing 16,962 instances and 38 attributes, was collected, pre-processed and used in this study. The collected data were vectorized, subjected to principal component analysis (PCA) for dimension reduction based on ranking values, and divided in the ratio 3:2 for training and testing of the ensemble classifier, respectively, on new sets of buggy software datasets. The ensemble model's prediction accuracy increased from 96.7% on the un-vectorized dataset to 97.8% on the vectorized dataset. Prediction time also decreased appreciably, from 26.9 s for the un-vectorized dataset to 19.7 s for the vectorized dataset. In addition, memory utilization for the vectorized dataset increased during the training phase due to the number of bits but fell during the final testing phase of the software bug prediction. Overall, the accuracy of 97.8% recorded by the optimized ensemble model demonstrated its capability to classify and predict buggy software accurately, with efficient memory utilization and in optimal time.
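The pipeline outlined in the abstract (PCA for dimension reduction, a 3:2 train/test split, and an ensemble of Logistic Regression and an Extra Trees classifier) might be sketched roughly as follows. This is an illustrative sketch, not the authors' implementation: it assumes scikit-learn, uses synthetic data in place of the NASA dataset, and combines the two classifiers by soft voting. The abstract does not say how the models are combined, so the voting scheme, the number of PCA components, and all hyperparameters here are assumptions.

```python
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.ensemble import ExtraTreesClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the defect data (assumption): 38 numeric
# attributes and a binary "buggy / not buggy" label.
X, y = make_classification(
    n_samples=2000, n_features=38, n_informative=10,
    weights=[0.8, 0.2], random_state=0,
)

# A 3:2 train/test ratio corresponds to a 60% / 40% split.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.4, stratify=y, random_state=0,
)

# Ensemble of the two classifiers named in the abstract.
# Soft voting (averaging predicted probabilities) is an assumed choice.
ensemble = VotingClassifier(
    estimators=[
        ("lr", LogisticRegression(max_iter=1000)),
        ("et", ExtraTreesClassifier(n_estimators=200, random_state=0)),
    ],
    voting="soft",
)

# Scale, reduce dimensionality with PCA (10 components is arbitrary),
# then fit the ensemble.
model = make_pipeline(StandardScaler(), PCA(n_components=10), ensemble)
model.fit(X_train, y_train)

accuracy = model.score(X_test, y_test)
print(f"held-out accuracy: {accuracy:.3f}")
```

The accuracy printed here reflects only the synthetic data and is not comparable to the 96.7% and 97.8% figures reported in the abstract.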
Pages: 91-101
Page count: 11
Source
Innovations in Systems and Software Engineering, 2023, 19: 91-101