Optimized ensemble machine learning model for software bugs prediction

Cited by: 1

Authors:

Johnson, Femi [1]

Oluwatobi, Olayiwola [2]

Folorunso, Olusegun [1]

Ojumu, Alomaja Victor [3]

Quadri, Alatishe [2]

Affiliations:

[1] Fed Univ Agr, Dept Comp Sci, Abeokuta, Nigeria

[2] Olabisi Onabanjo Univ, Dept Comp Sci, Ago Iwoye, Nigeria

[3] Yaba Coll Technol, Dept Comp Technol, Lagos, Nigeria

Source:

INNOVATIONS IN SYSTEMS AND SOFTWARE ENGINEERING | 2023 / Vol. 19 / Issue 01

Keywords:

Software; Bugs detection; Vectorization; Logistic regression; Extra tree classifier; FAULT-PRONENESS; NEURAL-NETWORK; MODULES; NUMBER; PHASE;

DOI:

10.1007/s11334-022-00506-x

Chinese Library Classification (CLC) number:

TP31 [Computer Software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

Checks of software accuracy and efficiency are becoming of paramount interest to system users before utilization. As a result, twenty-first-century programmers are consciously developing less buggy, highly efficient, and robust software with a higher degree of accuracy. Occasionally, bugs that go undetected in large software, owing to the complexity of the code and other associated parametric attributes, cause hardware to malfunction. In this paper, an ensemble model of Logistic Regression and Extra Tree classifier algorithms is deployed on parametric software attributes for the accurate classification and prediction of software bugs. The implementation was performed on different platforms (WEKA, MATLAB and PyCharm) to determine the rate of memory utilization, optimize prediction time, maximize the model's efficiency and compare accuracy rankings among similar machine learning models. A publicly available software defects dataset from the National Aeronautics and Space Administration (NASA), containing 16,962 instances and 38 attributes for software defects prediction, was collected, pre-processed and used in the implementation of this study. The collected data were vectorized, subjected to principal component analysis (PCA) for dimension reduction based on ranking values and divided in the ratio 3:2 for training and testing of the ensemble model classifier, respectively, on new sets of buggy software datasets. The result from the ensemble model showed a significant increase in prediction accuracy, from 96.7% for the un-vectorized dataset to 97.8% for the vectorized dataset. An appreciable decrease in prediction time was also observed: 19.7 s for the vectorized dataset against the initial 26.9 s recorded for the un-vectorized dataset. In addition, memory utilization for vectorized datasets increased during the training phase due to the number of bits but decreased in the final testing phase of the software bug prediction. Overall, the accuracy of 97.8% recorded by the optimized ensemble model for buggy software prediction proved the model's capability to accurately classify and predict buggy software with efficient memory utilization at optimal time duration.
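The pipeline described in the abstract (dimension reduction with PCA, a 3:2 train/test split, and an ensemble of Logistic Regression and an Extra Trees classifier) can be sketched roughly as follows. This is only an illustrative sketch using scikit-learn, not the authors' implementation. The abstract does not say how the two classifiers are combined, so soft voting is an assumption here. The synthetic data from `make_classification` stands in for the NASA dataset, and the number of PCA components, the scaling step, and the helper names `build_model` and `run_demo` are hypothetical choices. The paper's vectorization step is not reproduced.

```python
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.ensemble import ExtraTreesClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def build_model(n_components=10, seed=0):
    """Scale, reduce with PCA, then classify with an LR + Extra Trees ensemble.

    Soft voting and n_components=10 are assumptions; the abstract gives neither.
    """
    ensemble = VotingClassifier(
        estimators=[
            ("lr", LogisticRegression(max_iter=1000)),
            ("et", ExtraTreesClassifier(n_estimators=100, random_state=seed)),
        ],
        voting="soft",  # average the predicted class probabilities
    )
    return make_pipeline(StandardScaler(), PCA(n_components=n_components), ensemble)


def run_demo(n_samples=2000, seed=0):
    """Train and evaluate on synthetic data with 38 attributes, split 3:2."""
    # Synthetic stand-in for the defect dataset: 38 features, imbalanced labels.
    X, y = make_classification(
        n_samples=n_samples,
        n_features=38,
        n_informative=12,
        weights=[0.8, 0.2],
        random_state=seed,
    )
    # A 3:2 ratio means two fifths of the instances are held out for testing.
    n_test = n_samples * 2 // 5
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=n_test, stratify=y, random_state=seed
    )
    model = build_model(seed=seed)
    model.fit(X_train, y_train)
    return len(X_train), len(X_test), model.score(X_test, y_test)


if __name__ == "__main__":
    n_train, n_test, accuracy = run_demo()
    print(f"train={n_train} test={n_test} accuracy={accuracy:.3f}")
```

Fitting PCA inside the pipeline means the projection is learned from the training split only, which avoids leaking information from the test split. Accuracy on synthetic data says nothing about the figures reported in the abstract.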

Pages: 91-101

Page count: 11
