Data and Ensemble Machine Learning Fusion Based Intelligent Software Defect Prediction System

Times cited: 3
Authors
Abbas, Sagheer [1]
Aftab, Shabib [1,2]
Khan, Muhammad Adnan [3,4]
Ghazal, Taher M. [5,6]
Al Hamadi, Hussam [7]
Yeun, Chan Yeob [8]
Affiliations
[1] Natl Coll Business Adm & Econ, Sch Comp Sci, Lahore 54000, Pakistan
[2] Virtual Univ Pakistan, Dept Comp Sci, Lahore 54000, Pakistan
[3] Gachon Univ, Fac Artificial Intelligence & Software, Dept Software, Seongnam 13120, South Korea
[4] Riphah Int Univ, Fac Comp, Riphah Sch Comp & Innovat, Lahore Campus, Lahore 54000, Pakistan
[5] Skyline Univ Coll, Sch Informat Technol, Sharjah, U Arab Emirates
[6] UKM, Fac Informat Sci & Technol, Ctr Cyber Secur, Bangi 43600, Selangor, Malaysia
[7] Univ Dubai, Coll Engn & IT, Al Ain 14143, U Arab Emirates
[8] Khalifa Univ, Ctr Cyber Phys Syst, EECS Dept, Abu Dhabi 127788, U Arab Emirates
Source
CMC-COMPUTERS MATERIALS & CONTINUA | 2023 / Vol. 75 / No. 03
Keywords
Ensemble machine learning fusion; software defect prediction; fuzzy logic; QUALITY; OVERLAP; MODEL;
DOI
10.32604/cmc.2023.037933
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
The software engineering field has long focused on creating high-quality software despite limited resources. Detecting defects before the testing stage of software development can enable quality assurance engineers to concentrate on problematic modules rather than all the modules. This approach can enhance the quality of the final product while lowering development costs. Identifying defective modules early on can allow for early corrections and ensure the timely delivery of a high-quality product that satisfies customers and instills greater confidence in the development team. This process is known as software defect prediction, and it can improve end-product quality while reducing the cost of testing and maintenance. This study proposes a software defect prediction system that utilizes data fusion, feature selection, and ensemble machine learning fusion techniques. A novel filter-based metric selection technique is proposed in the framework to select the optimum features. A three-step nested approach is presented for predicting defective modules to achieve high accuracy. In the first step, three supervised machine learning techniques, Decision Tree, Support Vector Machines, and Naive Bayes, are used to detect faulty modules. The second step integrates the predictive accuracy of these classification techniques through three ensemble machine learning methods: Bagging, Voting, and Stacking. Finally, in the third step, a fuzzy logic technique is employed to integrate the predictive accuracy of the ensemble machine learning techniques. The experiments are performed on a fused software defect dataset to ensure that the developed fused ensemble model can perform effectively on diverse datasets. Five NASA datasets are integrated to create the fused dataset: MW1, PC1, PC3, PC4, and CM1. According to the results, the proposed system outperformed other advanced techniques for predicting software defects, achieving an accuracy of 92.08%.
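The fusion steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the fuzzy membership functions, the rule base, or how the five datasets are merged, so the function names (`fuse_datasets`, `majority_vote`, `fuzzy_strengths`, `fuzzy_fuse`), the linear membership functions, and the "at least two ensembles agree" rule are all assumptions made for illustration.

```python
from collections import Counter


def fuse_datasets(*datasets):
    """Data fusion (assumed form): concatenate rows of datasets that
    share the same feature columns into a single dataset."""
    return [row for dataset in datasets for row in dataset]


def majority_vote(labels):
    """Hard voting: return the label predicted by most classifiers."""
    return Counter(labels).most_common(1)[0][0]


def fuzzy_strengths(p_bagging, p_voting, p_stacking):
    """Assumed fuzzy rule base over three ensemble outputs.

    Each input is a defect probability in [0, 1]. Membership in
    'defective' is taken as p and in 'clean' as 1 - p (an assumption).
    Rule: a module is defective if at least two ensembles say so,
    with min as fuzzy AND and max as fuzzy OR.
    """
    probs = (p_bagging, p_voting, p_stacking)
    pairs = [(0, 1), (0, 2), (1, 2)]
    defective = max(min(probs[i], probs[j]) for i, j in pairs)
    clean = max(min(1 - probs[i], 1 - probs[j]) for i, j in pairs)
    return defective, clean


def fuzzy_fuse(p_bagging, p_voting, p_stacking):
    """Return True if the 'defective' rule fires at least as strongly
    as the 'clean' rule."""
    defective, clean = fuzzy_strengths(p_bagging, p_voting, p_stacking)
    return defective >= clean


if __name__ == "__main__":
    # Hypothetical rows: (feature_1, feature_2, label)
    fused = fuse_datasets([(10, 2, 0)], [(55, 9, 1), (7, 1, 0)])
    print(len(fused))                 # number of rows after fusion
    print(majority_vote([1, 0, 1]))   # step-2 style hard vote
    print(fuzzy_fuse(0.9, 0.8, 0.2))  # step-3 style fuzzy decision
```

With min and max as the fuzzy operators, this rule base reduces to comparing the median of the three ensemble outputs against its complement, which is one simple way to combine the ensembles without favoring any single one.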
Pages: 6083 - 6100
Page count: 18
Related papers
50 in total
  • [1] Ensemble learning based software defect prediction
    Dong, Xin
    Liang, Yan
    Miyamoto, Shoichiro
    Yamaguchi, Shingo
    JOURNAL OF ENGINEERING RESEARCH, 2023, 11 (04): : 377 - 391
  • [2] A Cloud-Based Software Defect Prediction System Using Data and Decision-Level Machine Learning Fusion
    Aftab, Shabib
    Abbas, Sagheer
    Ghazal, Taher M.
    Ahmad, Munir
    Hamadi, Hussam Al
    Yeun, Chan Yeob
    Khan, Muhammad Adnan
    MATHEMATICS, 2023, 11 (03)
  • [3] Software Defect Prediction: A Machine Learning Approach with Voting Ensemble
    Mosquera, Marcela
    Hurtado, Remigio
    PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2024, VOL 3, 2024, 1013 : 585 - 595
  • [4] Software Defect Prediction Using an Intelligent Ensemble-Based Model
    Ali, Misbah
    Mazhar, Tehseen
    Arif, Yasir
    Al-Otaibi, Shaha
    Ghadi, Yazeed Yasin
    Shahzad, Tariq
    Khan, Muhammad Amir
    Hamam, Habib
    IEEE ACCESS, 2024, 12 : 20376 - 20395
  • [5] Machine Learning Empowered Software Defect Prediction System
    Daoud, Mohammad Sh.
    Aftab, Shabib
    Ahmad, Munir
    Khan, Muhammad Adnan
    Iqbal, Ahmed
    Abbas, Sagheer
    Iqbal, Muhammad
    Ihnaini, Baha
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 31 (02): : 1287 - 1300
  • [6] Software Defect Prediction Method Based on Clustering Ensemble Learning
    Tao, Hongwei
    Cao, Qiaoling
    Chen, Haoran
    Li, Yanting
    Niu, Xiaoxu
    Wang, Tao
    Geng, Zhenhao
    Shang, Songtao
    IET SOFTWARE, 2024, 2024
  • [7] Handling Imbalanced Data using Ensemble Learning in Software Defect Prediction
    Malhotra, Ruchika
    Jain, Juhi
    PROCEEDINGS OF THE CONFLUENCE 2020: 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING, 2020, : 300 - 304
  • [8] A study on software metrics based software defect prediction using data mining and machine learning techniques
    Prasad, Manjula C.M.
    Florence, Lilly
    Arya, Arti
    International Journal of Database Theory and Application, 2015, 8 (03): : 179 - 190
  • [9] Feature Clustering and Ensemble Learning Based Approach for Software Defect Prediction
    Srivastava R.
    Jain A.K.
    Recent Advances in Computer Science and Communications, 2022, 15 (06): : 868 - 882
  • [10] Software defect prediction based on weighted extreme learning machine
    Gai, Jinjing
    Zheng, Shang
    Yu, Hualong
    Yang, Hongji
    MULTIAGENT AND GRID SYSTEMS, 2020, 16 (01) : 67 - 82