Feature Clustering and Ensemble Learning Based Approach for Software Defect Prediction

Authors
Srivastava R. [1]
Jain A.K. [1]
Affiliation
[1] Department of Applied Mathematics, Delhi Technological University, Delhi
Keywords
class imbalance; confidence interval; ensemble modelling; feature selection; hard voting; software defects
DOI
10.2174/2666255813999201109201259
Abstract
Objective: Defects in delivered software products not only have financial implications but also damage the reputation of the organisation and waste time and human resources. This paper aims to detect defects in software modules. Methods: Our approach sequentially applies the SMOTE algorithm to deal with the class imbalance problem, the K-means clustering algorithm to obtain a set of key features based on the inter-class and intra-class coefficients of correlation, and ensemble modeling to predict defects in software modules. After careful examination, an ensemble framework of XGBoost, Decision Tree, and Random Forest is used for the prediction of software defects, owing to the numerous merits of the ensembling approach. Results: We used five open-source datasets from the NASA PROMISE repository for software engineering. The results obtained from our approach were compared with those of the individual algorithms used in the ensemble. Confidence intervals for the performance evaluation metrics of our approach, namely accuracy, precision, recall, F1 score and AUC score, were also constructed at a significance level of 0.01. Conclusion: Results have been depicted graphically. © 2022 Bentham Science Publishers.
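The abstract names hard voting over three classifiers and a confidence interval at a significance level of 0.01, but gives no implementation details. The following is a minimal sketch of those two steps only, not the authors' code. It assumes hand-written stand-in label lists (`model_a`, `model_b`, `model_c`) in place of real XGBoost, Decision Tree, and Random Forest outputs, and it assumes a normal-approximation (Wald) interval, since the record does not say which interval method the paper uses. All function names are hypothetical.

```python
from collections import Counter
from math import sqrt
from statistics import NormalDist


def hard_vote(predictions_per_model):
    """Majority vote over per-model label lists (one list per model)."""
    voted = []
    for labels in zip(*predictions_per_model):
        # most_common(1) returns the label with the highest count;
        # with three binary voters a tie cannot occur.
        voted.append(Counter(labels).most_common(1)[0][0])
    return voted


def accuracy(y_true, y_pred):
    """Fraction of positions where the predicted label matches the true one."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def accuracy_confidence_interval(acc, n, alpha=0.01):
    """Wald interval for an accuracy measured on n samples (assumed method)."""
    z = NormalDist().inv_cdf(1 - alpha / 2)  # two-sided critical value
    half_width = z * sqrt(acc * (1 - acc) / n)
    # Clamp to [0, 1] because the normal approximation can overshoot.
    return max(0.0, acc - half_width), min(1.0, acc + half_width)


# Toy labels (1 = defective, 0 = clean), invented for illustration only.
y_true = [1, 0, 0, 1, 0, 1, 0, 0, 1, 0]
model_a = [1, 0, 0, 1, 0, 0, 0, 0, 1, 0]  # stand-in for one base learner
model_b = [1, 0, 1, 1, 0, 1, 0, 1, 0, 0]  # stand-in for a second learner
model_c = [0, 0, 0, 1, 0, 1, 0, 1, 1, 0]  # stand-in for a third learner

y_vote = hard_vote([model_a, model_b, model_c])
acc = accuracy(y_true, y_vote)
low, high = accuracy_confidence_interval(acc, len(y_true), alpha=0.01)
print(y_vote, acc, (low, high))
```

With only ten toy samples the interval is very wide and is clamped at 1.0, which is why a normal approximation is usually reserved for much larger test sets. In practice the three label lists would come from fitted classifiers evaluated on held-out modules.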
Pages: 868-882
Page count: 14