Feature Clustering and Ensemble Learning Based Approach for Software Defect Prediction

被引:0
|
作者
Srivastava R. [1 ]
Jain A.K. [1 ]
机构
[1] Department of Applied Mathematics, Delhi Technological University, Delhi
关键词
class imbalance; confidence interval; ensemble modelling; feature selection; hard voting; Software defects;
D O I
10.2174/2666255813999201109201259
中图分类号
学科分类号
摘要
Objective: Defects in delivered software products not only have financial implications but also affect the reputation of the organisation and lead to wastage of time and human re-sources. This paper aims to detect defects in software modules. Methods: Our approach sequentially combines SMOTE algorithm with K-means clustering algorithm to deal with class imbalance problem to obtain a set of key features based on the inter-class and intra-class coefficient of correlation and ensemble modeling to predict defects in software modules. After cautious examination, an ensemble framework of XGBoost, Decision Tree, and Random Forest is used for the prediction of software defects owing to numerous merits of the ensembling approach. Results: We have used five open-source datasets from NASA PROMISE repository for software engineering. The result obtained from our approach has been compared with that of individual algorithms used in the ensemble. A confidence interval for the accuracy of our approach with re-spect to performance evaluation metrics, namely accuracy, precision, recall, F1 score and AUC score, has also been constructed at a significance level of 0.01. Conclusion: Results have been depicted pictographically. © 2022 Bentham Science Publishers.
引用
收藏
页码:868 / 882
页数:14
相关论文
共 50 条
  • [41] Dictionary Learning Based Software Defect Prediction
    Jing, Xiao-Yuan
    Ying, Shi
    Zhang, Zhi-Wu
    Wu, Shan-Shan
    Liu, Jin
    36TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2014), 2014, : 414 - 423
  • [42] Software defect prediction using learning to rank approach
    Nassif, Ali Bou
    Talib, Manar Abu
    Azzeh, Mohammad
    Alzaabi, Shaikha
    Khanfar, Rawan
    Kharsa, Ruba
    Angelis, Lefteris
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [43] A New Learning to Rank Approach for Software Defect Prediction
    Al-omari, Sara
    Elsheikh, Yousef
    Azzeh, Mohammed
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (08) : 805 - 812
  • [44] Software defect prediction using learning to rank approach
    Ali Bou Nassif
    Manar Abu Talib
    Mohammad Azzeh
    Shaikha Alzaabi
    Rawan Khanfar
    Ruba Kharsa
    Lefteris Angelis
    Scientific Reports, 13
  • [45] A Learning-to-Rank Approach to Software Defect Prediction
    Yang, Xiaoxing
    Tang, Ke
    Yao, Xin
    IEEE TRANSACTIONS ON RELIABILITY, 2015, 64 (01) : 234 - 246
  • [46] A Package Based Clustering for Enhancing Software Defect Prediction Accuracy
    Islam, Rayhanul
    Sakib, Kazi
    2014 17TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2014, : 81 - 86
  • [47] Neighborhood Approximate Reducts-Based Ensemble Learning Algorithm and Its Application in Software Defect Prediction
    Yang, Zhiyong
    Du, Junwei
    Hu, Qiang
    Jiang, Feng
    ROUGH SETS, IJCRS 2022, 2022, 13633 : 100 - 113
  • [48] Software Defect Prediction Using an Intelligent Ensemble-Based Model
    Ali, Misbah
    Mazhar, Tehseen
    Arif, Yasir
    Al-Otaibi, Shaha
    Ghadi, Yazeed Yasin
    Shahzad, Tariq
    Khan, Muhammad Amir
    Hamam, Habib
    IEEE ACCESS, 2024, 12 : 20376 - 20395
  • [49] SMOTE-Based Homogeneous Ensemble Methods for Software Defect Prediction
    Balogun, Abdullateef O.
    Lafenwa-Balogun, Fatimah B.
    Mojeed, Hammed A.
    Adeyemo, Victor E.
    Akande, Oluwatobi N.
    Akintola, Abimbola G.
    Bajeh, Amos O.
    Usman-Hamza, Fatimah E.
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2020, PT VI, 2020, 12254 : 615 - 631
  • [50] Reinforcing defect prediction: a reinforcement learning approach to mitigate class imbalance in software defect prediction
    Mahfooz Alam
    Mohd Mustaqeem
    Iran Journal of Computer Science, 2025, 8 (1) : 151 - 162