Feature Clustering and Ensemble Learning Based Approach for Software Defect Prediction

被引:0
|
作者
Srivastava R. [1 ]
Jain A.K. [1 ]
机构
[1] Department of Applied Mathematics, Delhi Technological University, Delhi
关键词
class imbalance; confidence interval; ensemble modelling; feature selection; hard voting; Software defects;
D O I
10.2174/2666255813999201109201259
中图分类号
学科分类号
摘要
Objective: Defects in delivered software products not only have financial implications but also affect the reputation of the organisation and lead to wastage of time and human re-sources. This paper aims to detect defects in software modules. Methods: Our approach sequentially combines SMOTE algorithm with K-means clustering algorithm to deal with class imbalance problem to obtain a set of key features based on the inter-class and intra-class coefficient of correlation and ensemble modeling to predict defects in software modules. After cautious examination, an ensemble framework of XGBoost, Decision Tree, and Random Forest is used for the prediction of software defects owing to numerous merits of the ensembling approach. Results: We have used five open-source datasets from NASA PROMISE repository for software engineering. The result obtained from our approach has been compared with that of individual algorithms used in the ensemble. A confidence interval for the accuracy of our approach with re-spect to performance evaluation metrics, namely accuracy, precision, recall, F1 score and AUC score, has also been constructed at a significance level of 0.01. Conclusion: Results have been depicted pictographically. © 2022 Bentham Science Publishers.
引用
收藏
页码:868 / 882
页数:14
相关论文
共 50 条
  • [21] Using Coding-Based Ensemble Learning to Improve Software Defect Prediction
    Sun, Zhongbin
    Song, Qinbao
    Zhu, Xiaoyan
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06): : 1806 - 1817
  • [22] Software Defect Prediction Approach Based on a Diversity Ensemble Combined With Neural Network
    Chen, Jinfu
    Xu, Jiaping
    Cai, Saihua
    Wang, Xiaoli
    Chen, Haibo
    Li, Zhehao
    IEEE TRANSACTIONS ON RELIABILITY, 2024, 73 (03) : 1487 - 1501
  • [23] Deep Semantic Feature Learning for Software Defect Prediction
    Wang, Song
    Liu, Taiyue
    Nam, Jaechang
    Tan, Lin
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2020, 46 (12) : 1267 - 1293
  • [24] A Feature Engineering and Ensemble Learning Based Approach for Repeated Buyers Prediction
    Zhang, Mingyang
    Lu, Jiayue
    Ma, Ning
    Cheng, T. C. Edwin
    Hua, Guowei
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2022, 17 (06)
  • [25] Software defect prediction using ensemble learning on selected features
    Laradji, Issam H.
    Alshayeb, Mohammad
    Ghouti, Lahouari
    INFORMATION AND SOFTWARE TECHNOLOGY, 2015, 58 : 388 - 402
  • [26] Hybrid SMOTE-Ensemble Approach for Software Defect Prediction
    Alsawalqah, Hamad
    Faris, Hossam
    Aljarah, Ibrahim
    Alnemer, Loai
    Alhindawi, Nouh
    SOFTWARE ENGINEERING TRENDS AND TECHNIQUES IN INTELLIGENT SYSTEMS, CSOC2017, VOL 3, 2017, 575 : 355 - 366
  • [27] Data and Ensemble Machine Learning Fusion Based Intelligent Software Defect Prediction System
    Abbas, Sagheer
    Aftab, Shabib
    Khan, Muhammad Adnan
    Ghazal, Taher M.
    Al Hamadi, Hussam
    Yeun, Chan Yeob
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (03): : 6083 - 6100
  • [28] LDFR: Learning deep feature representation for software defect prediction
    Xu, Zhou
    Li, Shuai
    Xu, Jun
    Liu, Jin
    Luo, Xiapu
    Zhang, Yifeng
    Zhang, Tao
    Keung, Jacky
    Tang, Yutian
    JOURNAL OF SYSTEMS AND SOFTWARE, 2019, 158
  • [29] Software Defect Prediction Scheme Based on Feature Selection
    Wang, Pei
    Jin, Cong
    Jin, Shu-Wei
    2012 INTERNATIONAL SYMPOSIUM ON INFORMATION SCIENCE AND ENGINEERING (ISISE), 2012, : 477 - 480
  • [30] Ensemble clustering based approach for software architecture recovery
    Puchala S.P.R.
    Chhabra J.K.
    Rathee A.
    International Journal of Information Technology, 2022, 14 (4) : 2013 - 2019