Neighbor cleaning learning based cost-sensitive ensemble learning approach for software defect prediction

被引:1
|
作者
Li, Li [1 ]
Su, Renjia [1 ]
Zhao, Xin [1 ]
机构
[1] Northeast Forestry Univ, Sch Comp & Control Engn, Harbin, Peoples R China
来源
关键词
class imbalance; class overlap; cost-sensitive learning; machine learning; software defect prediction;
D O I
10.1002/cpe.8017
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The class imbalance problem in software defect prediction datasets leads to prediction results that are biased toward the majority class, and the class overlap problem leads to fuzzy boundaries for classification decisions, both of which affect the model's prediction performance on the dataset. A neighbor cleaning learning (NCL) is an effective technique for defect prediction. To solve the class overlap problem and class imbalance problem, the NCL-based cost-sensitive ensemble learning approach for software defect prediction (NCL_CSEL) model is proposed. First, the bootstrap resampled data are trained using the base classifier. Subsequently, multiple classifiers are integrated by a static ensemble to obtain the final classification results. As the base classifier, the Adaptive Boosting (AdaBoost) classifier combining NCL and cost-sensitive learning is proposed, and the class overlap problem and class imbalance problem are solved by balancing the proportion of overlap sample removal in NCL and the size of the cost factor in cost-sensitive learning. Specifically, the NCL algorithm is used to initialize the sample weights, while the cost-sensitive method is employed to update the sample weights. Experiments based on the NASA dataset and AEEEM dataset show that the defect prediction model can improve the bal value by approximately 7% and the AUC value by 9.5% when the NCL algorithm is added. NCL_CSEL can effectively solve the class imbalance problem and significantly improve the prediction performance compared with existing methods for solving the class imbalance problem.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Software Defect Prediction Based on Cost-Sensitive Dictionary Learning
    Wan, Hongyan
    Wu, Guoqing
    Yu, Mali
    Yuan, Mengting
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2019, 29 (09) : 1219 - 1243
  • [2] Cost-sensitive Dictionary Learning for Software Defect Prediction
    Liang Niu
    Jianwu Wan
    Hongyuan Wang
    Kaiwei Zhou
    Neural Processing Letters, 2020, 52 : 2415 - 2449
  • [3] Cost-sensitive Dictionary Learning for Software Defect Prediction
    Niu, Liang
    Wan, Jianwu
    Wang, Hongyuan
    Zhou, Kaiwei
    NEURAL PROCESSING LETTERS, 2020, 52 (03) : 2415 - 2449
  • [4] Two-Stage Cost-Sensitive Learning for Software Defect Prediction
    Liu, Mingxia
    Miao, Linsong
    Zhang, Daoqiang
    IEEE TRANSACTIONS ON RELIABILITY, 2014, 63 (02) : 676 - 686
  • [5] Discriminating features-based cost-sensitive approach for software defect prediction
    Ali, Aftab
    Khan, Naveed
    Abu-Tair, Mamun
    Noppen, Joost
    McClean, Sally
    McChesney, Ian
    AUTOMATED SOFTWARE ENGINEERING, 2021, 28 (02)
  • [6] Discriminating features-based cost-sensitive approach for software defect prediction
    Aftab Ali
    Naveed Khan
    Mamun Abu-Tair
    Joost Noppen
    Sally McClean
    Ian McChesney
    Automated Software Engineering, 2021, 28
  • [7] Cost-Sensitive Ensemble Learning for Venture Capital Exit Prediction
    Fang, Heng
    Ma, Ding
    2024 16TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, ICMLC 2024, 2024, : 161 - 167
  • [8] Misclassification Cost-Sensitive Software Defect Prediction
    Xu, Ling
    Wang, Bei
    Liu, Ling
    Zhou, Mo
    Liao, Shengping
    Yan, Meng
    2018 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2018, : 256 - 263
  • [9] Cost-sensitive learning for defect escalation
    Sheng, Victor S.
    Gu, Bin
    Fang, Wei
    Wu, Jian
    KNOWLEDGE-BASED SYSTEMS, 2014, 66 : 146 - 155
  • [10] Ensemble learning based software defect prediction
    Dong, Xin
    Liang, Yan
    Miyamoto, Shoichiro
    Yamaguchi, Shingo
    JOURNAL OF ENGINEERING RESEARCH, 2023, 11 (04): : 377 - 391