Influence Analysis Method of Class Imbalance on Software Defect Prediction Model Stability and Prediction Performance

Authors
Zhang, Yan-Mei [1,2]
Zhi, Sheng-Lin [3]
Jiang, Shu-Juan [1,2]
Yuan, Guan [1,2]
Affiliations
[1] Mine Digitization Engineering Research Center, The Ministry of Education, China University of Mining and Technology, Jiangsu, Xuzhou, 221116, China
[2] School of Computer Science and Technology, China University of Mining and Technology, Jiangsu, Xuzhou, 221116, China
[3] KeHua Data Co., Ltd., Guangdong, Shenzhen, 518055, China
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China
Keywords
Classification (of information) - Defects - Forecasting;
DOI
10.12263/DZXB.20210911
Abstract
This paper proposes a method for analyzing how class imbalance affects the stability and prediction performance of software defect prediction models. First, undersampling is applied to the original data set to construct a group of new data sets whose imbalance ratios are lower than that of the original data set. Fixed random seeds are used during construction so that, for a given original data set and imbalance ratio, the constructed data set is identical on every run, which reduces run-to-run randomness in the results. Second, the Matthews correlation coefficient (MCC) is used as the performance evaluation indicator of the prediction model: each newly generated data set is fed to the model's classification algorithm for training and prediction evaluation, yielding the MCC value at each imbalance ratio for the current data set. A performance stability evaluation indicator is also proposed. The experimental results show that, under class imbalance, MCC is more suitable than AUC as the stability evaluation indicator for software defect prediction models, and that cost-sensitive models achieve more stable defect prediction performance than ensemble models. © 2023 Chinese Institute of Electronics. All rights reserved.
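The two steps in the abstract (seeded undersampling to a lower class ratio, then scoring with MCC at each ratio) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the ratio is assumed to mean non-defective modules per defective module, the data are synthetic, `threshold_predict` is a hypothetical stand-in for a real classifier, and scoring on the same subset used for fitting is a simplification. The paper's stability indicator is not specified in the abstract and is not reproduced here.

```python
import math
import random


def undersample(labels, target_ratio, seed=0):
    """Indices of a subset with about target_ratio non-defective per defective module."""
    rng = random.Random(seed)  # fixed seed: the same subset on every run
    minority = [i for i, y in enumerate(labels) if y == 1]
    majority = [i for i, y in enumerate(labels) if y == 0]
    keep = min(len(majority), round(target_ratio * len(minority)))
    return sorted(minority + rng.sample(majority, keep))


def mcc(y_true, y_pred):
    """Matthews correlation coefficient for binary labels (0.0 if undefined)."""
    pairs = list(zip(y_true, y_pred))
    tp = pairs.count((1, 1))
    tn = pairs.count((0, 0))
    fp = pairs.count((0, 1))
    fn = pairs.count((1, 0))
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


def threshold_predict(features, labels):
    """Stand-in classifier (hypothetical): split at the midpoint of the class means."""
    pos = [x for x, y in zip(features, labels) if y == 1]
    neg = [x for x, y in zip(features, labels) if y == 0]
    cut = (sum(pos) / len(pos) + sum(neg) / len(neg)) / 2
    return [1 if x > cut else 0 for x in features]


# Synthetic data (made up): 20 defective and 180 non-defective modules, one metric.
data_rng = random.Random(7)
labels = [1] * 20 + [0] * 180
features = [data_rng.gauss(1.5 if y else 0.0, 1.0) for y in labels]

# Evaluate at several assumed ratios, each below the original 9:1.
for ratio in (8, 4, 2, 1):
    idx = undersample(labels, ratio, seed=42)
    xs = [features[i] for i in idx]
    ys = [labels[i] for i in idx]
    print(f"ratio {ratio}:1  n={len(idx)}  MCC={mcc(ys, threshold_predict(xs, ys)):.3f}")
```

Because the seed is fixed, rerunning the loop selects the same modules at each ratio, so any change in MCC between runs would come from the classifier rather than from the sampling.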
Pages: 2076-2087