Adaptive Centre-Weighted Oversampling for Class Imbalance in Software Defect Prediction

被引:3
|
作者
Zhao, Qi [1 ]
Yan, Xuefeng [1 ,2 ]
Zhou, Yong [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Jiangsu, Peoples R China
[2] Collaborat Innovat Ctr Novel Software Technol & I, Nanjing, Jiangsu, Peoples R China
关键词
software defect prediction; class imbalance; oversampling; adaptive centre; weights; SMOTE; ALGORITHM;
D O I
10.1109/BDCloud.2018.00044
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In the field of software engineering, software defect prediction can maintain the high quality of software products, which is a popular current research topic. However, class imbalance affects the overall classification accuracy of software defect prediction models which is the key issue to be resolved. A new method called adaptive centre-weighted oversampling (ACWO) is proposed to effectively address imbalanced learning problems. First, an appropriate neighborhood size and neighbors are determined for each minority class sample. Then, for a minority class sample, the adaptive centre that is within its neighborhood range, its neighbors and the minority class sample are used to generate synthetic samples. Finally, oversampling of each minority class sample is carried out based on the weights assigned to them. These weights are obtained according to the neighborhood sizes and Euclidean distances to the centre. Afterwards, the software defect prediction model is eventually established by ACWO algorithm with stacked denoising autoencoder neural network. Experimental results show that the software defect prediction model based on ACWO algorithm has a better performance than based on many existing class imbalance learning algorithms according to the precision P, recall R, F1 measure, G-mean, and AUC values.
引用
收藏
页码:223 / 230
页数:8
相关论文
共 50 条
  • [1] An Ensemble Oversampling Model for Class Imbalance Problem in Software Defect Prediction
    Huda, Shamsul
    Liu, Kevin
    Abdelrazek, Mohamed
    Ibrahim, Amani
    Alyahya, Sultan
    Al-Dossari, Hmood
    Ahmad, Shafiq
    [J]. IEEE ACCESS, 2018, 6 : 24184 - 24195
  • [2] Support Vector based Oversampling Technique for Handling Class Imbalance in Software Defect Prediction
    Malhotra, Ruchika
    Agrawal, Vaibhav
    Pal, Vedansh
    Agarwal, Tushar
    [J]. 2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 1078 - 1083
  • [3] MAHAKIL: Diversity Based Oversampling Approach to Alleviate the Class Imbalance Issue in Software Defect Prediction
    Benni, Kwabena Ebo
    Keung, Jacky
    Phannachitta, Passakorn
    Monden, Akito
    Mensah, Solomon
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2018, 44 (06) : 534 - 550
  • [4] Genetic algorithm-based oversampling approach to prune the class imbalance issue in software defect prediction
    Arun, C.
    Lakshmi, C.
    [J]. SOFT COMPUTING, 2022, 26 (23) : 12915 - 12931
  • [5] Genetic algorithm-based oversampling approach to prune the class imbalance issue in software defect prediction
    C. Arun
    C. Lakshmi
    [J]. Soft Computing, 2022, 26 : 12915 - 12931
  • [6] COSTE: Complexity-based OverSampling TEchnique to alleviate the class imbalance problem in software defect prediction
    Feng, Shuo
    Keung, Jacky
    Yu, Xiao
    Xiao, Yan
    Bennin, Kwabena Ebo
    Kabir, Md Alamgir
    Zhang, Miao
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2021, 129
  • [7] MAHAKIL: Diversity based Oversampling Approach to Alleviate the Class Imbalance Issue in Software Defect Prediction Extended Abstract
    Bennin, Kwabena E.
    Keung, Jacky
    Phannachitta, Passakorn
    Monden, Akito
    Mensah, Solomon
    [J]. PROCEEDINGS 2018 IEEE/ACM 40TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2018, : 699 - 699
  • [8] Using Class Imbalance Learning for Software Defect Prediction
    Wang, Shuo
    Yao, Xin
    [J]. IEEE TRANSACTIONS ON RELIABILITY, 2013, 62 (02) : 434 - 443
  • [9] Class Imbalance Data-Generation for Software Defect Prediction
    Li, Zheng
    Zhang, Xingyao
    Guo, Junxia
    Shang, Ying
    [J]. 2019 26TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC), 2019, : 276 - 283
  • [10] Tackling class overlap and imbalance problems in software defect prediction
    Lin Chen
    Bin Fang
    Zhaowei Shang
    Yuanyan Tang
    [J]. Software Quality Journal, 2018, 26 : 97 - 125