Adaptive Centre-Weighted Oversampling for Class Imbalance in Software Defect Prediction

被引:3
|
作者
Zhao, Qi [1 ]
Yan, Xuefeng [1 ,2 ]
Zhou, Yong [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Jiangsu, Peoples R China
[2] Collaborat Innovat Ctr Novel Software Technol & I, Nanjing, Jiangsu, Peoples R China
关键词
software defect prediction; class imbalance; oversampling; adaptive centre; weights; SMOTE; ALGORITHM;
D O I
10.1109/BDCloud.2018.00044
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In the field of software engineering, software defect prediction can maintain the high quality of software products, which is a popular current research topic. However, class imbalance affects the overall classification accuracy of software defect prediction models which is the key issue to be resolved. A new method called adaptive centre-weighted oversampling (ACWO) is proposed to effectively address imbalanced learning problems. First, an appropriate neighborhood size and neighbors are determined for each minority class sample. Then, for a minority class sample, the adaptive centre that is within its neighborhood range, its neighbors and the minority class sample are used to generate synthetic samples. Finally, oversampling of each minority class sample is carried out based on the weights assigned to them. These weights are obtained according to the neighborhood sizes and Euclidean distances to the centre. Afterwards, the software defect prediction model is eventually established by ACWO algorithm with stacked denoising autoencoder neural network. Experimental results show that the software defect prediction model based on ACWO algorithm has a better performance than based on many existing class imbalance learning algorithms according to the precision P, recall R, F1 measure, G-mean, and AUC values.
引用
收藏
页码:223 / 230
页数:8
相关论文
共 50 条
  • [21] Class Imbalance Learning to Heterogeneous Cross-Software Projects Defect Prediction
    Vashisht, Rohit
    Rizvi, Syed Afzal Murtaza
    INTERNATIONAL JOURNAL OF SOFTWARE INNOVATION, 2022, 10 (01)
  • [22] A Hybrid Approach to Coping with High Dimensionality and Class Imbalance for Software Defect Prediction
    Gao, Kehan
    Khoshgoftaar, Taghi
    Napolitano, Amri
    2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 2, 2012, : 281 - 288
  • [23] Which type of metrics are useful to deal with class imbalance in software defect prediction?
    Ozturk, Muhammed Maruf
    INFORMATION AND SOFTWARE TECHNOLOGY, 2017, 92 : 17 - 29
  • [24] Influence Analysis Method of Class Imbalance on Software Defect Prediction Model Stability and Prediction Performance
    Zhang Y.-M.
    Zhi S.-L.
    Jiang S.-J.
    Yuan G.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (08): : 2076 - 2087
  • [25] An empirical study toward dealing with noise and class imbalance issues in software defect prediction
    Pandey, Sushant Kumar
    Tripathi, Anil Kumar
    SOFT COMPUTING, 2021, 25 (21) : 13465 - 13492
  • [26] Class Imbalance Evolution and Verification Latency in Just-in-Time Software Defect Prediction
    Cabral, George G.
    Minku, Leandro L.
    Shihab, Emad
    Mujahid, Suhaib
    2019 IEEE/ACM 41ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2019), 2019, : 666 - 676
  • [27] Prospect theory-based oversampling for software defect prediction
    Xu, Biao
    Yan, Yuanting
    Zhang, Yiwen
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2024, 30 (08): : 2822 - 2831
  • [28] Software Defect Prediction using Oversampling Algorithm: A-SUWO
    Choirunnisa, Shabrina
    Meidyani, Biandina
    Rochimah, Siti
    2018 ELECTRICAL POWER, ELECTRONICS, COMMUNICATIONS, CONTROLS, AND INFORMATICS SEMINAR (EECCIS), 2018, : 337 - 341
  • [29] An empirical study toward dealing with noise and class imbalance issues in software defect prediction
    Sushant Kumar Pandey
    Anil Kumar Tripathi
    Soft Computing, 2021, 25 : 13465 - 13492
  • [30] Software defect prediction based on correlation weighted class association rule mining
    Shao, Yuanxun
    Liu, Bin
    Wang, Shihai
    Li, Guoqi
    KNOWLEDGE-BASED SYSTEMS, 2020, 196