Binary Imbalanced Data Classification Based on Modified D2GAN Oversampling and Classifier Fusion

Cited by: 3
Authors
Zhai, Junhai [1]
Qi, Jiaxing [1]
Zhang, Sufang [2]
Affiliations
[1] Hebei Univ, Coll Math & Informat Sci, Hebei Key Lab Machine Learning & Computat Intelli, Baoding 071002, Peoples R China
[2] China Meteorol Adm, Hebei Branch China Meteorol Adm Training Ctr, Baoding 071000, Peoples R China
Keywords
Gallium nitride; Generative adversarial networks; Generators; Training; Diversity methods; Data models; Machine learning; Binary class imbalance; diversity oversampling; generative adversarial network; classifier fusion; fuzzy integral; SMOTE; ENSEMBLE; PREDICTION; MACHINE;
DOI
10.1109/ACCESS.2020.3023949
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The binary class imbalance problem refers to a classification scenario in which one class contains a large number of samples while the other contains only a few. When traditional classifiers are faced with imbalanced datasets, they are usually biased towards the majority class, resulting in poor classification performance. Oversampling is an effective way to address this problem, yet how to generate diverse samples when oversampling remains a challenge. In this article, we propose a diversity oversampling method based on a modified D2GAN model and, building on it, a binary imbalanced data classification approach based on classifier fusion by fuzzy integral. Extensive experiments are conducted on 8 data sets to compare the proposed methods with 7 state-of-the-art methods on 5 measures: MMD-score, Silhouette-score, F-measure, G-means, and AUC-area. The 7 methods include 4 SMOTE-related approaches and 3 GAN-related approaches. The experimental results demonstrate that the proposed methods are more effective and efficient than the compared approaches.
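The abstract names F-measure and G-means among its evaluation measures but does not define them. The sketch below is an illustration only, not the authors' code: it assumes the standard textbook definitions (F-measure as the harmonic mean of precision and recall, G-mean as the square root of the product of true positive rate and true negative rate), with the minority class treated as the positive class. The function names and the toy labels are hypothetical.

```python
import math

def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, tn, fn), treating `positive` as the minority class."""
    tp = fp = tn = fn = 0
    for t, p in zip(y_true, y_pred):
        if t == positive and p == positive:
            tp += 1
        elif t == positive:
            fn += 1
        elif p == positive:
            fp += 1
        else:
            tn += 1
    return tp, fp, tn, fn

def f_measure(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall (assumed standard definition)."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred, positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

def g_mean(y_true, y_pred, positive=1):
    """Geometric mean of true positive rate and true negative rate."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred, positive)
    tpr = tp / (tp + fn) if tp + fn else 0.0
    tnr = tn / (tn + fp) if tn + fp else 0.0
    return math.sqrt(tpr * tnr)

def accuracy(y_true, y_pred):
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

# Hypothetical toy data: 8 majority samples (0) and 2 minority samples (1).
y_true = [0] * 8 + [1] * 2

# A classifier biased towards the majority class predicts 0 everywhere.
always_majority = [0] * 10

# A classifier that finds one minority sample at the cost of one false alarm.
balanced_guess = [0] * 7 + [1] + [1, 0]

for name, y_pred in [("always majority", always_majority),
                     ("balanced guess", balanced_guess)]:
    print(name, accuracy(y_true, y_pred),
          f_measure(y_true, y_pred), g_mean(y_true, y_pred))
```

On this toy data both classifiers reach the same accuracy, while F-measure and G-mean fall to zero for the classifier that ignores the minority class. This is the usual motivation for reporting such measures on imbalanced data.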
Pages: 169456 - 169469
Number of pages: 14
Related papers
50 in total
  • [1] An Improved D2GAN-based oversampling algorithm for imbalanced data classification
    Zhao, Xiaoqiang
    Yao, Qinglei
    STATISTICAL ANALYSIS AND DATA MINING, 2023, 16 (06) : 569 - 582
  • [2] Binary imbalanced big data classification based on fuzzy data reduction and classifier fusion
    Zhai, Junhai
    Wang, Mohan
    Zhang, Sufang
    SOFT COMPUTING, 2022, 26 (06) : 2781 - 2792
  • [3] Binary imbalanced big data classification based on fuzzy data reduction and classifier fusion
    Junhai Zhai
    Mohan Wang
    Sufang Zhang
    Soft Computing, 2022, 26 : 2781 - 2792
  • [4] Binary imbalanced data classification based on diversity oversampling by generative models
    Zhai, Junhai
    Qi, Jiaxing
    Shen, Chu
    INFORMATION SCIENCES, 2022, 585 : 313 - 343
  • [5] Multi-oversampling with Evidence Fusion for Imbalanced Data Classification
    Tian, Hongpeng
    Zhang, Zuowei
    Liu, Zhunga
    Zuo, Jingwei
    BELIEF FUNCTIONS: THEORY AND APPLICATIONS, BELIEF 2024, 2024, 14909 : 68 - 77
  • [6] Imbalanced data classification based on diverse sample generation and classifier fusion
    Zhai, Junhai
    Qi, Jiaxing
    Zhang, Sufang
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (03) : 735 - 750
  • [7] Imbalanced data classification based on diverse sample generation and classifier fusion
    Junhai Zhai
    Jiaxing Qi
    Sufang Zhang
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 735 - 750
  • [8] Gaussian Distribution Based Oversampling for Imbalanced Data Classification
    Xie, Yuxi
    Qiu, Min
    Zhang, Haibo
    Peng, Lizhi
    Chen, Zhenxiang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (02) : 667 - 679
  • [9] Radial-Based oversampling for noisy imbalanced data classification
    Koziarski, Michal
    Krawczyk, Bartosz
    Wozniak, Michal
    NEUROCOMPUTING, 2019, 343 : 19 - 33
  • [10] Radial-Based Oversampling for Multiclass Imbalanced Data Classification
    Krawczyk, Bartosz
    Koziarski, Michal
    Wozniak, Michal
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (08) : 2818 - 2831