Binary Imbalanced Data Classification Based on Modified D2GAN Oversampling and Classifier Fusion

Cited by: 3
Authors
Zhai, Junhai [1]
Qi, Jiaxing [1]
Zhang, Sufang [2]
Affiliations
[1] Hebei Univ, Coll Math & Informat Sci, Hebei Key Lab Machine Learning & Computat Intelli, Baoding 071002, Peoples R China
[2] China Meteorol Adm, Hebei Branch China Meteorol Adm Training Ctr, Baoding 071000, Peoples R China
Keywords
Gallium nitride; Generative adversarial networks; Generators; Training; Diversity methods; Data models; Machine learning; Binary class imbalance; diversity oversampling; generative adversarial network; classifier fusion; fuzzy integral; SMOTE; ENSEMBLE; PREDICTION; MACHINE;
DOI
10.1109/ACCESS.2020.3023949
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The binary imbalance problem refers to a classification scenario in which one class contains a large number of samples while the other contains only a few. When traditional classifiers face imbalanced datasets, they are usually biased towards the majority class, resulting in poor classification performance. Oversampling is an effective way to address this problem, yet how to oversample with diversity remains a challenge. In this article, we propose a diversity oversampling method based on a modified D2GAN model and, building on it, a binary imbalanced data classification approach based on classifier fusion by fuzzy integral. Extensive experiments on 8 data sets compare the proposed methods with 7 state-of-the-art methods on 5 measures: MMD-score, Silhouette-score, F-measure, G-means, and AUC-area. The 7 methods comprise 4 SMOTE-related approaches and 3 GAN-related approaches. The experimental results demonstrate that the proposed methods are more effective and efficient than the compared approaches.
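The abstract names F-measure and G-means as evaluation measures without defining them. As a minimal sketch, assuming the standard textbook definitions (F-measure as the harmonic mean of precision and recall, G-mean as the geometric mean of recall and specificity, with the minority class treated as positive), they can be computed from a confusion matrix as below. The function name `imbalance_metrics` and the toy labels are hypothetical and are not taken from the paper.

```python
import math

def imbalance_metrics(y_true, y_pred, positive=1):
    """Return (f_measure, g_mean), treating `positive` as the minority class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    # Guard every ratio against an empty denominator.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0          # minority-class accuracy
    specificity = tn / (tn + fp) if tn + fp else 0.0     # majority-class accuracy

    denom = precision + recall
    f_measure = 2 * precision * recall / denom if denom else 0.0
    g_mean = math.sqrt(recall * specificity)
    return f_measure, g_mean

# Toy example: 2 minority samples, 8 majority samples (illustrative only).
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
y_pred = [1, 0, 0, 0, 0, 0, 0, 0, 0, 1]
f, g = imbalance_metrics(y_true, y_pred)
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
```

In this toy case plain accuracy is 0.8 even though only half of the minority samples are recovered, which is why imbalance studies tend to report measures such as these rather than accuracy alone.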
Pages: 169456 - 169469
Page count: 14
Related papers
50 results in total
  • [21] Adaptive Fusion Based Method for Imbalanced Data Classification
    Liang, Zefeng
    Wang, Huan
    Yang, Kaixiang
    Shi, Yifan
    FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [22] Switching synthesizing-incorporated and cluster-based synthetic oversampling for imbalanced binary classification
    Dou, Jun
    Gao, Zihan
    Wei, Guoliang
    Song, Yan
    Li, Ming
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [23] Binary text classification using genetic programming with crossover-based oversampling for imbalanced datasets
    Aljero, Mona Khalifa A.
    Dimililer, Nazife
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2023, 31 (01) : 180 - 192
  • [24] Spark-based deep classifier framework for imbalanced data classification
    Bhowate, Vikas Gajananrao
    Reddy, T. Hanumantha
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023, 11 (05) : 1661 - 1677
  • [25] Tabular GAN-Based Oversampling of Imbalanced Time-to-Event Data for Survival Prediction
    Tan, Huaning
    Chen, Renxing
    Qin, Meng
    Tang, Lining
    Wu, Zhibing
    Luo, Qianlin
    Quan, Yujuan
    2023 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYTICS, ICCCBDA, 2023, : 376 - 380
  • [26] An intrusion detection imbalanced data classification algorithm based on CWGAN-GP oversampling
    Yao, Qinglei
    Zhao, Xiaoqiang
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2025, 18 (03)
  • [27] An efficient method to determine sample size in oversampling based on classification complexity for imbalanced data
    Lee, Dohyun
    Kim, Kyoungok
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 184 (184)
  • [28] Oversampling framework based on sample subspace optimization with accelerated binary particle swarm optimization for imbalanced classification
    Li, Junnan
    APPLIED SOFT COMPUTING, 2024, 162
  • [29] GAN-Based Semi-supervised For Imbalanced Data Classification
    Zhou, Tingting
    Liu, Wei
    Zhou, Congyu
    Chen, Leiting
    2018 4TH INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT (ICIM2018), 2018, : 17 - 21
  • [30] Improved KD-tree based imbalanced big data classification and oversampling for MapReduce platforms
    Sleeman, William C.
    Roseberry, Martha
    Ghosh, Preetam
    Cano, Alberto
    Krawczyk, Bartosz
    APPLIED INTELLIGENCE, 2024, 54 (23) : 12558 - 12575