Binary Imbalanced Data Classification Based on Modified D2GAN Oversampling and Classifier Fusion

被引:3
|
作者
Zhai, Junhai [1 ]
Qi, Jiaxing [1 ]
Zhang, Sufang [2 ]
机构
[1] Hebei Univ, Coll Math & Informat Sci, Hebei Key Lab Machine Learning & Computat Intelli, Baoding 071002, Peoples R China
[2] China Meteorol Adm, Hebei Branch China Meteorol Adm Training Ctr, Baoding 071000, Peoples R China
关键词
Gallium nitride; Generative adversarial networks; Generators; Training; Diversity methods; Data models; Machine learning; Binary class imbalance; diversity oversampling; generative adversarial network; classifier fusion; fuzzy integral; SMOTE; ENSEMBLE; PREDICTION; MACHINE;
D O I
10.1109/ACCESS.2020.3023949
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Binary imbalance problem refers to such a classification scenario where one class contains a large number of samples while another class contains only a few samples. When traditional classifiers face with imbalanced datasets, they usually bias towards majority class resulting in poor classification performance. Oversampling is an effective method to address this problem, yet how to conduct diversity oversampling is a challenge. In this article, we proposed a diversity oversampling method based on a modified D2GAN model, and on the basis of diversity oversampling, we also proposed a binary imbalanced data classification approach based on classifier fusion by fuzzy integral. Extensive experiments are conducted on 8 data sets to compare the proposed methods with 7 state-of-the-art methods on 5 aspects: MMD-score, Silhouette-score, F-measure, G-means, and AUC-area. The 7 methods include 4 SMOTE related approaches and 3 GAN related approaches. The experimental results demonstrate that the proposed methods are more effective and efficient than the compared approaches.
引用
收藏
页码:169456 / 169469
页数:14
相关论文
共 50 条
  • [31] Weighted Competitive-Collaborative Representation Based Classifier for Imbalanced Data Classification
    Li, Yanting
    Wang, Shuai
    Jin, Junwei
    Chen, C. L. Philip
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT II, 2022, 13605 : 462 - 472
  • [32] Clustering-based improved adaptive synthetic minority oversampling technique for imbalanced data classification
    Jin, Dian
    Xie, Dehong
    Liu, Di
    Gong, Murong
    INTELLIGENT DATA ANALYSIS, 2023, 27 (03) : 635 - 652
  • [33] Evolutionary Mahalanobis Distance-Based Oversampling for Multi-Class Imbalanced Data Classification
    Yao, Leehter
    Lin, Tung-Bin
    SENSORS, 2021, 21 (19)
  • [34] A Synthetic Minority Oversampling Technique Based on Gaussian Mixture Model Filtering for Imbalanced Data Classification
    Xu, Zhaozhao
    Shen, Derong
    Kou, Yue
    Nie, Tiezheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3740 - 3753
  • [35] A non-parameter oversampling approach for imbalanced data classification based on hybrid natural neighbors
    Lin, Junyue
    Liang, Lu
    APPLIED INTELLIGENCE, 2025, 55 (05)
  • [36] Self-adaptive oversampling method based on the complexity of minority data in imbalanced datasets classification
    Tao, Xinmin
    Guo, Xinyue
    Zheng, Yujia
    Zhang, Xiaohan
    Chen, Zhiyu
    KNOWLEDGE-BASED SYSTEMS, 2023, 277
  • [37] A Classification Model for Imbalanced Medical Data based on PCA and Farther Distance based Synthetic Minority Oversampling Technique
    Mustafa, Nadir
    Memon, Raheel A.
    Li, Jian-Ping
    Omer, Mohammed Z.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2017, 8 (01) : 61 - 67
  • [38] Classifier Ensemble Based on Multiview Optimization for High-Dimensional Imbalanced Data Classification
    Xu, Yuhong
    Yu, Zhiwen
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 870 - 883
  • [39] Empirical Assessment of Ensemble based Approaches to Classify Imbalanced Data in Binary Classification
    Kaur, Prabhjot
    Gosain, Anjana
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (03) : 48 - 58
  • [40] An Ensemble Method Based on SVC and Euclidean Distance for Classification Binary Imbalanced Data
    Zhao, Lei
    Wang, Lei
    Gui, Guan
    WIRELESS INTERNET (WICON 2016), 2018, 214 : 312 - 320