A survey of multi-class imbalanced data classification methods

被引:4
|
作者
Han, Meng [1 ]
Li, Ang [1 ]
Gao, Zhihui [1 ]
Mu, Dongliang [1 ]
Liu, Shujuan [1 ]
机构
[1] North Minzu Univ, Sch Comp Sci & Engn, Yinchuan, Ningxia, Peoples R China
关键词
Classification; multi-class imbalance data; data preprocessing method; algorithm-level classification method; EXTREME LEARNING-MACHINE; SELECTION; ALGORITHM; CNN;
D O I
10.3233/JIFS-221902
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In reality, the data generated in many fields are often imbalanced, such as fraud detection, network intrusion detection and disease diagnosis. The class with fewer instances in the data is called the minority class, and the minority class in some applications contains the significant information. So far, many classification methods and strategies for binary imbalanced data have been proposed, but there are still many problems and challenges in multi-class imbalanced data that need to be solved urgently. The classification methods for multi-class imbalanced data are analyzed and summarized in terms of data preprocessing methods and algorithm-level classification methods, and the performance of the algorithms using the same dataset is compared separately. In the data preprocessing methods, the methods of oversampling, under-sampling, hybrid sampling and feature selection are mainly introduced. Algorithm-level classification methods are comprehensively introduced in four aspects: ensemble learning, neural network, support vector machine and multi-class decomposition technique. At the same time, all data preprocessing methods and algorithm-level classification methods are analyzed in detail in terms of the techniques used, comparison algorithms, pros and cons, respectively. Moreover, the evaluation metrics commonly used for multi-class imbalanced data classification methods are described comprehensively. Finally, the future directions of multi-class imbalanced data classification are given.
引用
收藏
页码:2471 / 2501
页数:31
相关论文
共 50 条
  • [41] A GAN-Based Data Augmentation Method for Imbalanced Multi-Class Skin Lesion Classification
    Su, Qichen
    Hamed, Haza Nuzly Abdull
    Isa, Mohd Adham
    Hao, Xue
    Dai, Xin
    IEEE ACCESS, 2024, 12 : 16498 - 16513
  • [42] Learning from Combination of Data Chunks for Multi-class Imbalanced Data
    Liu, Xu-Ying
    Li, Qian-Qian
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 1680 - 1687
  • [43] MKC-SMOTE: A Novel Synthetic Oversampling Method for Multi-Class Imbalanced Data Classification
    Wang, Jiao
    Awang, Norhashidah
    IEEE ACCESS, 2024, 12 : 196929 - 196938
  • [44] BPSO-Adaboost-KNN ensemble learning algorithm for multi-class imbalanced data classification
    Guo Haixiang
    Li Yijing
    Li Yanan
    Liu Xiao
    Li Jinling
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 49 : 176 - 193
  • [45] AUC Evaluation of Multi-class Classifier Performance in Imbalanced Data
    Ni, Huangjing
    Wang, Wei
    2010 INTERNATIONAL CONFERENCE ON FUTURE CONTROL AND AUTOMATION (ICFCA 2010), 2010, : 48 - 51
  • [46] Efficient DANNLO classifier for multi-class imbalanced data on Hadoop
    Satyanarayana S.
    Tayar Y.
    Prasad R.S.R.
    International Journal of Information Technology, 2019, 11 (2) : 321 - 329
  • [47] Learning Imbalanced Multi-class Data with Optimal Dichotomy Weights
    Liu, Xu-Ying
    Li, Qian-Qian
    Zhou, Zhi-Hua
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 478 - 487
  • [48] A Partial Labeling Framework for Multi-Class Imbalanced Streaming Data
    Arabmakki, Elaheh
    Kantardzic, Mehmed
    Sethi, Tegjyot Singh
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1018 - 1025
  • [49] Imbalanced Multi-class Classification of Structural Damage in a Wind Turbine Foundation
    Leon-Medina, Jersson X.
    Pares, Nuria
    Anaya, Maribel
    Tibaduiza, Diego
    Pozo, Francesc
    EUROPEAN WORKSHOP ON STRUCTURAL HEALTH MONITORING (EWSHM 2022), VOL 3, 2023, : 492 - 500
  • [50] Multi-class Ensemble Learning of Imbalanced Bidding Fraud Data
    Anowar, Farzana
    Sadaoui, Samira
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11489 : 352 - 358