A preprocessing method combined with an ensemble framework for the multiclass imbalanced data classification

被引:0
|
作者
Pavan Kumar M.R. [1 ]
Jayagopal P. [2 ]
机构
[1] School of Computer Science and Engineering, Vellore Institute of Technology, Vellore
[2] School of Information Technology & Engineering, Vellore Institute of Technology, Vellore
关键词
ensemble learning; Imbalanced dataset; multiclass imbalance classification; multiple classifier system; oversampling;
D O I
10.1080/1206212X.2019.1700335
中图分类号
学科分类号
摘要
Skewed distributions appear in many real-world classification problems. Skewed distributions, underrepresented classes, and multiple overlapping regions in multiclass imbalanced datasets deteriorate the performance of existing classification algorithms and approaches. In this context, we combine a novel preprocessing procedure to tackle minority classes in multiclass imbalanced problems with an ensemble framework. The preprocessing method oversamples the minority classes based on normalized probability, and then an ensemble called a stacked generalization framework is used to train the model. The motive behind combining the ensemble framework and the preprocessing procedure is to enhance the overall classification performance of the classifier for multiclass imbalanced problems. Experimental results on 20 multiclass imbalanced datasets show that the proposed preprocessing method with the ensemble framework outperforms the representative approaches in 13 datasets for macro average arithmetic (MAvA) and mean F-measure (MFM) metrics. In the case of state-of-the-art techniques, the proposed approach steered 14 datasets for the MAvA metric and 15 datasets for the MFM metric to success. © 2019 Informa UK Limited, trading as Taylor & Francis Group.
引用
下载
收藏
页码:1178 / 1185
页数:7
相关论文
共 50 条
  • [31] Binary Data Embedding Framework for Multiclass Classification
    Chi, Yuan
    Griffith, Elias J.
    Goulermas, John Yannis
    Ralph, Jason F.
    IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2015, 45 (04) : 453 - 464
  • [32] Adaptive Data Embedding Framework for Multiclass Classification
    Mu, Tingting
    Jiang, Jianmin
    Wang, Yan
    Goulermas, John Y.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (08) : 1291 - 1303
  • [33] An Ensemble Tree Classifier for Highly Imbalanced Data Classification
    Shi, Peibei
    Wang, Zhong
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2021, 34 (06) : 2250 - 2266
  • [34] Imbalanced Data Classification Using Weighted Voting Ensemble
    Lu, Lin
    Wozniak, Michal
    IMAGE PROCESSING AND COMMUNICATIONS: TECHNIQUES, ALGORITHMS AND APPLICATIONS, 2020, 1062 : 82 - 91
  • [35] An Ensemble Tree Classifier for Highly Imbalanced Data Classification
    SHI Peibei
    WANG Zhong
    Journal of Systems Science & Complexity, 2021, 34 (06) : 2250 - 2266
  • [36] An ensemble classifier framework for mining imbalanced data streams
    Ouyang, Zhen-Zheng
    Luo, Jian-Shu
    Hu, Dong-Min
    Wu, Quan-Yuan
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2010, 38 (01): : 184 - 189
  • [37] Adaptive ensemble of classifiers with regularization for imbalanced data classification
    Wang, Chen
    Deng, Chengyuan
    Yu, Zhoulu
    Hui, Dafeng
    Gong, Xiaofeng
    Luo, Ruisen
    INFORMATION FUSION, 2021, 69 : 81 - 102
  • [38] An Ensemble Tree Classifier for Highly Imbalanced Data Classification
    Peibei Shi
    Zhong Wang
    Journal of Systems Science and Complexity, 2021, 34 : 2250 - 2266
  • [39] Dynamic Classification Ensembles for Handling Imbalanced Multiclass Drifted Data Streams
    Madkour A.H.
    Abdelkader H.M.
    Mohammed A.M.
    Information Sciences, 2024, 670
  • [40] A novel ensemble method for classifying imbalanced data
    Sun, Zhongbin
    Song, Qinbao
    Zhu, Xiaoyan
    Sun, Heli
    Xu, Baowen
    Zhou, Yuming
    PATTERN RECOGNITION, 2015, 48 (05) : 1623 - 1637