Random Balance ensembles for multiclass imbalance learning

被引:20
|
作者
Rodriguez, Juan J. [1 ]
Diez-Pastor, Jose-Francisco [1 ]
Arnaiz-Gonzalez, Alvar [1 ]
Kuncheva, Ludmila, I [2 ]
机构
[1] Univ Burgos, Escuela Politecn Super, Avda Cantabria S-N, Burgos 09006, Spain
[2] Bangor Univ, Dean St, Bangor LL57 1UT, Gwynedd, Wales
关键词
Classifier ensembles; Imbalanced data; Multiclass classification; DATA-SETS; NEURAL-NETWORKS; STATISTICAL COMPARISONS; BINARIZATION TECHNIQUES; SAMPLING APPROACH; MULTIPLE CLASSES; CLASSIFICATION; SMOTE; CLASSIFIERS; PREDICTION;
D O I
10.1016/j.knosys.2019.105434
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Random Balance strategy (RandBal) has been recently proposed for constructing classifier ensembles for imbalanced, two-class data sets. In RandBal, each base classifier is trained with a sample of the data with a random class prevalence, independent of the a priori distribution. Hence, for each sample, one of the classes will be undersampled while the other will be oversampled. RandBal can be applied on its own or can be combined with any other ensemble method. One particularly successful variant is RandBalBoost which integrates Random Balance and boosting. Encouraged by the success of RandBal, this work proposes two approaches which extend RandBal to multiclass imbalance problems. Multiclass imbalance implies that at least two classes have substantially different proportion of instances. In the first approach proposed here, termed Multiple Random Balance (MultiRandBal), we deal with all classes simultaneously. The training data for each base classifier are sampled with random class proportions. The second approach we propose decomposes the multiclass problem into two-class problems using one-vs-one or one-vs-all, and builds an ensemble of RandBal ensembles. We call the two versions of the second approach OVO-RandBal and OVA-RandBal, respectively. These two approaches were chosen because they are the most straightforward extensions of RandBal for multiple classes. Our main objective is to evaluate both approaches for multiclass imbalanced problems. To this end, an experiment was carried out with 52 multiclass data sets. The results suggest that both MultiRandBal, and OVO/OVA-RandBal are viable extensions of the original two-class RandBal. Collectively, they consistently outperform acclaimed state-of-the art methods for multiclass imbalanced problems. (c) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Imbalance learning using heterogeneous ensembles
    Zefrehi, Hossein Ghaderi
    Altincay, Hakan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 142
  • [2] Kernel based online learning for imbalance multiclass classification
    Ding, Shuya
    Mirza, Bilal
    Lin, Zhiping
    Cao, Jiuwen
    Lai, Xiaoping
    Nguyen, Tam V.
    Sepulveda, Jose
    [J]. NEUROCOMPUTING, 2018, 277 : 139 - 148
  • [3] Combining Sampling and Ensemble Classifier for Multiclass Imbalance Data Learning
    Sainin, Mohd Shamrie
    Alfred, Rayner
    Adnan, Fairuz
    Ahmad, Faudziah
    [J]. COMPUTATIONAL SCIENCE AND TECHNOLOGY, ICCST 2017, 2018, 488 : 262 - 272
  • [4] Feature Selection and Ensemble Meta Classifier for Multiclass Imbalance Data Learning
    Sainin, Mohd Shamrie
    Alfred, Rayner
    Alias, Suraya
    Lammasha, Mohamed A. M.
    [J]. PROCEEDINGS OF KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE (KMICE) 2018, 2018, : 134 - 139
  • [5] Diversity techniques improve the performance of the best imbalance learning ensembles
    Diez-Pastor, Jose F.
    Rodriguez, Juan J.
    Garcia-Osorio, Cesar I.
    Kuncheva, Ludmila I.
    [J]. INFORMATION SCIENCES, 2015, 325 : 98 - 117
  • [6] Random Balance: Ensembles of variable priors classifiers for imbalanced data
    Diez-Pastor, Jose F.
    Rodriguez, Juan J.
    Garcia-Osorio, Cesar
    Kuncheva, Ludmila I.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2015, 85 : 96 - 111
  • [7] Random Separation Learning for Neural Network Ensembles
    Liu, Yong
    [J]. 2017 10TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI), 2017,
  • [8] Introducing DeepBalance: Random Deep Belief Network Ensembles to Address Class Imbalance
    Xenopoulos, Peter
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 3684 - 3689
  • [9] A novel cost sensitive neural network ensemble for multiclass imbalance data learning
    Cao, Peng
    Li, Bo
    Zhao, Dazhe
    Zaiane, Osmar
    [J]. 2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [10] Double-kernel based class-specific broad learning system for multiclass imbalance learning
    Chen, Wuxing
    Yang, Kaixiang
    Yu, Zhiwen
    Zhang, Weiwen
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 253