Combining Sampling and Ensemble Classifier for Multiclass Imbalance Data Learning

被引:2
|
作者
Sainin, Mohd Shamrie [1 ]
Alfred, Rayner [1 ]
Adnan, Fairuz [2 ]
Ahmad, Faudziah [2 ]
机构
[1] Univ Malaysia Sabah, Fac Comp & Informat, Knowledge Technol Res Unit, Kota Kinabalu 88400, Sabah, Malaysia
[2] Univ Utara Malaysia, Coll Arts & Sci, Sch Comp, Data Sci Res Lab, Sintok 06010, Malaysia
关键词
Ensemble; Sampling; Multiclass; Imbalance; Random Forest;
D O I
10.1007/978-981-10-8276-4_25
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The aim of this paper is to investigate the effects of combining various sampling and ensemble classifiers on the prediction performance in addressing the multiclass imbalance data learning. This research uses data obtained from the Malaysian medicinal leaf images shape data and three other large benchmark datasets in which seven ensemble methods from Weka machine learning tool were selected to perform the classification task. These ensemble methods include the AdaboostM1, Bagging, Decorate, END, Multi-boostAB, RotationForest, and stacking methods. In addition to that, five base classifiers were used; Naive Bayes, SMO, J48, Random Forest, and Random Tree in order to examine the performance of the ensemble methods. Two methods of combining the sampling and ensemble classifiers were used which are called the Resample with ensemble classifier and SMOTE with ensemble classifier. The results obtained from the experiments show that there is actually no single configuration that is "one design that fits all". However, it is proven that when using the sampling and ensemble classifier which is coupled with Random Forest, the prediction performance of the classification task can be improved on the multiclass imbalance dataset.
引用
收藏
页码:262 / 272
页数:11
相关论文
共 50 条
  • [1] Ensemble Meta Classifier with Sampling and Feature Selection for Data with Multiclass Imbalance Problem
    Sainin, Mohd Shamrie
    Alfred, Rayner
    Ahmad, Faudziah
    [J]. JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY-MALAYSIA, 2021, 20 (02): : 103 - 133
  • [2] Feature Selection and Ensemble Meta Classifier for Multiclass Imbalance Data Learning
    Sainin, Mohd Shamrie
    Alfred, Rayner
    Alias, Suraya
    Lammasha, Mohamed A. M.
    [J]. PROCEEDINGS OF KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE (KMICE) 2018, 2018, : 134 - 139
  • [3] ENSEMBLE CLASSIFIER AND RESAMPLING FOR IMBALANCED MULTICLASS LEARNING
    Sainin, Mohd Shamrie
    Ahmad, Faudziah
    Alfred, Rayner
    [J]. PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON COMPUTING & INFORMATICS, 2015, : 751 - 756
  • [4] A Direct Ensemble Classifier for Imbalanced Multiclass Learning
    Sainin, Mohd Shamrie
    Alfred, Rayner
    [J]. 2012 4TH CONFERENCE ON DATA MINING AND OPTIMIZATION (DMO), 2012, : 59 - 66
  • [5] A novel cost sensitive neural network ensemble for multiclass imbalance data learning
    Cao, Peng
    Li, Bo
    Zhao, Dazhe
    Zaiane, Osmar
    [J]. 2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [6] An Adaptive Sampling Ensemble Classifier for Learning from Imbalanced Data Sets
    Geiler, Ordonez Jon
    Hong, Li
    Yue-Jian, Guo
    [J]. INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS (IMECS 2010), VOLS I-III, 2010, : 513 - 517
  • [7] A New Adaptive Framework for Classifier Ensemble in Multiclass Large Data
    Parvin, Hamid
    Minaei, Behrouz
    Alizadeh, Hosein
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2011, PT I, 2011, 6782 : 526 - 536
  • [8] Measure optimized cost-sensitive neural network ensemble for multiclass imbalance data learning
    Cao, Peng
    Zhao, Dazhe
    Zaiane, Osmar
    [J]. 2013 13TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS), 2013, : 35 - 40
  • [9] One-class ensemble classifier for data imbalance problems
    Hayashi, Toshitaka
    Fujita, Hamido
    [J]. APPLIED INTELLIGENCE, 2022, 52 (15) : 17073 - 17089
  • [10] One-class ensemble classifier for data imbalance problems
    Toshitaka Hayashi
    Hamido Fujita
    [J]. Applied Intelligence, 2022, 52 : 17073 - 17089