Combining Sampling and Ensemble Classifier for Multiclass Imbalance Data Learning

Cited by: 2
Authors
Sainin, Mohd Shamrie [1]
Alfred, Rayner [1]
Adnan, Fairuz [2]
Ahmad, Faudziah [2]
Affiliations
[1] Univ Malaysia Sabah, Fac Comp & Informat, Knowledge Technol Res Unit, Kota Kinabalu 88400, Sabah, Malaysia
[2] Univ Utara Malaysia, Coll Arts & Sci, Sch Comp, Data Sci Res Lab, Sintok 06010, Malaysia
Keywords
Ensemble; Sampling; Multiclass; Imbalance; Random Forest
DOI
10.1007/978-981-10-8276-4_25
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper investigates how combining various sampling techniques with ensemble classifiers affects prediction performance in multiclass imbalanced data learning. The experiments use shape data extracted from Malaysian medicinal leaf images and three other large benchmark datasets. Seven ensemble methods from the Weka machine learning tool were selected to perform the classification task: AdaBoostM1, Bagging, Decorate, END, MultiBoostAB, RotationForest, and Stacking. Five base classifiers (Naive Bayes, SMO, J48, Random Forest, and Random Tree) were used to examine the performance of the ensemble methods. Sampling and ensemble classifiers were combined in two ways: Resample with an ensemble classifier and SMOTE with an ensemble classifier. The experimental results show that no single configuration is a "one design that fits all". However, they also show that coupling sampling and an ensemble classifier with Random Forest can improve prediction performance on the multiclass imbalanced datasets.
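The general idea of pairing a resampling step with an ensemble classifier can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' Weka setup: plain random oversampling stands in for the Resample filter, a hand-written bagging loop stands in for the Weka ensembles, and a nearest-centroid learner replaces Random Forest to keep the example dependency-free. All function names and the toy data are hypothetical.

```python
import random
from collections import Counter

def oversample(X, y, rng):
    """Randomly duplicate minority-class rows until every class matches the largest one."""
    by_class = {}
    for xi, yi in zip(X, y):
        by_class.setdefault(yi, []).append(xi)
    target = max(len(rows) for rows in by_class.values())
    X_out, y_out = [], []
    for label, rows in by_class.items():
        extra = [rng.choice(rows) for _ in range(target - len(rows))]
        for row in rows + extra:
            X_out.append(row)
            y_out.append(label)
    return X_out, y_out

def fit_centroids(X, y):
    """Hypothetical base learner (assumption): one mean vector per class."""
    sums, counts = {}, Counter(y)
    for xi, yi in zip(X, y):
        acc = sums.setdefault(yi, [0.0] * len(xi))
        for j, v in enumerate(xi):
            acc[j] += v
    return {c: [s / counts[c] for s in acc] for c, acc in sums.items()}

def predict_centroids(model, x):
    """Return the class whose centroid is closest in squared Euclidean distance."""
    return min(model, key=lambda c: sum((a - b) ** 2 for a, b in zip(model[c], x)))

def fit_bagging(X, y, n_models=15, seed=0):
    """Balance the data first, then train each member on a bootstrap sample."""
    rng = random.Random(seed)
    Xb, yb = oversample(X, y, rng)
    models = []
    for _ in range(n_models):
        idx = [rng.randrange(len(Xb)) for _ in range(len(Xb))]
        models.append(fit_centroids([Xb[i] for i in idx], [yb[i] for i in idx]))
    return models

def predict_bagging(models, x):
    """Majority vote over the ensemble members."""
    votes = Counter(predict_centroids(m, x) for m in models)
    return votes.most_common(1)[0][0]

# Toy three-class imbalanced data, made up for illustration: 20 / 5 / 2 rows.
X = ([[0.0 + 0.01 * i, 0.0] for i in range(20)]
     + [[5.0 + 0.01 * i, 5.0] for i in range(5)]
     + [[0.0, 9.0 + 0.01 * i] for i in range(2)])
y = ["a"] * 20 + ["b"] * 5 + ["c"] * 2

Xb, yb = oversample(X, y, random.Random(0))
models = fit_bagging(X, y)
print(Counter(yb))                          # class counts after balancing
print(predict_bagging(models, [0.1, 8.8]))  # query near the rarest class
```

A SMOTE-style variant would replace the duplication in `oversample` with interpolation between a minority row and one of its nearest minority neighbours; the ensemble step would stay unchanged.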
Pages: 262-272
Number of pages: 11