Combining Sampling and Ensemble Classifier for Multiclass Imbalance Data Learning

被引:2
|
作者
Sainin, Mohd Shamrie [1 ]
Alfred, Rayner [1 ]
Adnan, Fairuz [2 ]
Ahmad, Faudziah [2 ]
机构
[1] Univ Malaysia Sabah, Fac Comp & Informat, Knowledge Technol Res Unit, Kota Kinabalu 88400, Sabah, Malaysia
[2] Univ Utara Malaysia, Coll Arts & Sci, Sch Comp, Data Sci Res Lab, Sintok 06010, Malaysia
关键词
Ensemble; Sampling; Multiclass; Imbalance; Random Forest;
D O I
10.1007/978-981-10-8276-4_25
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The aim of this paper is to investigate the effects of combining various sampling and ensemble classifiers on the prediction performance in addressing the multiclass imbalance data learning. This research uses data obtained from the Malaysian medicinal leaf images shape data and three other large benchmark datasets in which seven ensemble methods from Weka machine learning tool were selected to perform the classification task. These ensemble methods include the AdaboostM1, Bagging, Decorate, END, Multi-boostAB, RotationForest, and stacking methods. In addition to that, five base classifiers were used; Naive Bayes, SMO, J48, Random Forest, and Random Tree in order to examine the performance of the ensemble methods. Two methods of combining the sampling and ensemble classifiers were used which are called the Resample with ensemble classifier and SMOTE with ensemble classifier. The results obtained from the experiments show that there is actually no single configuration that is "one design that fits all". However, it is proven that when using the sampling and ensemble classifier which is coupled with Random Forest, the prediction performance of the classification task can be improved on the multiclass imbalance dataset.
引用
收藏
页码:262 / 272
页数:11
相关论文
共 50 条
  • [21] Classifier Selection and Ensemble Model for Multi-class Imbalance Learning in Education Grants Prediction
    Sun, Yu
    Li, Zhanli
    Li, Xuewen
    Zhang, Jing
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2021, 35 (04) : 290 - 303
  • [22] An Effective Sampling Strategy for Ensemble Learning with Imbalanced Data
    Zhang, Chen
    Zhang, Xiaolong
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2017, PT III, 2017, 10363 : 377 - 388
  • [23] Controlled Under-Sampling with Majority Voting Ensemble Learning for Class Imbalance Problem
    Sikora, Riyaz
    Raina, Sahil
    [J]. INTELLIGENT COMPUTING, VOL 2, 2019, 857 : 33 - 39
  • [24] Learning to Grasp Objects based on Ensemble Learning Combining Simulation Data and Real Data
    Na, Yong-Ho
    Jo, HyunJun
    Song, Jae-Bok
    [J]. 2017 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2017, : 1030 - 1034
  • [25] Kernel based online learning for imbalance multiclass classification
    Ding, Shuya
    Mirza, Bilal
    Lin, Zhiping
    Cao, Jiuwen
    Lai, Xiaoping
    Nguyen, Tam V.
    Sepulveda, Jose
    [J]. NEUROCOMPUTING, 2018, 277 : 139 - 148
  • [26] The Effects of Class Imbalance and Training Data Size on Classifier Learning: An Empirical Study
    Zheng W.
    Jin M.
    [J]. SN Computer Science, 2020, 1 (2)
  • [27] Minimizing False-Rejection Rates in Gas Leak Testing Using an Ensemble Multiclass Classifier for Unbalanced Data
    Costa, Diogo
    Rocha, Eugenio M.
    Ramalho, Pedro
    [J]. ADVANCED RESEARCH IN TECHNOLOGIES, INFORMATION, INNOVATION AND SUSTAINABILITY, ARTIIS 2022, PT I, 2022, 1675 : 429 - 443
  • [28] Dynamic Sampling Approach to Training Neural Networks for Multiclass Imbalance Classification
    Lin, Minlong
    Tang, Ke
    Yao, Xin
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (04) : 647 - 660
  • [29] Presenting a new multiclass classifier based on learning automata
    Afshar, Sorour
    Mosleh, Mohammad
    Kheyrandish, Mohammad
    [J]. NEUROCOMPUTING, 2013, 104 : 97 - 104
  • [30] Learning Multiclass Classifier Under Noisy Bandit Feedback
    Agarwal, Mudit
    Manwani, Naresh
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT II, 2021, 12713 : 448 - 460