Combining Sampling and Ensemble Classifier for Multiclass Imbalance Data Learning

Cited by: 2
Authors
Sainin, Mohd Shamrie [1]
Alfred, Rayner [1]
Adnan, Fairuz [2]
Ahmad, Faudziah [2]
Affiliations
[1] Univ Malaysia Sabah, Fac Comp & Informat, Knowledge Technol Res Unit, Kota Kinabalu 88400, Sabah, Malaysia
[2] Univ Utara Malaysia, Coll Arts & Sci, Sch Comp, Data Sci Res Lab, Sintok 06010, Malaysia
Keywords
Ensemble; Sampling; Multiclass; Imbalance; Random Forest
DOI
10.1007/978-981-10-8276-4_25
CLC number
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
This paper investigates how combining sampling methods with ensemble classifiers affects prediction performance in multiclass imbalanced data learning. The experiments use a Malaysian medicinal leaf image shape dataset and three other large benchmark datasets, with seven ensemble methods from the Weka machine learning tool performing the classification task: AdaBoostM1, Bagging, Decorate, END, MultiBoostAB, RotationForest, and Stacking. Five base classifiers (Naive Bayes, SMO, J48, Random Forest, and Random Tree) were used to examine the performance of the ensemble methods. Sampling and ensemble classifiers were combined in two ways: Resample with an ensemble classifier, and SMOTE with an ensemble classifier. The experimental results show that no single configuration is a "one design that fits all". However, the results indicate that pairing a sampling method and an ensemble classifier with Random Forest can improve prediction performance on multiclass imbalanced datasets.
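The general idea of rebalancing the training data before fitting an ensemble can be illustrated with a minimal, standard-library-only sketch. It is not the authors' Weka configuration: random oversampling stands in for the resampling step, a toy nearest-centroid learner stands in for the base classifiers named in the abstract, and all function names (`resample_balanced`, `fit_bagging`, and so on) are hypothetical. SMOTE, which synthesizes new minority samples by interpolation rather than duplicating existing ones, is not shown.

```python
import random
from collections import Counter, defaultdict


def resample_balanced(X, y, rng):
    """Random oversampling: duplicate rows (with replacement) until
    every class has as many rows as the largest class."""
    by_class = defaultdict(list)
    for features, label in zip(X, y):
        by_class[label].append(features)
    target = max(len(rows) for rows in by_class.values())
    X_out, y_out = [], []
    for label, rows in by_class.items():
        extra = [rng.choice(rows) for _ in range(target - len(rows))]
        for features in rows + extra:
            X_out.append(features)
            y_out.append(label)
    return X_out, y_out


def fit_centroids(X, y):
    """Toy base learner (assumption): one mean vector per class."""
    counts = Counter(y)
    sums = {}
    for features, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(features))
        for i, value in enumerate(features):
            acc[i] += value
    return {label: [s / counts[label] for s in acc] for label, acc in sums.items()}


def predict_centroids(model, features):
    """Return the class whose centroid is closest in squared distance."""
    def sq_dist(centroid):
        return sum((a - b) ** 2 for a, b in zip(features, centroid))
    return min(model, key=lambda label: sq_dist(model[label]))


def fit_bagging(X, y, n_models=15, seed=0):
    """Balance the data once, then train each model on a bootstrap sample."""
    rng = random.Random(seed)
    X_bal, y_bal = resample_balanced(X, y, rng)
    n = len(X_bal)
    models = []
    for _ in range(n_models):
        idx = [rng.randrange(n) for _ in range(n)]
        models.append(fit_centroids([X_bal[i] for i in idx],
                                    [y_bal[i] for i in idx]))
    return models


def predict_bagging(models, features):
    """Majority vote over the ensemble members."""
    votes = Counter(predict_centroids(m, features) for m in models)
    return votes.most_common(1)[0][0]
```

In this sketch the balancing happens once, before bagging, so every bootstrap sample is drawn from a class-balanced pool. Balancing inside each bootstrap round is an equally plausible design and may behave differently on strongly skewed data.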
Pages: 262-272
Page count: 11