Ensemble Meta Classifier with Sampling and Feature Selection for Data with Multiclass Imbalance Problem

被引:11
|
作者
Sainin, Mohd Shamrie [1 ]
Alfred, Rayner [1 ]
Ahmad, Faudziah [2 ]
机构
[1] Univ Malaysia Sabah, Fac Comp & Informat, Kota Kinabalu, Sabah, Malaysia
[2] Univ Utara Malaysia, Sch Comp, Changlun, Malaysia
关键词
Imbalance; multiclass; ensemble; feature selection; sampling; ROTATION FOREST;
D O I
10.32890/jict2021.20.2.1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ensemble learning by combining several single classifiers or another ensemble classifier is one of the procedures to solve the imbalance problem in multiclass data. However, this approach still faces the question of how the ensemble methods obtain their higher performance. In this paper, an investigation was carried out on the design of the meta classifier ensemble with sampling and feature selection for multiclass imbalanced data. The specific objectives were: 1) to improve the ensemble classifier through data-level approach (sampling and feature selection); 2) to perform experiments on sampling, feature selection, and ensemble classifier model; and 3) to evaluate the performance of the ensemble classifier. To fulfil the objectives, a preliminary data collection of Malaysian plants' leaf images was prepared and experimented, and the results were compared. The ensemble design was also tested with three other high imbalance ratio benchmark data. It was found that the design using sampling, feature selection, and ensemble classifier method via AdaboostM1 with random forest (also an ensemble classifier) provided improved performance throughout the investigation. The result of this study is important to the on-going problem of multiclass imbalance where specific structure and its performance can be improved in terms of processing time and accuracy.
引用
收藏
页码:103 / 133
页数:31
相关论文
共 50 条
  • [1] Feature Selection and Ensemble Meta Classifier for Multiclass Imbalance Data Learning
    Sainin, Mohd Shamrie
    Alfred, Rayner
    Alias, Suraya
    Lammasha, Mohamed A. M.
    [J]. PROCEEDINGS OF KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE (KMICE) 2018, 2018, : 134 - 139
  • [2] Combining Sampling and Ensemble Classifier for Multiclass Imbalance Data Learning
    Sainin, Mohd Shamrie
    Alfred, Rayner
    Adnan, Fairuz
    Ahmad, Faudziah
    [J]. COMPUTATIONAL SCIENCE AND TECHNOLOGY, ICCST 2017, 2018, 488 : 262 - 272
  • [3] Iterative ensemble feature selection for multiclass classification of imbalanced microarray data
    Yang, Junshan
    Zhou, Jiarui
    Zhu, Zexuan
    Ma, Xiaoliang
    Ji, Zhen
    [J]. JOURNAL OF BIOLOGICAL RESEARCH-THESSALONIKI, 2016, 23
  • [4] Classifier ensemble methods in feature selection
    Kiziloz, Hakan Ezgi
    [J]. NEUROCOMPUTING, 2021, 419 : 97 - 107
  • [5] An ensemble svm classifier with feature selection
    Hu, Han
    En-en, Ren
    [J]. 2007 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE & TECHNOLOGY, PROCEEDINGS, 2007, : 6 - 8
  • [6] Feature Selection Inspired Classifier Ensemble Reduction
    Diao, Ren
    Chao, Fei
    Peng, Taoxin
    Snooke, Neal
    Shen, Qiang
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (08) : 1259 - 1268
  • [7] A classifier ensemble approach for the missing feature problem
    Nanni, Loris
    Lumini, Alessandra
    Brahnam, Sheryl
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2012, 55 (01) : 37 - 50
  • [8] SUBOPTIMUM LINEAR FEATURE SELECTION IN MULTICLASS PROBLEM
    ICHINO, M
    HIRAMATSU, K
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1974, SMC4 (01): : 28 - 33
  • [9] Ensemble classifier based big data classification with hybrid optimal feature selection
    Pamila, J. C. Miraclin Joyce
    Selvi, R. Senthamil
    Santhi, P.
    Nithya, T. M.
    [J]. ADVANCES IN ENGINEERING SOFTWARE, 2022, 173
  • [10] A New Adaptive Framework for Classifier Ensemble in Multiclass Large Data
    Parvin, Hamid
    Minaei, Behrouz
    Alizadeh, Hosein
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2011, PT I, 2011, 6782 : 526 - 536