Class Imbalance Ensemble Learning Based on the Margin Theory

Cited by: 102
Authors
Feng, Wei [1]
Huang, Wenjiang [1]
Ren, Jinchang [2]
Affiliations
[1] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, Key Lab Digital Earth Sci, Beijing 100094, Peoples R China
[2] Univ Strathclyde, Dept Elect & Elect Engn, Glasgow G1 1XW, Lanark, Scotland
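Source
APPLIED SCIENCES-BASEL | 2018 / Vol. 8 / Issue 05
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China;
Keywords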
classification; ensemble margin; imbalance learning; ensemble learning; multi-class; SUPPORT VECTOR MACHINES; DATA-SETS; STATISTICAL COMPARISONS; CLASSIFICATION; PERFORMANCE; DIVERSITY; CLASSIFIERS; PREDICTION; ALGORITHM; IMPROVE;
DOI
10.3390/app8050815
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
The proportion of instances belonging to each class in a data set plays an important role in machine learning. However, real-world data often suffer from class imbalance. Multi-class tasks in which classes carry different misclassification costs are harder to handle than two-class ones. Undersampling and oversampling are two of the most popular data preprocessing techniques for imbalanced data sets. Ensemble classifiers have been shown to be more effective than data sampling techniques at improving classification performance on imbalanced data. Moreover, combining ensemble learning with sampling methods to tackle the class imbalance problem has led to several proposals in the literature, with positive results. The ensemble margin is a fundamental concept in ensemble learning. Several studies have shown that the generalization performance of an ensemble classifier is related to the distribution of its margins on the training examples. In this paper, we propose a novel ensemble-margin-based algorithm that handles imbalanced classification by making greater use of low-margin examples, which are more informative than high-margin ones. The algorithm combines ensemble learning with undersampling, but instead of balancing classes randomly, as UnderBagging does, it focuses on constructing higher-quality balanced sets for each base classifier. To demonstrate the effectiveness of the proposed method in handling class-imbalanced data, UnderBagging and SMOTEBagging are used in a comparative analysis. We also compare the performance of different ensemble margin definitions, including both supervised and unsupervised margins, in class imbalance learning.
Pages: 28
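
The abstract names ensemble margins and margin-guided undersampling but gives no formulas, so the Python sketch below is illustrative only and is not the authors' algorithm. It assumes the common vote-based definitions: a supervised margin (votes for the true class minus the largest vote count for any other class, divided by the total votes) and an unsupervised margin (difference between the two largest vote counts, divided by the total votes). The function names and the rule "keep the lowest-margin examples of each class, down to the size of the smallest class" are hypothetical choices made for this example.

```python
import numpy as np


def supervised_margin(votes, y):
    """Margin using true labels: (v_true - max other votes) / total votes.

    votes: (n_samples, n_classes) array of vote counts from the ensemble.
    y:     (n_samples,) integer class indices of the true labels.
    """
    votes = np.asarray(votes, dtype=float)
    y = np.asarray(y)
    rows = np.arange(votes.shape[0])
    true_votes = votes[rows, y]
    others = votes.copy()
    others[rows, y] = -np.inf          # mask the true class
    return (true_votes - others.max(axis=1)) / votes.sum(axis=1)


def unsupervised_margin(votes):
    """Margin without labels: (largest - second largest votes) / total votes."""
    votes = np.asarray(votes, dtype=float)
    top_two = np.sort(votes, axis=1)[:, -2:]
    return (top_two[:, 1] - top_two[:, 0]) / votes.sum(axis=1)


def low_margin_undersample(margins, y):
    """Hypothetical selection rule: per class, keep the lowest-margin examples.

    Every class is reduced to the size of the smallest class, which yields
    a balanced index set biased toward low-margin examples.
    """
    margins = np.asarray(margins, dtype=float)
    y = np.asarray(y)
    classes, counts = np.unique(y, return_counts=True)
    n_keep = counts.min()
    kept = []
    for c in classes:
        idx = np.flatnonzero(y == c)
        order = np.argsort(margins[idx], kind="stable")
        kept.extend(idx[order[:n_keep]])
    return np.sort(np.array(kept))


# Toy example: 4 samples, 2 classes, 10 base classifiers voting.
votes = np.array([[10, 0], [6, 4], [3, 7], [5, 5]])
y = np.array([0, 0, 0, 1])
m_sup = supervised_margin(votes, y)
m_unsup = unsupervised_margin(votes)
selected = low_margin_undersample(m_sup, y)
```

In a bagging setting, a selection of this kind would typically be randomized per base classifier rather than deterministic; the deterministic version is used here only to keep the sketch short.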