Multi-class and feature selection extensions of Roughly Balanced Bagging for imbalanced data

被引:0
|
作者
Mateusz Lango
Jerzy Stefanowski
机构
[1] Poznań University of Technology,Institute of Computing Science
关键词
Class imbalance; Roughly balanced bagging; Types of minority examples; Feature selection; Multiple imbalanced classes;
D O I
暂无
中图分类号
学科分类号
摘要
Roughly Balanced Bagging is one of the most efficient ensembles specialized for class imbalanced data. In this paper, we study its basic properties that may influence its good classification performance. We experimentally analyze them with respect to bootstrap construction, deciding on the number of component classifiers, their diversity, and ability to deal with the most difficult types of the minority examples. Then, we introduce two generalizations of this ensemble for dealing with a higher number of attributes and for adapting it to handle multiple minority classes. Experiments with synthetic and real life data confirm usefulness of both proposals.
引用
收藏
页码:97 / 127
页数:30
相关论文
共 50 条
  • [1] Multi-class and feature selection extensions of Roughly Balanced Bagging for imbalanced data
    Lango, Mateusz
    Stefanowski, Jerzy
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2018, 50 (01) : 97 - 127
  • [2] The Usefulness of Roughly Balanced Bagging for Complex and High-Dimensional Imbalanced Data
    Lango, Mateusz
    Stefanowski, Jerzy
    NEW FRONTIERS IN MINING COMPLEX PATTERNS, 2016, 9607 : 93 - 107
  • [3] Feature Selection for Multi-Class Imbalanced Data Sets Based on Genetic Algorithm
    Du L.-M.
    Xu Y.
    Zhu H.
    Ann. Data Sci., 3 (293-300): : 293 - 300
  • [4] Diversity Analysis on Imbalanced Data Using Neighbourhood and Roughly Balanced Bagging Ensembles
    Blaszczynski, Jerzy
    Lango, Mateusz
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2016, 2016, 9692 : 552 - 562
  • [5] Feature selection and its combination with data over-sampling for multi-class imbalanced datasets
    Tsai, Chih-Fong
    Chen, Kuan-Chen
    Lin, Wei -Chao
    APPLIED SOFT COMPUTING, 2024, 153
  • [6] Multi-class Boosting for Imbalanced Data
    Fernandez-Baldera, Antonio
    Buenaposada, Jose M.
    Baumela, Luis
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2015), 2015, 9117 : 57 - 64
  • [7] Multi-class WHMBoost: An ensemble algorithm for multi-class imbalanced data
    Zhao, Jiakun
    Jin, Ju
    Zhang, Yibo
    Zhang, Ruifeng
    Chen, Si
    INTELLIGENT DATA ANALYSIS, 2022, 26 (03) : 599 - 614
  • [8] Multi-class extensions of the GLDB feature extraction algorithm for spectral data
    Paclík, Pavel
    Verzakov, Serguei
    Duin, Robert P.W.
    Proc. Int. Conf. Pattern Recognit., 1600, (629-632):
  • [9] Actively Balanced Bagging for Imbalanced Data
    Blaszczynski, Jerzy
    Stefanowski, Jerzy
    FOUNDATIONS OF INTELLIGENT SYSTEMS, ISMIS 2017, 2017, 10352 : 271 - 281
  • [10] Multi-class extensions of the GLDB feature extraction algorithm for spectral data
    Paclík, P
    Verzakov, S
    Duin, RPW
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, 2004, : 629 - 632