Dynamic Centroid Insertion and Adjustment for Data Sets with Multiple Imbalanced Classes

Authors
Silva, Evandro J. R. [1 ]
Zanchettin, Cleber [1 ]
Affiliation
[1] Univ Fed Pernambuco, Ctr Informat, Recife, PE, Brazil
Source
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: DEEP LEARNING, PT II | 2019, Vol. 11728
Keywords
Prototype generation; Imbalanced domains; Multiclass; Classification
DOI
10.1007/978-3-030-30484-3_60
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The imbalance problem is receiving increasing attention in the literature. Studies of binary cases are common, but work on the multiclass setting remains limited. Solutions for imbalanced domains may be divided into two groups: data-level approaches and algorithmic approaches. The former is more common and focuses on changing the training data to balance the data set by oversampling the smallest classes, undersampling the largest ones, or combining both. Instance reduction is another approach to the problem: it tries to find the best reduced set of instances that represents the original training set. In this work, we propose a new Prototype Generation method called DCIA. It dynamically inserts new prototypes for each class and then adjusts their positions with a search algorithm. The set of generated prototypes may be used to train any classifier. Experiments showed its potential by enabling a 1NN classifier to perform sometimes as well as, or even better than, some ensemble classifiers created for different multiclass imbalanced domains.
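The insert-then-adjust idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' DCIA algorithm, whose details are not given in this record. Every concrete choice here is an assumption made for illustration: starting from one centroid per class, inserting a prototype at the mean of the misclassified instances of the class with the lowest recall, adjusting positions by random hill climbing, and scoring with balanced accuracy. The function names `predict_1nn`, `balanced_accuracy`, and `generate_prototypes` and all parameters are hypothetical.

```python
import numpy as np


def predict_1nn(protos, proto_labels, X):
    """Label each row of X with the class of its nearest prototype."""
    dists = ((X[:, None, :] - protos[None, :, :]) ** 2).sum(axis=2)
    return proto_labels[dists.argmin(axis=1)]


def balanced_accuracy(y_true, y_pred):
    """Mean per-class recall, so small classes count as much as large ones."""
    classes = np.unique(y_true)
    return float(np.mean([(y_pred[y_true == c] == c).mean() for c in classes]))


def generate_prototypes(X, y, max_insertions=10, search_steps=200,
                        step=0.1, seed=0):
    """Illustrative prototype generation (assumed scheme, not DCIA itself)."""
    rng = np.random.default_rng(seed)
    classes = np.unique(y)
    # Assumption: start with one prototype per class, the class centroid.
    protos = np.array([X[y == c].mean(axis=0) for c in classes])
    labels = classes.copy()

    # Insertion phase: add a prototype for the class with the lowest recall,
    # placed at the mean of that class's misclassified training instances.
    for _ in range(max_insertions):
        pred = predict_1nn(protos, labels, X)
        wrong = pred != y
        if not wrong.any():
            break
        recalls = [(pred[y == c] == c).mean() for c in classes]
        worst = classes[int(np.argmin(recalls))]
        new_proto = X[(y == worst) & wrong].mean(axis=0)
        protos = np.vstack([protos, new_proto])
        labels = np.append(labels, worst)

    # Adjustment phase: random hill climbing, standing in for the
    # unspecified search algorithm. A move is kept only if it improves
    # balanced accuracy on the training set.
    best = balanced_accuracy(y, predict_1nn(protos, labels, X))
    scale = step * X.std(axis=0)
    for _ in range(search_steps):
        cand = protos.copy()
        i = rng.integers(len(protos))
        cand[i] += rng.normal(size=X.shape[1]) * scale
        score = balanced_accuracy(y, predict_1nn(cand, labels, X))
        if score > best:
            protos, best = cand, score
    return protos, labels


# Usage on a synthetic imbalanced three-class data set (100 / 20 / 5 samples).
data_rng = np.random.default_rng(42)
X_demo = np.vstack([
    data_rng.normal([0, 0], 1.0, (100, 2)),
    data_rng.normal([10, 10], 1.0, (20, 2)),
    data_rng.normal([-10, 10], 1.0, (5, 2)),
])
y_demo = np.repeat([0, 1, 2], [100, 20, 5])
demo_protos, demo_labels = generate_prototypes(X_demo, y_demo)
print(len(demo_protos), balanced_accuracy(
    y_demo, predict_1nn(demo_protos, demo_labels, X_demo)))
```

Balanced accuracy is used as the search objective in this sketch because plain accuracy would let the largest class dominate. The returned prototypes and labels could be passed to any classifier as a reduced training set, in line with the abstract's statement that the generated set is classifier-agnostic.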
Pages: 766-778
Number of pages: 13