Dynamic Classifier Selection for Data with Skewed Class Distribution Using Imbalance Ratio and Euclidean Distance

被引:1
|
作者
Zyblewski, Pawel [1 ]
Wozniak, Michal [1 ]
机构
[1] Wroclaw Univ Sci & Technol, Fac Elect, Dept Syst & Comp Networks, Wybrzeze Wyspianskiego 27, PL-50370 Wroclaw, Poland
来源
关键词
Classifier ensemble; Dynamic Classifier Selection; Imbalanced data;
D O I
10.1007/978-3-030-50423-6_5
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Imbalanced data analysis remains one of the critical challenges in machine learning. This work aims to adapt the concept of Dynamic Classifier Selection (dcs) to the pattern classification task with the skewed class distribution. Two methods, using the similarity (distance) to the reference instances and class imbalance ratio to select the most confident classifier for a given observation, have been proposed. Both approaches come in two modes, one based on the k-Nearest Oracles (knora) and the other also considering those cases where the classifier makes a mistake. The proposed methods were evaluated based on computer experiments carried out on 41 datasets with a high imbalance ratio. The obtained results and statistical analysis confirm the usefulness of the proposed solutions.
引用
收藏
页码:59 / 73
页数:15
相关论文
共 50 条
  • [21] Classifier Selection and Ensemble Model for Multi-class Imbalance Learning in Education Grants Prediction
    Sun, Yu
    Li, Zhanli
    Li, Xuewen
    Zhang, Jing
    APPLIED ARTIFICIAL INTELLIGENCE, 2021, 35 (04) : 290 - 303
  • [22] Detecting Human Phosphorylated Protein by Using Class Imbalance Learning and Ensemble Classifier
    Xiao, Xuan
    Liao, Shun-lu
    Qiu, Wang-ren
    INTERNATIONAL CONFERENCE ON MATERIALS, MANUFACTURING AND MECHANICAL ENGINEERING (MMME 2016), 2016, : 349 - 354
  • [23] Similar Vague Concepts Selection Using Their Euclidean Distance at Different Granulation
    Prem Kumar Singh
    Cognitive Computation, 2018, 10 : 228 - 241
  • [24] A study on combining dynamic selection and data preprocessing for imbalance learning
    Roy, Anandarup
    Cruz, Rafael M. O.
    Sabourin, Robert
    Cavalcanti, George D. C.
    NEUROCOMPUTING, 2018, 286 : 179 - 192
  • [25] Similar Vague Concepts Selection Using Their Euclidean Distance at Different Granulation
    Singh, Prem Kumar
    COGNITIVE COMPUTATION, 2018, 10 (02) : 228 - 241
  • [26] The Distance-Based Balancing Ensemble Method for Data With a High Imbalance Ratio
    Chen, Dong
    Wang, Xiao-Jun
    Zhou, Changjun
    Wang, Bin
    IEEE ACCESS, 2019, 7 : 68940 - 68956
  • [27] Dynamic classifier selection using clustering for spam detection
    Saeedian, Mehrnoush Famil
    Beigy, Hamid
    2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, 2009, : 84 - 88
  • [28] Regression for non-Euclidean data using distance matrices
    Faraway, Julian J.
    JOURNAL OF APPLIED STATISTICS, 2014, 41 (11) : 2342 - 2357
  • [29] Dynamic classifier ensemble model for customer classification with imbalanced class distribution
    Xiao, Jin
    Xie, Ling
    He, Changzheng
    Jiang, Xiaoyi
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (03) : 3668 - 3675
  • [30] Novel Approach to Predict Hospital Readmissions Using Feature Selection from Unstructured Data with Class Imbalance
    Sundararaman, Arun
    Ramanathan, Srinivasan Valady
    Thati, Ramprasad
    BIG DATA RESEARCH, 2018, 13 : 65 - 75