Dynamic Classifier Selection for Data with Skewed Class Distribution Using Imbalance Ratio and Euclidean Distance

被引:1
|
作者
Zyblewski, Pawel [1 ]
Wozniak, Michal [1 ]
机构
[1] Wroclaw Univ Sci & Technol, Fac Elect, Dept Syst & Comp Networks, Wybrzeze Wyspianskiego 27, PL-50370 Wroclaw, Poland
来源
关键词
Classifier ensemble; Dynamic Classifier Selection; Imbalanced data;
D O I
10.1007/978-3-030-50423-6_5
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Imbalanced data analysis remains one of the critical challenges in machine learning. This work aims to adapt the concept of Dynamic Classifier Selection (dcs) to the pattern classification task with the skewed class distribution. Two methods, using the similarity (distance) to the reference instances and class imbalance ratio to select the most confident classifier for a given observation, have been proposed. Both approaches come in two modes, one based on the k-Nearest Oracles (knora) and the other also considering those cases where the classifier makes a mistake. The proposed methods were evaluated based on computer experiments carried out on 41 datasets with a high imbalance ratio. The obtained results and statistical analysis confirm the usefulness of the proposed solutions.
引用
收藏
页码:59 / 73
页数:15
相关论文
共 50 条
  • [31] On the estimation of unknown distances for a class of Euclidean distance matrix completion problems with interval data
    Baez Sanchez, Andres David
    Lavor, Carlile
    LINEAR ALGEBRA AND ITS APPLICATIONS, 2020, 592 : 287 - 305
  • [32] Euclidean distance-based method for variable selection and its application in geological data
    Chen, Peng Yu
    Yu, Hong Ming
    JOURNAL OF INTERDISCIPLINARY MATHEMATICS, 2016, 19 (02) : 215 - 232
  • [33] Feature Selection Using Euclidean Distance and Cosine Similarity for Intrusion Detection Model
    Suebsing, Anirut
    Hiransakolwong, Nualsawat
    2009 FIRST ASIAN CONFERENCE ON INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2009, : 86 - 91
  • [34] Discrepancies Between Euclidean Distance and Compositional Analyses of Resource Selection Data With Known Parameters
    Bingham, Ralph L.
    Brennan, Leonard A.
    Ballard, Bart M.
    JOURNAL OF WILDLIFE MANAGEMENT, 2010, 74 (03): : 582 - 587
  • [35] Federated Learning for Distribution Skewed Data Using Sample Weights
    Nguyen H.
    Wu P.
    Chang J.M.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (06): : 2615 - 2626
  • [36] Dynamic classifier selection for effective mining from noisy data streams
    Zhu, XQ
    Wu, XD
    Yang, Y
    FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 305 - 312
  • [37] A Hybrid Supervised Machine Learning Classifier System for Breast Cancer Prognosis Using Feature Selection and Data Imbalance Handling Approaches
    Solanki, Yogendra Singh
    Chakrabarti, Prasun
    Jasinski, Michal
    Leonowicz, Zbigniew
    Bolshev, Vadim
    Vinogradov, Alexander
    Jasinska, Elzbieta
    Gono, Radomir
    Nami, Mohammad
    ELECTRONICS, 2021, 10 (06) : 1 - 16
  • [38] An Enhanced Dynamic Ensemble Selection Classifier for Imbalance Classification With Application to China Corporation Bond Default Prediction
    Wang, Yu
    Zhang, Junbin
    Yan, Wei
    IEEE ACCESS, 2023, 11 : 32082 - 32094
  • [39] Combating the Small Sample Class Imbalance Problem Using Feature Selection
    Wasikowski, Mike
    Chen, Xue-wen
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (10) : 1388 - 1400
  • [40] An empirical study of dynamic selection and random under-sampling for the class imbalance problem
    Liu, Shuhua Monica
    Chen, Jiun-Hung
    Liu, Zhiheng
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 221