Multicriteria Classifier Ensemble Learning for Imbalanced Data

被引:0
|
作者
Wegier, Weronika [1 ]
Koziarski, Michal [2 ]
Wozniak, Micha [1 ]
机构
[1] Wroclaw Univ Sci & Technol, Dept Syst & Comp Networks, PL-50370 Wroclaw, Poland
[2] AGH Univ Sci & Technol, Dept Elect, PL-30059 Krakow, Poland
来源
IEEE ACCESS | 2022年 / 10卷
关键词
Measurement; Optimization; Costs; Task analysis; Bagging; Training; Licenses; Classifier ensemble; imbalanced data; multi-objective optimization; pattern classification; ALGORITHMS; DIVERSITY; AREA;
D O I
10.1109/ACCESS.2022.3149914
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the vital problems with the imbalanced data classifier training is the definition of an optimization criterion. Typically, since the exact cost of misclassification of the individual classes is unknown, combined metrics and loss functions that roughly balance the cost for each class are used. However, this approach can lead to a loss of information, since different trade-offs between class misclassification rates can produce similar combined metric values. To address this issue, this paper discusses a multi-criteria ensemble training method for the imbalanced data. The proposed method jointly optimizes precision and recall, and provides the end-user with a set of Pareto optimal solutions, from which the final one can be chosen according to the user's preference. The proposed approach was evaluated on a number of benchmark datasets and compared with the single-criterion approach (where the selected criterion was one of the chosen metrics). The results of the experiments confirmed the usefulness of the obtained method, which on the one hand guarantees good quality, i.e., not worse than the one obtained with the use of single-criterion optimization, and on the other hand, offers the user the opportunity to choose the solution that best meets their expectations regarding the trade-off between errors on the minority and the majority class.
引用
收藏
页码:16807 / 16818
页数:12
相关论文
共 50 条
  • [1] Multicriteria Classifier Ensemble Learning for Imbalanced Data
    Wegier, Weronika
    Koziarski, Michal
    Wozniak, Micha
    Wegier, Weronika
    [J]. IEEE Access, 2022, 10 : 16807 - 16818
  • [2] Hybrid Classifier Ensemble for Imbalanced Data
    Yang, Kaixiang
    Yu, Zhiwen
    Wen, Xin
    Cao, Wenming
    Chen, C. L. Philip
    Wong, Hau-San
    You, Jane
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (04) : 1387 - 1400
  • [3] An Adaptive Sampling Ensemble Classifier for Learning from Imbalanced Data Sets
    Geiler, Ordonez Jon
    Hong, Li
    Yue-Jian, Guo
    [J]. INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS (IMECS 2010), VOLS I-III, 2010, : 513 - 517
  • [4] ENSEMBLE CLASSIFIER AND RESAMPLING FOR IMBALANCED MULTICLASS LEARNING
    Sainin, Mohd Shamrie
    Ahmad, Faudziah
    Alfred, Rayner
    [J]. PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON COMPUTING & INFORMATICS, 2015, : 751 - 756
  • [5] A Direct Ensemble Classifier for Imbalanced Multiclass Learning
    Sainin, Mohd Shamrie
    Alfred, Rayner
    [J]. 2012 4TH CONFERENCE ON DATA MINING AND OPTIMIZATION (DMO), 2012, : 59 - 66
  • [6] Progressive Hybrid Classifier Ensemble for Imbalanced Data
    Yang, Kaixiang
    Yu, Zhiwen
    Chen, C. L. Philip
    Cao, Wenming
    Wong, Hau-San
    You, Jane
    Han, Guoqiang
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (04): : 2464 - 2478
  • [7] A new classifier for imbalanced data with iterative learning process and ensemble operating process
    Pan, Tingting
    Pedrycz, Witold
    Yang, Jie
    Wu, Wei
    Zhang, Yulin
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 249
  • [8] Imbalanced Ensemble Classifier for Learning from Imbalanced Business School Dataset
    Chakraborty, Tanujit
    [J]. INTERNATIONAL JOURNAL OF MATHEMATICAL ENGINEERING AND MANAGEMENT SCIENCES, 2019, 4 (04) : 861 - 869
  • [9] An Ensemble Tree Classifier for Highly Imbalanced Data Classification
    Shi, Peibei
    Wang, Zhong
    [J]. JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2021, 34 (06) : 2250 - 2266
  • [10] An Ensemble Tree Classifier for Highly Imbalanced Data Classification
    SHI Peibei
    WANG Zhong
    [J]. Journal of Systems Science & Complexity, 2021, 34 (06) : 2250 - 2266