THE METHODS FOR QUANTITATIVE SOLVING THE CLASS IMBALANCE PROBLEM

被引:3
|
作者
Kavrin, D. A. [1 ]
Subbotin, S. A. [1 ]
机构
[1] Zaporizhzhya Natl Tech Univ, Dept Software Tools, Zaporizhzhya, Ukraine
关键词
sample; example; quality metric; cluster; classificatory; majority class; minority class;
D O I
10.15588/1607-3274-2018-1-10
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Context. The problem of recovery the classes' balance in imbalanced samples is solved to increase the efficiency of diagnostic and recognition models. Objective. The purpose of the work is to modify the existing method of recovery classes' balance and to conduct comparative analysis of performance indicators with some modern methods. Method. The proposed data preprocessing method is based on combining the undersampling and cluster-analysis technologies. The method has allowed restoring the balance and reducing the sample while maintaining important topological properties of the sample, high accuracy and acceptable operating time. Results. The software that implements in proposed method has been developed and used in the computational experiments on the study of method's properties and comparative analysis with other methods of restoring classes' balance. Conclusions. The experiments confirmed the efficiency of the proposed method and its implemented software. The method has allowed reducing the majority class to the size of the minority class, thus reducing the training sample (the sample is considered imbalanced if the size of the minority class is less than 10% of the original sample size), while demonstrating the best indicators of model accuracy and comparable sampling speed. It can be recommended for the practical application in solving problems of imbalance data for diagnostic and recognition models.
引用
收藏
页码:83 / 90
页数:8
相关论文
共 50 条
  • [1] Improving classification of mature microRNA by solving class imbalance problem
    Wang, Ying
    Li, Xiaoye
    Tao, Bairui
    [J]. SCIENTIFIC REPORTS, 2016, 6
  • [2] Improving classification of mature microRNA by solving class imbalance problem
    Ying Wang
    Xiaoye Li
    Bairui Tao
    [J]. Scientific Reports, 6
  • [3] Oversampling Methods to Handle the Class Imbalance Problem: A Review
    Sharma, Harsh
    Gosain, Anushika
    [J]. SOFT COMPUTING AND ITS ENGINEERING APPLICATIONS, ICSOFTCOMP 2022, 2023, 1788 : 96 - 110
  • [4] Coupling different methods for overcoming the class imbalance problem
    Nanni, Loris
    Fantozzi, Carlo
    Lazzarini, Nicola
    [J]. NEUROCOMPUTING, 2015, 158 : 48 - 61
  • [5] Impact of class imbalance ratio on ensemble methods for imbalance problem: A new perspective
    Kumari, Ritika
    Singh, Jaspreeti
    Gosain, Anjana
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 10823 - 10834
  • [6] Quantitative Problem Solving Methods in the Airline Industry
    Cumming, Simon
    [J]. INTERFACES, 2013, 43 (05) : 491 - 494
  • [7] Solving the class imbalance problem using a counterfactual method for data augmentation
    Temraz, Mohammed
    Keane, Mark T.
    [J]. MACHINE LEARNING WITH APPLICATIONS, 2022, 9
  • [8] The class imbalance problem
    Megahed, Fadel M.
    Chen, Ying-Ju
    Megahed, Aly
    Ong, Yuya
    Altman, Naomi
    Krzywinski, Martin
    [J]. NATURE METHODS, 2021, 18 (11) : 1270 - 1272
  • [9] The class imbalance problem
    Fadel M. Megahed
    Ying-Ju Chen
    Aly Megahed
    Yuya Ong
    Naomi Altman
    Martin Krzywinski
    [J]. Nature Methods, 2021, 18 : 1270 - 1272
  • [10] On the Class Imbalance Problem
    Guo, Xinjian
    Yin, Yilong
    Dong, Cailing
    Yang, Gongping
    Zhou, Guangtong
    [J]. ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2008, : 192 - 201