A systematic study of the class imbalance problem in convolutional neural networks

Cited by: 1384
Authors
Buda, Mateusz [1, 2]
Maki, Atsuto [2]
Mazurowski, Maciej A. [1, 3]
Affiliations
[1] Duke Univ, Dept Radiol, Sch Med, Durham, NC 27710 USA
[2] KTH Royal Inst Technol, Sch Elect Engn & Comp Sci, Stockholm, Sweden
[3] Duke Univ, Dept Elect & Comp Engn, Durham, NC USA
Keywords
Class imbalance; Convolutional neural networks; Deep learning; Image classification; NOVELTY DETECTION APPROACH; CLASSIFICATION; SMOTE; CLASSIFIERS;
DOI
10.1016/j.neunet.2018.07.011
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this study, we systematically investigate the impact of class imbalance on the classification performance of convolutional neural networks (CNNs) and compare frequently used methods to address the issue. Class imbalance is a common problem that has been comprehensively studied in classical machine learning, yet very limited systematic research is available in the context of deep learning. In our study, we use three benchmark datasets of increasing complexity, MNIST, CIFAR-10 and ImageNet, to investigate the effects of imbalance on classification and perform an extensive comparison of several methods to address the issue: oversampling, undersampling, two-phase training, and thresholding that compensates for prior class probabilities. Our main evaluation metric is the area under the receiver operating characteristic curve (ROC AUC) adjusted to multi-class tasks, since the overall accuracy metric is associated with notable difficulties in the context of imbalanced data. Based on the results of our experiments, we conclude that (i) the effect of class imbalance on classification performance is detrimental; (ii) the method of addressing class imbalance that emerged as dominant in almost all analyzed scenarios was oversampling; (iii) oversampling should be applied to the level that completely eliminates the imbalance, whereas the optimal undersampling ratio depends on the extent of imbalance; (iv) as opposed to some classical machine learning models, oversampling does not cause overfitting of CNNs; (v) thresholding should be applied to compensate for prior class probabilities when the overall number of properly classified cases is of interest. © 2018 Elsevier Ltd. All rights reserved.
Pages: 249 - 259
Number of pages: 11
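Two of the methods named in the abstract, oversampling to full balance and thresholding that compensates for prior class probabilities, can be illustrated with a minimal NumPy sketch. This is not the authors' code. The function names `random_oversample` and `compensate_for_priors` are hypothetical, the toy data is made up for illustration, and the thresholding step assumes one common reading of the technique: divide each predicted class probability by the class prior estimated from the training set, then renormalize.

```python
import numpy as np


def random_oversample(X, y, rng):
    """Randomly replicate samples until every class matches the largest class.

    Illustrative helper (hypothetical name); balances classes completely.
    """
    classes, counts = np.unique(y, return_counts=True)
    target = counts.max()
    parts = []
    for c, n in zip(classes, counts):
        idx = np.flatnonzero(y == c)
        # Draw the missing samples with replacement from the same class.
        extra = rng.choice(idx, size=target - n, replace=True)
        parts.append(np.concatenate([idx, extra]))
    keep = np.concatenate(parts)
    return X[keep], y[keep]


def compensate_for_priors(probs, priors):
    """Divide predicted probabilities by class priors and renormalize.

    Assumed form of prior-compensating thresholding (hypothetical name).
    """
    adjusted = probs / priors
    return adjusted / adjusted.sum(axis=1, keepdims=True)


# Toy imbalanced dataset: 8 samples of class 0, 2 samples of class 1.
rng = np.random.default_rng(0)
X = np.arange(10).reshape(-1, 1)
y = np.array([0] * 8 + [1] * 2)

# Oversampling: both classes end up with 8 samples.
X_bal, y_bal = random_oversample(X, y, rng)

# Thresholding: priors estimated from the imbalanced training labels.
priors = np.bincount(y) / len(y)
probs = np.array([[0.6, 0.4]])  # made-up classifier output for one sample
adjusted = compensate_for_priors(probs, priors)
raw_pred = int(probs.argmax(axis=1)[0])
adj_pred = int(adjusted.argmax(axis=1)[0])
```

In this toy case the raw output favors the majority class, while the prior-compensated output favors the minority class, which is the effect the thresholding method is meant to produce.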