A systematic study of the class imbalance problem in convolutional neural networks

被引：1384

作者：

Buda, Mateusz ^{[1
,2
]}

Maki, Atsuto ^{[2
]}

Mazurowski, Maciej A. ^{[1
,3
]}

机构：

[1] Duke Univ, Dept Radiol, Sch Med, Durham, NC 27710 USA

[2] KTH Royal Inst Technol, Sch Elect Engn & Comp Sci, Stockholm, Sweden

[3] Duke Univ, Dept Elect & Comp Engn, Durham, NC USA

来源：

NEURAL NETWORKS | 2018年 / 106卷

关键词：

Class imbalance; Convolutional neural networks; Deep learning; Image classification; NOVELTY DETECTION APPROACH; CLASSIFICATION; SMOTE; CLASSIFIERS;

D O I：

10.1016/j.neunet.2018.07.011

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this study, we systematically investigate the impact of class imbalance on classification performance of convolutional neural networks (CNNs) and compare frequently used methods to address the issue. Class imbalance is a common problem that has been comprehensively studied in classical machine learning, yet very limited systematic research is available in the context of deep learning. In our study, we use three benchmark datasets of increasing complexity, MNIST, CIFAR-10 and ImageNet, to investigate the effects of imbalance on classification and perform an extensive comparison of several methods to address the issue: oversampling, undersampling, two-phase training, and thresholding that compensates for prior class probabilities. Our main evaluation metric is area under the receiver operating characteristic curve (ROC AUC) adjusted to multi-class tasks since overall accuracy metric is associated with notable difficulties in the context of imbalanced data. Based on results from our experiments we conclude that (i) the effect of class imbalance on classification performance is detrimental; (ii) the method of addressing class imbalance that emerged as dominant in almost all analyzed scenarios was oversampling; (iii) oversampling should be applied to the level that completely eliminates the imbalance, whereas the optimal undersampling ratio depends on the extent of imbalance; (iv) as opposed to some classical machine learning models, oversampling does not cause overfitting of CNNs; (v) thresholding should be applied to compensate for prior class probabilities when overall number of properly classified cases is of interest. (c) 2018 Elsevier Ltd. All rights reserved.

引用

页码：249 / 259

页数：11

共 50 条

[41] The class imbalance problem in UCS classifier system:: A preliminary study
Orriols-Puig, Albert
Bernado-Mansilla, Ester
[J]. LEARNING CLASSIFIER SYSTEMS, 2007, 4399 : 161 - 180
[42] A Systematic Study of Online Class Imbalance Learning With Concept Drift
Wang, Shuo
Minku, Leandro L.
Yao, Xin
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (10) : 4802 - 4821
[43] HLA class I binding prediction via convolutional neural networks
Vang, Yeeleng S.
Xie, Xiaohui
[J]. BIOINFORMATICS, 2017, 33 (17) : 2658 - 2665
[44] Layered Convolutional Neural Networks for Multi-Class Image Classification
Kasinets, Dzmitry
Saeed, Amir K.
Johnson, Benjamin A.
Rodriguez, Benjamin M.
[J]. REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2024, 2024, 13034
[45] Convolutional Neural Networks for Multi-class Intrusion Detection System
Potluri, Sasanka
Ahmed, Shamim
Diedrich, Christian
[J]. MINING INTELLIGENCE AND KNOWLEDGE EXPLORATION, MIKE 2018, 2018, 11308 : 225 - 238
[46] On Class-Incremental Learning for Fully Binarized Convolutional Neural Networks
Basso-Bert, Yanis
Guiequero, William
Molnos, Anca
Lemaire, Romain
Dupret, Antoine
[J]. 2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
[47] Efficient training algorithms for a class of shunting inhibitory convolutional neural networks
Tivive, FHC
Bouzerdoum, A
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2005, 16 (03): : 541 - 556
[48] A new class of convolutional neural networks (SICoNNets) and their application to face detection
Tivive, FHC
Bouzerdoum, A
[J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 2157 - 2162
[49] Effectiveness of Adversarial Attacks on Class-Imbalanced Convolutional Neural Networks
Possas, Rafael
Zhou, Ying
[J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT II, 2017, 10635 : 333 - 342
[50] Convolutional neural networks
Alexander Derry
Martin Krzywinski
Naomi Altman
[J]. Nature Methods, 2023, 20 : 1269 - 1270

← 1 2 3 4 5 →