Survey on deep learning with class imbalance

Authors
Justin M. Johnson
Taghi M. Khoshgoftaar
Affiliation
[1] Florida Atlantic University
Keywords
Deep learning; Deep neural networks; Class imbalance; Big data
DOI
Not available
Abstract
The purpose of this study is to examine existing deep learning techniques for addressing class-imbalanced data. Effective classification with imbalanced data is an important area of research, as high class imbalance is naturally inherent in many real-world applications, e.g., fraud detection and cancer detection. Moreover, highly imbalanced data poses added difficulty, as most learners will exhibit bias towards the majority class and, in extreme cases, may ignore the minority class altogether. Class imbalance has been studied thoroughly over the last two decades using traditional machine learning models, i.e., non-deep learning. Despite recent advances in deep learning, along with its increasing popularity, very little empirical work in the area of deep learning with class imbalance exists. Because deep neural networks have achieved record-breaking performance in several complex domains, investigating their use for problems containing high levels of class imbalance is of great interest. Available studies regarding class imbalance and deep learning are surveyed in order to better understand the efficacy of deep learning when applied to class-imbalanced data. This survey discusses the implementation details and experimental results for each study, and offers additional insight into their strengths and weaknesses. Areas of focus include data complexity, architectures tested, performance interpretation, ease of use, big data application, and generalization to other domains. We have found that research in this area is very limited, that most existing work focuses on computer vision tasks with convolutional neural networks, and that the effects of big data are rarely considered. Several traditional methods for class imbalance, e.g., data sampling and cost-sensitive learning, prove to be applicable in deep learning, while more advanced methods that exploit neural network feature learning abilities show promising results. The survey concludes with a discussion that highlights various gaps in deep learning from class-imbalanced data for the purpose of guiding future research.
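As a rough illustration of the two traditional approaches the abstract names, data sampling and cost-sensitive learning, the sketch below computes per-class loss weights and randomly oversamples the minority class. It is not taken from the survey: the function names, the toy 95:5 label split, and the inverse-frequency weighting heuristic are all assumptions chosen for illustration.

```python
import random
from collections import Counter


def inverse_frequency_weights(labels):
    """Cost-sensitive weights (assumed heuristic): n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {c: n_samples / (n_classes * cnt) for c, cnt in counts.items()}


def random_oversample(samples, labels, seed=0):
    """Data sampling: duplicate minority samples at random until every class
    has as many examples as the largest class."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_x, out_y = list(samples), list(labels)
    for c, cnt in counts.items():
        pool = [x for x, y in zip(samples, labels) if y == c]
        extra = [rng.choice(pool) for _ in range(target - cnt)]
        out_x.extend(extra)
        out_y.extend([c] * len(extra))
    return out_x, out_y


# Hypothetical toy data: 95 majority-class and 5 minority-class examples.
labels = [0] * 95 + [1] * 5
samples = list(range(100))

weights = inverse_frequency_weights(labels)
balanced_x, balanced_y = random_oversample(samples, labels)
```

In this toy setup the minority class receives a much larger weight than the majority class, so its errors would count more in a weighted loss, while the oversampled set contains equal numbers of both classes. Either output could be passed to a deep learning framework's weighted loss or data loader, though which option works better is an empirical question of the kind the survey examines.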
Related papers
50 in total
  • [31] Tackling the class imbalance problem of deep learning-based head and neck organ segmentation
    Tappeiner, Elias
    Welk, Martin
    Schubert, Rainer
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2022
  • [32] Distribution Based Ensemble for Class Imbalance Learning
    Mustafa, Ghulam
    Niu, Zhendong
    Yousif, Abdallah
    Tarus, John
    FIFTH INTERNATIONAL CONFERENCE ON THE INNOVATIVE COMPUTING TECHNOLOGY (INTECH 2015), 2015, : 5 - 10
  • [33] Learning in the presence of class imbalance and concept drift
    Wang, Shuo
    Minku, Leandro L.
    Chawla, Nitesh
    Yao, Xin
    NEUROCOMPUTING, 2019, 343 : 1 - 2
  • [34] Hybrid Sampling with Bagging for Class Imbalance Learning
    Lu, Yang
    Cheung, Yiu-ming
    Tang, Yuan Yan
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT I, 2016, 9651 : 14 - 26
  • [35] Learning from data streams and class imbalance
    Wang, Shuo
    Minku, Leandro L.
    Chawla, Nitesh
    Yao, Xin
    CONNECTION SCIENCE, 2019, 31 (02) : 103 - 104
  • [36] Dynamic class imbalance learning for incremental LPSVM
    Pang, Shaoning
    Zhu, Lei
    Chen, Gang
    Sarrafzadeh, Abdolhossein
    Ban, Tao
    Inoue, Daisuke
    NEURAL NETWORKS, 2013, 44 : 87 - 100
  • [37] A broad review on class imbalance learning techniques
    Rezvani, Salim
    Wang, Xizhao
    APPLIED SOFT COMPUTING, 2023, 143
  • [38] Few-Shot Learning With Class Imbalance
    Ochal, M.
    Patacchiola, M.
    Vazquez, J.
    Storkey, A.
    Wang, S.
    IEEE TRANSACTIONS ON ARTIFICIAL INTELLIGENCE, 2023, 4 (05) : 1348 - 1358
  • [39] Unsupervised Ensemble Learning for Class Imbalance Problems
    Liu, Zihan
    Wu, Dongrui
    2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 3593 - 3600
  • [40] Stop Oversampling for Class Imbalance Learning: A Review
    Tarawneh, Ahmad S.
    Hassanat, Ahmad B.
    Altarawneh, Ghada Awad
    Almuhaimeed, Abdullah
    IEEE ACCESS, 2022, 10 : 47643 - 47660