Tackling class imbalance in computer vision: a contemporary review

被引:6
|
作者
Saini, Manisha [1 ]
Susan, Seba [1 ]
机构
[1] Delhi Technol Univ, Delhi, India
关键词
Class imbalance; Computer vision; Data-level manipulation; Cost-sensitive learning; Deep learning; CONVOLUTIONAL NEURAL-NETWORKS; IMAGE CLASSIFICATION; MACHINE; CHALLENGES; SMOTE;
D O I
10.1007/s10462-023-10557-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Class imbalance is a key issue affecting the performance of computer vision applications such as medical image analysis, objection detection and recognition, image segmentation, scene understanding, and many others. Class imbalance refers to the situation when the number of samples in the majority classes outnumber the minority class populations. The model might then get biased towards the majority classes while neglecting the minority classes, adversely affecting the classification performance. In this paper, an extensive literature survey has been conducted to discuss in depth about the class imbalance issues affecting various classification tasks in computer vision. The study analyzes the performance of several contemporary machine learning algorithms such as chi-square support vector machine and gradient boosted decision trees, and deep learning models such as deep pre-trained convolutional networks, generative adversarial networks and vision transformers, for effective learning from imbalanced computer vision datasets. Most of these models either perform data-level manipulation (data augmentation) or cost-sensitive learning (loss functions) or a combination of the two. This survey also includes a summary of novel deep learning frameworks customized to mitigate the effect of class imbalance. It has included recent advancement and new developments in this field such as Explainable AI. The scrutiny of various popular and benchmark imbalanced datasets in computer vision and performance evaluation metrics are also included as a part of this study. Along with that it has emphasized on the research gaps in contemporary literature which would contribute towards future artificial vision models that can learn effectively from imbalanced datasets.
引用
收藏
页码:1279 / 1335
页数:57
相关论文
共 50 条
  • [31] A new class of Zernike moments for computer vision applications
    Papakostas, G. A.
    Boutalis, Y. S.
    Karras, D. A.
    Mertzios, B. G.
    [J]. INFORMATION SCIENCES, 2007, 177 (13) : 2802 - 2819
  • [32] Oversampling Methods to Handle the Class Imbalance Problem: A Review
    Sharma, Harsh
    Gosain, Anushika
    [J]. SOFT COMPUTING AND ITS ENGINEERING APPLICATIONS, ICSOFTCOMP 2022, 2023, 1788 : 96 - 110
  • [33] A Review on Solution to Class Imbalance Problem: Undersampling Approaches
    Devi, Debashree
    Biswas, Saroj K.
    Purkayastha, Biswajit
    [J]. 2020 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2020), 2020, : 626 - 631
  • [34] Tackling the Imbalance Biases in the Code Cloze Test
    Qi, Xuexin
    Zhao, Lingxiao
    Li, Hui
    Guo, Shikai
    [J]. 2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 733 - 738
  • [35] Computer Vision in Esophageal Cancer: A Literature Review
    Domingues, Ines
    Sampaio, Ines Lucena
    Duarte, Hugo
    Santos, Joao A. M.
    Abreu, Pedro H.
    [J]. IEEE ACCESS, 2019, 7 : 103080 - 103094
  • [36] A review of convolutional neural networks in computer vision
    Xia Zhao
    Limin Wang
    Yufei Zhang
    Xuming Han
    Muhammet Deveci
    Milan Parmar
    [J]. Artificial Intelligence Review, 57
  • [37] A Review of Gesture Recognition Based on Computer Vision
    Li, Bei
    Li, Gongfa
    Sun, Ying
    Jiang, Guozhang
    Kong, Jianyi
    Ju, Zhaojie
    Jiang, Du
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2017, PT I, 2017, 10462 : 528 - 538
  • [38] Computer Vision Techniques in Construction: A Critical Review
    Xu, Shuyuan
    Wang, Jun
    Shou, Wenchi
    Ngo, Tuan
    Sadick, Abdul-Manan
    Wang, Xiangyu
    [J]. ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2021, 28 (05) : 3383 - 3397
  • [39] Computer vision technology in agricultural automation —A review
    Tian, Hongkun
    Wang, Tianhai
    Liu, Yadong
    Qiao, Xi
    Li, Yanzhou
    [J]. Information Processing in Agriculture, 2020, 7 (01): : 1 - 19
  • [40] ROBUST REGRESSION METHODS FOR COMPUTER VISION - A REVIEW
    MEER, P
    MINTZ, D
    ROSENFELD, A
    KIM, DY
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 1991, 6 (01) : 59 - 70