Multimodal Framework for Long-Tailed Recognition

被引:0
|
作者
Chen, Jian [1 ]
Zhao, Jianyin [1 ]
Gu, Jiaojiao [1 ]
Qin, Yufeng [1 ]
Ji, Hong [1 ]
机构
[1] College of Coastal Defense Force, Naval Aviation University, Yantai,264001, China
来源
Applied Sciences (Switzerland) | 2024年 / 14卷 / 22期
关键词
Data assimilation - Image classification - Spatio-temporal data;
D O I
10.3390/app142210572
中图分类号
学科分类号
摘要
Long-tailed data distribution (i.e., minority classes occupy most of the data, while most classes have very few samples) is a common problem in image classification. In this paper, we propose a novel multimodal framework for long-tailed data recognition. In the first stage, long-tailed data are used for visual-semantic contrastive learning to obtain good features, while in the second stage, class-balanced data are used for classifier training. The proposed framework leverages the advantages of multimodal models and mitigates the problem of class imbalance in long-tailed data recognition. Experimental results demonstrate that the proposed framework achieves competitive performance on the CIFAR-10-LT, CIFAR-100-LT, ImageNet-LT, and iNaturalist2018 datasets for image classification. © 2024 by the authors.
引用
下载
收藏
相关论文
共 50 条
  • [1] Mutual Learning for Long-Tailed Recognition
    Park, Changhwa
    Yim, Junho
    Jun, Eunji
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2674 - 2683
  • [2] A Survey on Long-Tailed Visual Recognition
    Yang, Lu
    Jiang, He
    Song, Qing
    Guo, Jun
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (07) : 1837 - 1872
  • [3] Improving Calibration for Long-Tailed Recognition
    Zhong, Zhisheng
    Cui, Jiequan
    Liu, Shu
    Jia, Jiaya
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16484 - 16493
  • [4] A Survey on Long-Tailed Visual Recognition
    Lu Yang
    He Jiang
    Qing Song
    Jun Guo
    International Journal of Computer Vision, 2022, 130 : 1837 - 1872
  • [5] Learning Prototype Classifiers for Long-Tailed Recognition
    Sharma, Saurabh
    Xian, Yongqin
    Yu, Ning
    Singh, Ambuj
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1360 - 1368
  • [6] ResLT: Residual Learning for Long-Tailed Recognition
    Cui, Jiequan
    Liu, Shu
    Tian, Zhuotao
    Zhong, Zhisheng
    Jia, Jiaya
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3695 - 3706
  • [7] Long-Tailed Recognition via Weight Balancing
    Alshammari, Shaden
    Wang, Yu-Xiong
    Ramanan, Deva
    Kong, Shu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6887 - 6897
  • [8] Decoupled Contrastive Learning for Long-Tailed Recognition
    Xuan, Shiyu
    Zhang, Shiliang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 6396 - 6403
  • [9] Decoupled Optimisation for Long-Tailed Visual Recognition
    Cong, Cong
    Xuan, Shiyu
    Liu, Sidong
    Zhang, Shiliang
    Pagnucco, Maurice
    Song, Yang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1380 - 1388
  • [10] Distilling Virtual Examples for Long-tailed Recognition
    He, Yin-Yin
    Wu, Jianxin
    Wei, Xiu-Shen
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 235 - 244