Multimodal Framework for Long-Tailed Recognition

被引:0
|
作者
Chen, Jian [1 ]
Zhao, Jianyin [1 ]
Gu, Jiaojiao [1 ]
Qin, Yufeng [1 ]
Ji, Hong [1 ]
机构
[1] College of Coastal Defense Force, Naval Aviation University, Yantai,264001, China
来源
Applied Sciences (Switzerland) | 2024年 / 14卷 / 22期
关键词
Data assimilation - Image classification - Spatio-temporal data;
D O I
10.3390/app142210572
中图分类号
学科分类号
摘要
Long-tailed data distribution (i.e., minority classes occupy most of the data, while most classes have very few samples) is a common problem in image classification. In this paper, we propose a novel multimodal framework for long-tailed data recognition. In the first stage, long-tailed data are used for visual-semantic contrastive learning to obtain good features, while in the second stage, class-balanced data are used for classifier training. The proposed framework leverages the advantages of multimodal models and mitigates the problem of class imbalance in long-tailed data recognition. Experimental results demonstrate that the proposed framework achieves competitive performance on the CIFAR-10-LT, CIFAR-100-LT, ImageNet-LT, and iNaturalist2018 datasets for image classification. © 2024 by the authors.
引用
下载
收藏
相关论文
共 50 条
  • [21] Domain Balancing: Face Recognition on Long-Tailed Domains
    Cao, Dong
    Zhu, Xiangyu
    Huang, Xingyu
    Guo, Jianzhu
    Lei, Zhen
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5670 - 5678
  • [22] A dual progressive strategy for long-tailed visual recognition
    Liang, Hong
    Cao, Guoqing
    Shao, Mingwen
    Zhang, Qian
    MACHINE VISION AND APPLICATIONS, 2024, 35 (01)
  • [23] Local pseudo-attributes for long-tailed recognition
    Kim, Dong-Jin
    Ke, Tsung-Wei
    Yu, Stella X.
    PATTERN RECOGNITION LETTERS, 2023, 172 : 51 - 57
  • [24] Towards Effective Collaborative Learning in Long-Tailed Recognition
    Xu, Zhengzhuo
    Chai, Zenghao
    Xu, Chengyin
    Yuan, Chun
    Yang, Haiqin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3754 - 3764
  • [25] Nested Collaborative Learning for Long-Tailed Visual Recognition
    Li, Jun
    Tan, Zichang
    Wan, Jun
    Lei, Zhen
    Guo, Guodong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6939 - 6948
  • [26] Probabilistic Contrastive Learning for Long-Tailed Visual Recognition
    Du, Chaoqun
    Wang, Yulin
    Song, Shiji
    Huang, Gao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 5890 - 5904
  • [27] Targeted Supervised Contrastive Learning for Long-Tailed Recognition
    Li, Tianhong
    Cao, Peng
    Yuan, Yuan
    Fan, Lijie
    Yang, Yuzhe
    Feris, Rogerio
    Indyk, Piotr
    Katabi, Dina
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6908 - 6918
  • [28] Inverse Image Frequency for Long-Tailed Image Recognition
    Alexandridis, Konstantinos Panagiotis
    Luo, Shan
    Nguyen, Anh
    Deng, Jiankang
    Zafeiriou, Stefanos
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5721 - 5736
  • [29] Exploring the auxiliary learning for long-tailed visual recognition
    Zhang, Junjie
    Liu, Lingqiao
    Wang, Peng
    Zhang, Jian
    NEUROCOMPUTING, 2021, 449 : 303 - 314
  • [30] Balanced self-distillation for long-tailed recognition
    Ren, Ning
    Li, Xiaosong
    Wu, Yanxia
    Fu, Yan
    KNOWLEDGE-BASED SYSTEMS, 2024, 290