Multimodal Framework for Long-Tailed Recognition

Cited by: 0
Authors
Chen, Jian [1]
Zhao, Jianyin [1]
Gu, Jiaojiao [1]
Qin, Yufeng [1]
Ji, Hong [1]
Affiliations
[1] College of Coastal Defense Force, Naval Aviation University, Yantai 264001, China
Source
Applied Sciences (Switzerland) | 2024 / Vol. 14 / Issue 22
Keywords
Data assimilation; Image classification; Spatio-temporal data
DOI
10.3390/app142210572
Abstract
Long-tailed data distribution (i.e., a few classes account for most of the data, while most classes have very few samples) is a common problem in image classification. In this paper, we propose a novel multimodal framework for long-tailed data recognition. In the first stage, long-tailed data are used for visual-semantic contrastive learning to obtain good features, while in the second stage, class-balanced data are used for classifier training. The proposed framework leverages the advantages of multimodal models and mitigates the problem of class imbalance in long-tailed data recognition. Experimental results demonstrate that the proposed framework achieves competitive performance on the CIFAR-10-LT, CIFAR-100-LT, ImageNet-LT, and iNaturalist2018 datasets for image classification. © 2024 by the authors.
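The abstract does not say how the class-balanced data for the second stage are obtained. As a rough illustration only, the sketch below builds a hypothetical long-tailed label set and resamples it so that each class is drawn with equal probability. The exponential count profile, the function names, and the parameter values (10 classes, 5000 head samples, imbalance ratio 100) are assumptions chosen for the example, not details taken from the paper.

```python
import random
from collections import Counter


def long_tailed_counts(num_classes, max_count, imbalance_ratio):
    """Per-class sample counts that decay exponentially from head to tail.

    Hypothetical profile: the head class has max_count samples and the
    tail class has about max_count / imbalance_ratio samples.
    """
    counts = []
    for k in range(num_classes):
        frac = k / (num_classes - 1) if num_classes > 1 else 0.0
        counts.append(max(1, round(max_count * (1.0 / imbalance_ratio) ** frac)))
    return counts


def class_balanced_weights(labels):
    """Per-sample weights under which every class has the same total weight."""
    freq = Counter(labels)
    num_classes = len(freq)
    return [1.0 / (num_classes * freq[y]) for y in labels]


# Build an imbalanced label list, then resample it with balanced weights.
counts = long_tailed_counts(num_classes=10, max_count=5000, imbalance_ratio=100)
labels = [k for k, n in enumerate(counts) for _ in range(n)]
weights = class_balanced_weights(labels)

rng = random.Random(0)  # fixed seed so the run is repeatable
resampled = rng.choices(labels, weights=weights, k=10000)

print("original counts: ", counts)
print("resampled counts:", sorted(Counter(resampled).items()))
```

Weighted resampling is only one way to balance classes; re-weighting the loss or sampling a fixed number of images per class are common alternatives, and the paper may use any of them.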
Related papers
50 in total
  • [41] SWRM: Similarity Window Reweighting and Margin for Long-Tailed Recognition
    Chen, Qiong
    Huang, Tianlin
    Liu, Qingfa
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (06)
  • [42] FCC: Feature Clusters Compression for Long-Tailed Visual Recognition
    Li, Jian
    Meng, Ziyao
    Shi, Daqian
    Song, Rui
    Diao, Xiaolei
    Wang, Jingwen
    Xu, Hao
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 24080 - 24089
  • [43] Balanced clustering contrastive learning for long-tailed visual recognition
    Byeong-il Kim
    Byoung Chul Ko
    Pattern Analysis and Applications, 2025, 28 (1)
  • [44] Long-tailed image recognition through balancing discriminant quality
    Wu, Yan-Xue
    Min, Fan
    Zhang, Ben-Wen
    Wang, Xian-Jie
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (SUPPL 1) : 833 - 856
  • [45] Hierarchical block aggregation network for long-tailed visual recognition
    Pang, Shanmin
    Wang, Weiye
    Zhang, Renzhong
    Hao, Wenyu
    NEUROCOMPUTING, 2023, 549
  • [46] Long-Tailed Recognition by Hierarchical Rebalancing Dual-Classifier
    Zhang, Junsong
    Gao, Linsheng
    Li, Hao
    Zhou, Hao
    IEEE ACCESS, 2023, 11 : 54839 - 54848
  • [47] MetaSAug: Meta Semantic Augmentation for Long-Tailed Visual Recognition
    Li, Shuang
    Gong, Kaixiong
    Liu, Chi Harold
    Wang, Yulin
    Qiao, Feng
    Cheng, Xinjing
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5208 - 5217
  • [48] Adaptive Logit Adjustment Loss for Long-Tailed Visual Recognition
    Zhao, Yan
    Chen, Weicong
    Tan, Xu
    Huang, Kai
    Zhu, Jihong
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3472 - 3480
  • [49] Subclass-balancing Contrastive Learning for Long-tailed Recognition
    Hou, Chengkai
    Zhang, Jieyu
    Wang, Haonan
    Zhou, Tianyi
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5372 - 5384
  • [50] Long-tailed image recognition through balancing discriminant quality
    Yan-Xue Wu
    Fan Min
    Ben-Wen Zhang
    Xian-Jie Wang
    Artificial Intelligence Review, 2023, 56 : 833 - 856