R2-trans: Fine-grained visual categorization with redundancy reduction

被引:2
|
作者
Ye, Shuo [1 ]
Yu, Shujian [2 ,3 ]
Wang, Yu [1 ]
You, Xinge [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
[2] Vrije Univ Amsterdam, Dept Comp Sci, Amsterdam, Netherlands
[3] UiT Arctic Univ Norway, Machine Learning Grp, Tromso, Norway
关键词
Fine-grained visual categorization; Batch -based dynamic mask; Information bottleneck; NETWORK;
D O I
10.1016/j.imavis.2024.104923
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained visual categorization (FGVC) aims to discriminate similar subcategories, whose main challenge is the large intraclass diversities and subtle inter-class differences. Existing FGVC methods usually select discriminant regions found by a trained model, which is prone to neglect other potential discriminant information. On the other hand, the massive interactions between the sequence of image patches in ViT make the resulting class token contain lots of redundant information, which may also impact FGVC performance. In this paper, we present a novel approach for FGVC, which can simultaneously make use of partial yet sufficient discriminative information in environmental cues and also compress the redundant information in class-token with respect to the target. Specifically, our model calculates the ratio of high-weight regions in a batch, adaptively adjusts the masking threshold, and achieves moderate extraction of background information in the input space. Moreover, we also use the Information Bottleneck (IB) approach to guide our network to learn a minimum sufficient representations in the feature space. Experimental results on three widely-used benchmark datasets verify that our approach can achieve better performance than other state-of-the-art approaches and baseline models. The code of our model is available at: https://github.com/SYe-hub/R-2-Trans.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] SHAPE-GUIDED SEGMENTATION FOR FINE-GRAINED VISUAL CATEGORIZATION
    Sun, Ming
    Yang, Jufeng
    Sun, Bo
    Wang, Kai
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [22] Orientational Spatial Part Modeling for Fine-Grained Visual Categorization
    Yao, Hantao
    Zhang, Shiliang
    Xie, Fei
    Zhang, Yongdong
    Zhang, Dongming
    Su, Yu
    Tian, Qi
    [J]. 2015 IEEE THIRD INTERNATIONAL CONFERENCE ON MOBILE SERVICES MS 2015, 2015, : 360 - 367
  • [23] A Deep Sparse Coding Method for Fine-Grained Visual Categorization
    Guo, Lihua
    Guo, Chenggang
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 632 - 639
  • [24] Part-Stacked CNN for Fine-Grained Visual Categorization
    Huang, Shaoli
    Xu, Zhe
    Tao, Dacheng
    Zhang, Ya
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1173 - 1182
  • [25] Refined probability distribution module for fine-grained visual categorization
    Zhao, Peipei
    Miao, Qiguang
    Li, Hongsheng
    Liu, Ruyi
    Quan, Yining
    Song, Jianfeng
    [J]. NEUROCOMPUTING, 2023, 518 : 533 - 544
  • [26] Category attention transfer for efficient fine-grained visual categorization
    Liao, Qiyu
    Wang, Dadong
    Xu, Min
    [J]. PATTERN RECOGNITION LETTERS, 2022, 153 : 10 - 15
  • [27] A survey of fine-grained visual categorization based on deep learning
    Xie Yuxiang
    Gong Quanzhi
    Luan Xidao
    Yan Jie
    Zhang Jiahui
    [J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2023,
  • [28] Attentional Kernel Encoding Networks for Fine-Grained Visual Categorization
    Hu, Yutao
    Yang, Yandan
    Zhang, Jun
    Cao, Xianbin
    Zhen, Xiantong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (01) : 301 - 314
  • [29] Birdsnap: Large-scale Fine-grained Visual Categorization of Birds
    Berg, Thomas
    Liu, Jiongxin
    Lee, Seung Woo
    Alexander, Michelle L.
    Jacobs, David W.
    Belhumeur, Peter N.
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 2019 - 2026
  • [30] Filtration and Distillation: Enhancing Region Attention for Fine-Grained Visual Categorization
    Liu, Chuanbin
    Xie, Hongtao
    Zha, Zheng-Jun
    Ma, Lingfeng
    Yu, Lingyun
    Zhang, Yongdong
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11555 - 11562