An Erudite Fine-Grained Visual Classification Model

被引:3
|
作者
Chang, Dongliang [1 ]
Tong, Yujun [1 ]
Du, Ruoyi [1 ]
Hospedales, Timothy [2 ]
Song, Yi-Zhe [3 ]
Ma, Zhanyu [1 ]
机构
[1] Beijing Univ Posts & Telecommunicat, Beijing, Peoples R China
[2] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[3] Univ Surrey, SketchX, CVSSP, Surrey, England
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.00702
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current fine-grained visual classification (FGVC) models are isolated. In practice, we first need to identify the coarse-grained label of an object, then select the corresponding FGVC model for recognition. This hinders the application of FGVC algorithms in real-life scenarios. In this paper, we propose an erudite FGVC model jointly trained by several different datasets(1), which can efficiently and accurately predict an object's fine-grained label across the combined label space. We found through a pilot study that positive and negative transfers co-occur when different datasets are mixed for training, i.e., the knowledge from other datasets is not always useful. Therefore, we first propose a feature disentanglement module and a feature re-fusion module to reduce negative transfer and boost positive transfer between different datasets. In detail, we reduce negative transfer by decoupling the deep features through many dataset-specific feature extractors. Subsequently, these are channel-wise re-fused to facilitate positive transfer. Finally, we propose a meta-learning based dataset-agnostic spatial attention layer to take full advantage of the multi-dataset training data, given that localisation is dataset-agnostic between different datasets. Experimental results across 11 different mixed-datasets built on four different FGVC datasets demonstrate the effectiveness of the proposed method. Furthermore, the proposed method can be easily combined with existing FGVC methods to obtain state-of-the-art results. Our code is available at https:// github.com/PRIS-CV/An-Erudite-FGVC-Model.
引用
下载
收藏
页码:7268 / 7277
页数:10
相关论文
共 50 条
  • [21] Feature Boosting, Suppression, and Diversification for Fine-Grained Visual Classification
    Song, Jianwei
    Yang, Ruoyu
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [22] TransFGVC: transformer-based fine-grained visual classification
    Shen, Longfeng
    Hou, Bin
    Jian, Yulei
    Tu, Xisong
    Zhang, Yingjie
    Shuai, Lingying
    Ge, Fangzhen
    Chen, Debao
    VISUAL COMPUTER, 2024,
  • [23] WEB-SUPERVISED NETWORK FOR FINE-GRAINED VISUAL CLASSIFICATION
    Zhang, Chuanyi
    Ya, Yazhou
    Zhang, Jiachao
    Chen, Jiaxin
    Huang, Pu
    Zhang, Jian
    Tang, Zhenmin
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [24] SemLa: A Visual Analysis System for Fine-Grained Text Classification
    Battogtokh, Munkhtulga
    Davidescu, Cosmin
    Luck, Michael
    Borgo, Rita
    THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23772 - 23774
  • [25] Fine-Grained Visual Entailment
    Thomas, Christopher
    Zhang, Yipeng
    Chang, Shih-Fu
    COMPUTER VISION, ECCV 2022, PT XXXVI, 2022, 13696 : 398 - 416
  • [26] A light-weight model with granularity feature representation for fine-grained visual classification
    Zheng, Qiumei
    Peng, Tianqi
    Huang, Ding
    Wang, Fenghua
    Xu, Nengxiang
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2024, 27 (03) : 341 - 351
  • [27] Fine-Grained Visual Prompting
    Yang, Lingfeng
    Wang, Yueze
    Li, Xiang
    Wang, Xinlong
    Yang, Jian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [28] Adaptive Triplet Model for Fine-Grained Visual Categorization
    Liang, Jingyun
    Guo, Jinlin
    Guo, Yanming
    Lao, Songyang
    IEEE ACCESS, 2018, 6 : 76776 - 76786
  • [29] Convolutionally Enhanced Feature Fusion Visual Transformer for Fine-Grained Visual Classification
    Huang, Min
    Zhu, Saixing
    Wang, Zehua
    Qu, Shuanghong
    2024 16TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, ICMLC 2024, 2024, : 447 - 452
  • [30] Fine-Grained Visual Classification via Internal Ensemble Learning Transformer
    Xu, Qin
    Wang, Jiahui
    Jiang, Bo
    Luo, Bin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 9015 - 9028