Siamese transformer with hierarchical concept embedding for fine-grained image recognition

被引:0
|
作者
Yilin Lyu
Liping Jing
Jiaqi Wang
Mingzhe Guo
Xinyue Wang
Jian Yu
机构
[1] Beijing Jiaotong University,School of Computer and Information Technology
[2] Beijing Jiaotong University,Beijing Key Lab of Traffic Data Analysis and Mining
[3] Alibaba Group,undefined
来源
关键词
fine-grained image recognition; transformer; hierarchical concept embedding; adaptive sampling; Siamese network;
D O I
暂无
中图分类号
学科分类号
摘要
Distinguishing the subtle differences among fine-grained images from subordinate concepts of a concept hierarchy is a challenging task. In this paper, we propose a Siamese transformer with hierarchical concept embedding (STrHCE), which contains two transformer subnetworks sharing all configurations, and each subnetwork is equipped with the hierarchical semantic information at different concept levels for fine-grained image embeddings. In particular, one subnetwork is for coarse-scale patches to learn the discriminative regions with the aid of the innate multi-head self-attention mechanism of the transformer. The other subnetwork is for finer-scale patches, which are adaptively sampled from the discriminative regions, to capture subtle yet discriminative visual cues and eliminate redundant information. STrHCE connects the two subnetworks through a score margin adjustor to enforce the most discriminative regions generating more confident predictions. Extensive experiments conducted on four commonly-used benchmark datasets, including CUB-200-2011, FGVC-Aircraft, Stanford Dogs, and NABirds, empirically demonstrate the superiority of the proposed STrHCE over state-of-the-art baselines.
引用
收藏
相关论文
共 50 条
  • [1] Siamese transformer with hierarchical concept embedding for fine-grained image recognition
    Yilin LYU
    Liping JING
    Jiaqi WANG
    Mingzhe GUO
    Xinyue WANG
    Jian YU
    [J]. Science China(Information Sciences), 2023, 66 (03) : 188 - 203
  • [2] Siamese transformer with hierarchical concept embedding for fine-grained image recognition
    Lyu, Yilin
    Jing, Liping
    Wang, Jiaqi
    Guo, Mingzhe
    Wang, Xinyue
    Yu, Jian
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (03)
  • [3] Hybrid Granularities Transformer for Fine-Grained Image Recognition
    Yu, Ying
    Wang, Jinghui
    [J]. ENTROPY, 2023, 25 (04)
  • [4] Fine-Grained Representation Learning and Recognition by Exploiting Hierarchical Semantic Embedding
    Chen, Tianshui
    Wu, Wenxi
    Gao, Yuefang
    Dong, Le
    Luo, Xiaonan
    Lin, Liang
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 2023 - 2031
  • [5] Transformer with peak suppression and knowledge guidance for fine-grained image recognition
    Liu, Xinda
    Wang, Lili
    Han, Xiaoguang
    [J]. NEUROCOMPUTING, 2022, 492 : 137 - 149
  • [6] Hierarchical Deep Click Feature Prediction for Fine-Grained Image Recognition
    Yu, Jun
    Tan, Min
    Zhang, Hongyuan
    Tao, Dacheng
    Rui, Yong
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (02) : 563 - 578
  • [7] TransFG: A Transformer Architecture for Fine-Grained Recognition
    He, Ju
    Chen, Jie-Neng
    Liu, Shuai
    Kortylewski, Adam
    Yang, Cheng
    Bai, Yutong
    Wang, Changhu
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 852 - 860
  • [8] Ultra Fine-Grained Image Semantic Embedding
    Juan, Da-Cheng
    Lu, Chun-To
    Li, Zhen
    Peng, Futang
    Timofeev, Aleksei
    Chen, Yi-Ting
    Gao, Yaxi
    Duerig, Tom
    Tomkins, Andrew
    Ravi, Sujith
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, : 277 - 285
  • [9] Token Adaptive Vision Transformer with Efficient Deployment for Fine-Grained Image Recognition
    Lee, Chonghan
    Brufau, Rita Brugarolas
    Ding, Ke
    Narayanan, Vijaykrishnan
    [J]. 2023 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2023,
  • [10] Hierarchical Attention Network for Open-Set Fine-Grained Image Recognition
    Sun, Jiayin
    Wang, Hong
    Dong, Qiulei
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3891 - 3904