Attention Convolutional Binary Neural Tree for Fine-Grained Visual Categorization

被引:142
|
作者
Ji, Ruyi [1 ,2 ]
Wen, Longyin [3 ]
Zhang, Libo [1 ]
Du, Dawei [4 ]
Wu, Yanjun [1 ]
Zhao, Chen [1 ]
Liu, Xianglong [5 ]
Huang, Feiyue [6 ]
机构
[1] ISCAS, State Key Lab Comp Sci, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] JD Finance Amer Corp, Mountain View, CA USA
[4] SUNY Albany, Albany, NY 12222 USA
[5] Beihang Univ, Beijing, Peoples R China
[6] Tencent Youtu Lab, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR42600.2020.01048
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained visual categorization (FGVC) is an important but challenging task due to high intra-class variances and low inter-class variances caused by deformation, occlusion, illumination, etc. An attention convolutional binary neural tree is presented to address those problems for weakly supervised FGVC. Specifically, we incorporate convolutional operations along edges of the tree structure, and use the routing functions in each node to determine the root-to-leaf computational paths within the tree. The final decision is computed as the summation of the predictions from leaf nodes. The deep convolutional operations learn to capture the representations of objects, and the tree structure characterizes the coarse-to-fine hierarchical feature learning process. In addition, we use the attention transformer module to enforce the network to capture discriminative features. Several experiments on the CUB200-2011, Stanford Cars and Aircraft datasets demonstrate that our method performs favorably against the state-of-the-arts. Code can be found at https://isrc.iscas.ac.cn/gitlab/research/acnet.
引用
收藏
页码:10465 / 10474
页数:10
相关论文
共 50 条
  • [1] Evolving Convolutional Neural Network and Its Application in Fine-Grained Visual Categorization
    Xuan, Qi
    Xiao, Haoquan
    Fu, Chenbo
    Liu, Yi
    IEEE ACCESS, 2018, 6 : 31110 - 31116
  • [2] Attention-shift based deep neural network for fine-grained visual categorization
    Niu, Yi
    Jiao, Yang
    Shi, Guangming
    PATTERN RECOGNITION, 2021, 116
  • [3] Category attention transfer for efficient fine-grained visual categorization
    Liao, Qiyu
    Wang, Dadong
    Xu, Min
    PATTERN RECOGNITION LETTERS, 2022, 153 : 10 - 15
  • [4] On fine-grained visual explanation in convolutional neural networks
    Lei, Xia
    Fan, Yongkai
    Luo, Xiong-Lin
    DIGITAL COMMUNICATIONS AND NETWORKS, 2023, 9 (05) : 1141 - 1147
  • [5] On fine-grained visual explanation in convolutional neural networks
    Xia Lei
    Yongkai Fan
    XiongLin Luo
    Digital Communications and Networks, 2023, 9 (05) : 1141 - 1147
  • [6] Filtration and Distillation: Enhancing Region Attention for Fine-Grained Visual Categorization
    Liu, Chuanbin
    Xie, Hongtao
    Zha, Zheng-Jun
    Ma, Lingfeng
    Yu, Lingyun
    Zhang, Yongdong
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11555 - 11562
  • [7] Multiscale attention dynamic aware network for fine-grained visual categorization
    Ou, Jichu
    Li, Wanyi
    Huang, Jingmin
    Huang, Xiaojie
    Xie, Xuan
    ELECTRONICS LETTERS, 2023, 59 (01)
  • [8] Multistage attention region supplement transformer for fine-grained visual categorization
    Mei, Aokun
    Huo, Hua
    Xu, Jiaxin
    Xu, Ningya
    VISUAL COMPUTER, 2025, 41 (03): : 1873 - 1889
  • [9] P-CNN: Part-Based Convolutional Neural Networks for Fine-Grained Visual Categorization
    Han, Junwei
    Yao, Xiwen
    Cheng, Gong
    Feng, Xiaoxu
    Xu, Dong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (02) : 579 - 590
  • [10] Interpretable and Fine-Grained Visual Explanations for Convolutional Neural Networks
    Wagner, Joerg
    Koehler, Jan Mathias
    Gindele, Tobias
    Hetzel, Leon
    Wiedemer, Jakob Thaddaeus
    Behnke, Sven
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9089 - 9099