A survey of fine-grained visual categorization based on deep learning

被引:0
|
作者
Xie Yuxiang [1 ]
Gong Quanzhi [1 ]
Luan Xidao [2 ]
Yan Jie [1 ]
Zhang Jiahui [1 ]
机构
[1] Natl Univ Def Technol, Coll Syst Engn, Changsha 410000, Peoples R China
[2] Changsha Univ, Coll Comp Engn & Appl Math, Changsha 410003, Peoples R China
基金
中国国家自然科学基金;
关键词
deep learning; fine-grained visual categorization; convolutional neural network (CNN); visual attention; ATTENTION; NETWORK;
D O I
10.23919/JSEE.2022.000155
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning has achieved excellent results in various tasks in the field of computer vision, especially in fine-grained visual categorization. It aims to distinguish the subordinate categories of the label-level categories. Due to high intra-class variances and high inter-class similarity, the fine-grained visual categorization is extremely challenging. This paper first briefly introduces and analyzes the related public datasets. After that, some of the latest methods are reviewed. Based on the feature types, the feature processing methods, and the overall structure used in the model, we divide them into three types of methods: methods based on general convolutional neural network (CNN) and strong supervision of parts, methods based on single feature processing, and methods based on multiple feature processing. Most methods of the first type have a relatively simple structure, which is the result of the initial research. The methods of the other two types include models that have special structures and training processes, which are helpful to obtain discriminative features. We conduct a specific analysis on several methods with high accuracy on public datasets. In addition, we support that the focus of the future research is to solve the demand of existing methods for the large amount of the data and the computing power. In terms of technology, the extraction of the subtle feature information with the burgeoning vision transformer (ViT) network is also an important research direction.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Part-Stacked CNN for Fine-Grained Visual Categorization
    Huang, Shaoli
    Xu, Zhe
    Tao, Dacheng
    Zhang, Ya
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1173 - 1182
  • [42] Orientational Spatial Part Modeling for Fine-Grained Visual Categorization
    Yao, Hantao
    Zhang, Shiliang
    Xie, Fei
    Zhang, Yongdong
    Zhang, Dongming
    Su, Yu
    Tian, Qi
    2015 IEEE THIRD INTERNATIONAL CONFERENCE ON MOBILE SERVICES MS 2015, 2015, : 360 - 367
  • [43] Category attention transfer for efficient fine-grained visual categorization
    Liao, Qiyu
    Wang, Dadong
    Xu, Min
    PATTERN RECOGNITION LETTERS, 2022, 153 : 10 - 15
  • [44] Attentional Kernel Encoding Networks for Fine-Grained Visual Categorization
    Hu, Yutao
    Yang, Yandan
    Zhang, Jun
    Cao, Xianbin
    Zhen, Xiantong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (01) : 301 - 314
  • [45] Fine-grained Categorization and Dataset Bootstrapping using Deep Metric Learning with Humans in the Loop
    Cui, Yin
    Zhou, Feng
    Lin, Yuanqing
    Belongie, Serge
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1153 - 1162
  • [46] Graph Neural Networks Based Multi-granularity Feature Representation Learning for Fine-Grained Visual Categorization
    Wu, Hongyan
    Guo, Haiyun
    Miao, Qinghai
    Huang, Min
    Wang, Jinqiao
    MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 : 230 - 242
  • [47] Fine-Grained Classification of Hyperspectral Imagery Based on Deep Learning
    Chen, Yushi
    Huang, Lingbo
    Zhu, Lin
    Yokoya, Naoto
    Jia, Xiuping
    REMOTE SENSING, 2019, 11 (22)
  • [48] Fine-grained Android Malware Detection based on Deep Learning
    Li, Dongfang
    Wang, Zhaoguo
    Xue, Yibo
    2018 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2018,
  • [49] A model for fine-grained vehicle classification based on deep learning
    Yu, Shaoyong
    Wu, Yun
    Li, Wei
    Song, Zhijun
    Zeng, Wenhua
    NEUROCOMPUTING, 2017, 257 : 97 - 103
  • [50] A deep learning based fine-grained classification algorithm for grading of visual impairment in cataract patients
    Jiang, Jiewei
    Zhang, Yi
    Xie, He
    Yang, Jingshi
    Gong, Jiamin
    Li, Zhongwen
    OPTOELECTRONICS LETTERS, 2024, 20 (01) : 48 - 57