A benchmark dataset and approach for fine-grained visual categorization in complex scenes

被引:0
|
作者
Zhang, Xiang [1 ]
Zhang, Keran [2 ]
Zhao, Wanqing [1 ]
Luo, Hangzai [1 ]
Zhong, Sheng [1 ]
Tang, Lei [2 ]
Peng, Jinye [1 ]
Fan, Jianping [1 ]
机构
[1] Northwest Univ, Sch Informat & Technol, Xian 710127, Shaanxi, Peoples R China
[2] Xian Microelectron Technol Inst, Xian, Shaanxi, Peoples R China
关键词
Fine-grained visual categorization; Location; -aware; Complex scenes; Channel attention; Spatial attention; ATTENTION;
D O I
10.1016/j.dsp.2023.104033
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the vast development of deep learning, many deep learning-based approaches have demonstrated their outstanding performance on the task of fine-grained visual categorization (FGVC). However, existing fine-grained datasets mainly focus on simple images (i.e., objects tend to occupy a significantly larger portion of the image and appear in a relatively clear background). This seriously restricts the application of FGVC in real-world scenarios. In this paper, we construct a fine-grained dataset named AIBD-Cars, which contains 28, 471 car images with complex backgrounds belonging to 196 fine-grained classes. Furthermore, we propose a Location-Aware Channel-Spatial Attention Network (LCSANet), which considers both locating object regions and mining discriminative information to achieve better fine-grained visual categorization in complex scenes. We evaluate popular fine-grained visual categorization algorithms to build a benchmark. Extensive experiments show that our proposed method achieves a new state of the art on AIBD-Cars and FGVC Aircraft, and competitive results on CUB-200-2011 and Stanford Cars. (c) 2023 Elsevier Inc. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Feathers Dataset for Fine-Grained Visual Categorization
    Belko, Alina
    Dobratulin, Konstantin
    Kuznetsov, Andrey
    [J]. THIRTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2020), 2021, 11605
  • [2] VegFru: A Domain-Specific Dataset for Fine-grained Visual Categorization
    Hou, Saihui
    Feng, Yushan
    Wang, Zilei
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 541 - 549
  • [3] PaperNet: A Dataset and Benchmark for Fine-Grained Paper Classification
    Yue, Tan
    Li, Yong
    Shi, Xuzhao
    Qin, Jiedong
    Fan, Zijiao
    Hu, Zonghai
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (09):
  • [4] Coarse-to-Fine Description for Fine-Grained Visual Categorization
    Yao, Hantao
    Zhang, Shiliang
    Zhang, Yongdong
    Li, Jintao
    Tian, Qi
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (10) : 4858 - 4872
  • [5] FINE-GRAINED VISUAL CATEGORIZATION WITH FINE-TUNED SEGMENTATION
    Li, Lingyun
    Guo, Yanqing
    Xie, Lingxi
    Kong, Xiangwei
    Tian, Qi
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 2025 - 2029
  • [6] Adaptive Triplet Model for Fine-Grained Visual Categorization
    Liang, Jingyun
    Guo, Jinlin
    Guo, Yanming
    Lao, Songyang
    [J]. IEEE ACCESS, 2018, 6 : 76776 - 76786
  • [7] Squeezed Bilinear Pooling for Fine-Grained Visual Categorization
    Liao, Qiyu
    Wang, Dadong
    Holewa, Hamish
    Xu, Min
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 728 - 732
  • [8] Alignment Enhancement Network for Fine-grained Visual Categorization
    Hu, Yutao
    Liu, Xuhui
    Zhang, Baochang
    Han, Jungong
    Cao, Xianbin
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (01)
  • [9] Discriminative Suprasphere Embedding for Fine-Grained Visual Categorization
    Ye, Shuo
    Peng, Qinmu
    Sun, Wenju
    Xu, Jiamiao
    Wang, Yu
    You, Xinge
    Cheung, Yiu-Ming
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5092 - 5102
  • [10] ProtoSimi: label correction for fine-grained visual categorization
    Shen, Jialiang
    Yao, Yu
    Huang, Shaoli
    Wang, Zhiyong
    Zhang, Jing
    Wang, Ruxing
    Yu, Jun
    Liu, Tongliang
    [J]. MACHINE LEARNING, 2024, 113 (04) : 1903 - 1920