Multiscale attention dynamic aware network for fine-grained visual categorization

被引:0
|
作者
Ou, Jichu [1 ]
Li, Wanyi [2 ]
Huang, Jingmin [2 ]
Huang, Xiaojie [2 ]
Xie, Xuan [2 ]
机构
[1] Guangdong Univ Educ, Sch Math, Guangzhou, Peoples R China
[2] Guangdong Univ Educ, Sch Comp Sci, Guangzhou, Peoples R China
关键词
data mining; image classification; image recognition;
D O I
10.1049/ell2.12696
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Fine-grained visual categorization (FGVC) is a challenging task, facing the issues such as inter-class similarities, large intra-class variances, scale variation, and angle variation. To address these issues, the authors propose a novel multiscale attention dynamic aware network (MADA-Net). The core of network consists of three parallel sub-networks, which learn features from different scales. Each sub-network is composed of three serial sub-modules: (1) A self-attention module (SAM) locates objects according to relative importance scattered throughout feature map. (2) A multiscale feature extractor (MFE) learns the non-linear features of objects. (3) A dynamic aware module (DAM) enhances the learning capability of spatial deformation of the network to generate high-quality feature map. In addition, the authors propose a multiscale adjusted loss (MA-Loss) to improve the performance of network. Experiments on three prevailing benchmark datasets demonstrate that our method can achieve state-of-the-art performance.
引用
收藏
页数:3
相关论文
共 50 条
  • [41] A Survey of Fine-Grained Visual Categorization Based on Deep Learning
    Xie, Yuxiang
    Gong, Quanzhi
    Luan, Xidao
    Yan, Jie
    Zhang, Jiahui
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2024, 35 (06) : 1337 - 1356
  • [42] SHAPE-GUIDED SEGMENTATION FOR FINE-GRAINED VISUAL CATEGORIZATION
    Sun, Ming
    Yang, Jufeng
    Sun, Bo
    Wang, Kai
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [43] Refined probability distribution module for fine-grained visual categorization
    Zhao, Peipei
    Miao, Qiguang
    Li, Hongsheng
    Liu, Ruyi
    Quan, Yining
    Song, Jianfeng
    NEUROCOMPUTING, 2023, 518 : 533 - 544
  • [44] Part-Stacked CNN for Fine-Grained Visual Categorization
    Huang, Shaoli
    Xu, Zhe
    Tao, Dacheng
    Zhang, Ya
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1173 - 1182
  • [45] A Deep Sparse Coding Method for Fine-Grained Visual Categorization
    Guo, Lihua
    Guo, Chenggang
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 632 - 639
  • [46] Orientational Spatial Part Modeling for Fine-Grained Visual Categorization
    Yao, Hantao
    Zhang, Shiliang
    Xie, Fei
    Zhang, Yongdong
    Zhang, Dongming
    Su, Yu
    Tian, Qi
    2015 IEEE THIRD INTERNATIONAL CONFERENCE ON MOBILE SERVICES MS 2015, 2015, : 360 - 367
  • [47] Universal Fine-Grained Visual Categorization by Concept Guided Learning
    Bi, Qi
    Zhou, Beichen
    Ji, Wei
    Xia, Gui-Song
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 394 - 409
  • [48] A coarse-to-fine capsule network for fine-grained image categorization
    Lin, Zhongqi
    Jia, Jingdun
    Huang, Feng
    Gao, Wanlin
    NEUROCOMPUTING, 2021, 456 : 200 - 219
  • [49] Attentional Kernel Encoding Networks for Fine-Grained Visual Categorization
    Hu, Yutao
    Yang, Yandan
    Zhang, Jun
    Cao, Xianbin
    Zhen, Xiantong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (01) : 301 - 314
  • [50] A survey of fine-grained visual categorization based on deep learning
    Xie Yuxiang
    Gong Quanzhi
    Luan Xidao
    Yan Jie
    Zhang Jiahui
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2023,