A Streamlined Attention Mechanism for Image Classification and Fine-Grained Visual Recognition

被引:0
|
作者
Dakshayani Himabindu D. [1 ,2 ]
Praveen Kumar S. [1 ]
机构
[1] Department of CSE, GIT, GITAM University
[2] Department of IT, VNRVJIET
来源
Dakshayani Himabindu, D. (dakshayanihimabindu_d@vnrvjiet.in) | 1600年 / Brno University of Technology卷 / 27期
关键词
Channel Attention; Deep Learning; Fine-Grained Visual Recognition; Image Classification; Spatial Attention; Visual Attention;
D O I
10.13164/mendel.2021.2.059
中图分类号
学科分类号
摘要
In the recent advancements attention mechanism in deep learning had played a vital role in proving better results in tasks under computer vision. There exists multiple kinds of works under attention mechanism which includes under image classification, fine-grained visual recognition, image captioning, video captioning, object detection and recognition tasks. Global and local attention are the two attention based mechanisms which helps in interpreting the attentive partial. Considering this criteria, there exists channel and spatial attention where in channel attention considers the most attentive channel among the produced block of channels and spatial attention considers which region among the space needs to be focused on. We have proposed a streamlined attention block module which helps in enhancing the feature based learning with less number of additional layers i.e., a GAP layer followed by a linear layer with an incorporation of second order pooling (GSoP) after every layer in the utilized encoder. This mechanism has produced better range dependencies by the conducted experimentation. We have experimented our model on CIFAR-10, CIFAR-100 and FGVC-Aircrafts datasets considering finegrained visual recognition. We were successful in achieving state-of-the-result for FGVC-Aircrafts with an accuracy of 97%. © 2021, Brno University of Technology. All rights reserved.
引用
下载
收藏
页码:59 / 67
页数:8
相关论文
共 50 条
  • [31] Leveraging Fine-Grained Labels to Regularize Fine-Grained Visual Classification
    Wu, Junfeng
    Yao, Li
    Liu, Bin
    Ding, Zheyuan
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON COMPUTER MODELING AND SIMULATION (ICCMS 2019) AND 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND APPLICATIONS (ICICA 2019), 2019, : 133 - 136
  • [32] Progressive Co-Attention Network for Fine-Grained Visual Classification
    Zhang, Tian
    Chang, Dongliang
    Ma, Zhanyu
    Guo, Jun
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [33] Dual-Dependency Attention Transformer for Fine-Grained Visual Classification
    Cui, Shiyan
    Hui, Bin
    SENSORS, 2024, 24 (07)
  • [34] Bidirectional Attention-Recognition Model for Fine-Grained Object Classification
    Liu, Chuanbin
    Xie, Hongtao
    Zha, Zhengjun
    Yu, Lingyun
    Chen, Zhineng
    Zhang, Yongdong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (07) : 1785 - 1795
  • [35] FEATURE COMPARISON BASED CHANNEL ATTENTION FOR FINE-GRAINED VISUAL CLASSIFICATION
    Jia, Shukun
    Bai, Yan
    Zhang, Jing
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1776 - 1780
  • [36] Fine-grained and Semantic-guided Visual Attention for Image Captioning
    Zhang, Zongjian
    Wu, Qiang
    Wang, Yang
    Chen, Fang
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 1709 - 1717
  • [37] Object-Part Attention Model for Fine-Grained Image Classification
    Peng, Yuxin
    He, Xiangteng
    Zhao, Junjie
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (03) : 1487 - 1500
  • [38] Fine-Grained Image Classification Based on Cross-Attention Network
    Zheng, Zhiwen
    Zhou, Juxiang
    Gan, Jianhou
    Luo, Sen
    Gao, Wei
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2022, 18 (01)
  • [39] Fine-grained image classification method based on hybrid attention module
    Lu, Weixiang
    Yang, Ying
    Yang, Lei
    FRONTIERS IN NEUROROBOTICS, 2024, 18
  • [40] Fine-grained image retrieval by combining attention mechanism and context information
    Li, Xiaoqing
    Ma, Jinwen
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (02): : 1881 - 1897