A Streamlined Attention Mechanism for Image Classification and Fine-Grained Visual Recognition

被引:0
|
作者
Dakshayani Himabindu D. [1 ,2 ]
Praveen Kumar S. [1 ]
机构
[1] Department of CSE, GIT, GITAM University
[2] Department of IT, VNRVJIET
来源
Dakshayani Himabindu, D. (dakshayanihimabindu_d@vnrvjiet.in) | 1600年 / Brno University of Technology卷 / 27期
关键词
Channel Attention; Deep Learning; Fine-Grained Visual Recognition; Image Classification; Spatial Attention; Visual Attention;
D O I
10.13164/mendel.2021.2.059
中图分类号
学科分类号
摘要
In the recent advancements attention mechanism in deep learning had played a vital role in proving better results in tasks under computer vision. There exists multiple kinds of works under attention mechanism which includes under image classification, fine-grained visual recognition, image captioning, video captioning, object detection and recognition tasks. Global and local attention are the two attention based mechanisms which helps in interpreting the attentive partial. Considering this criteria, there exists channel and spatial attention where in channel attention considers the most attentive channel among the produced block of channels and spatial attention considers which region among the space needs to be focused on. We have proposed a streamlined attention block module which helps in enhancing the feature based learning with less number of additional layers i.e., a GAP layer followed by a linear layer with an incorporation of second order pooling (GSoP) after every layer in the utilized encoder. This mechanism has produced better range dependencies by the conducted experimentation. We have experimented our model on CIFAR-10, CIFAR-100 and FGVC-Aircrafts datasets considering finegrained visual recognition. We were successful in achieving state-of-the-result for FGVC-Aircrafts with an accuracy of 97%. © 2021, Brno University of Technology. All rights reserved.
引用
下载
收藏
页码:59 / 67
页数:8
相关论文
共 50 条
  • [11] Aggregate attention module for fine-grained image classification
    Wang, Xingmei
    Shi, Jiahao
    Fujita, Hamido
    Zhao, Yilin
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (7) : 8335 - 8345
  • [12] Efficient Image Embedding for Fine-Grained Visual Classification
    Payatsuporn, Soranan
    Kijsirikul, Boonserm
    2022-14TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST 2022), 2022, : 40 - 45
  • [13] A Fine-Grained Visual Attention Approach for Fingerspelling Recognition in the Wild
    Gajurel, Kamala
    Zhong, Cuncong
    Wang, Guanghui
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 3266 - 3271
  • [14] A collaborative gated attention network for fine-grained visual classification
    Zhu, Qiangxi
    Kuang, Wenlan
    Li, Zhixin
    DISPLAYS, 2023, 79
  • [15] A Progressive Gated Attention Model for Fine-Grained Visual Classification
    Zhu, Qiangxi
    Li, Zhixin
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2063 - 2068
  • [16] Learning Hierarchal Channel Attention for Fine-grained Visual Classification
    Guan, Xiang
    Wang, Guoqing
    Xu, Xing
    Bin, Yi
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5011 - 5019
  • [17] Hierarchical attention vision transformer for fine-grained visual classification
    Hu, Xiaobin
    Zhu, Shining
    Peng, Taile
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 91
  • [18] Diversified Visual Attention Networks for Fine-Grained Object Classification
    Zhao, Bo
    Wu, Xiao
    Feng, Jiashi
    Peng, Qiang
    Yan, Shuicheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (06) : 1245 - 1256
  • [19] CariesFG: A fine-grained RGB image classification framework with attention mechanism for dental caries
    Jiang, Hao
    Zhang, Peiliang
    Che, Chao
    Jin, Bo
    Zhu, Yongjun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [20] ConvNeXt-Based Fine-Grained Image Classification and Bilinear Attention Mechanism Model
    Li, Zhiheng
    Gu, Tongcheng
    Li, Bing
    Xu, Wubin
    He, Xin
    Hui, Xiangyu
    APPLIED SCIENCES-BASEL, 2022, 12 (18):