Transformer-Based Fused Attention Combined with CNNs for Image Classification

被引:0
|
作者
Jielin Jiang
Hongxiang Xu
Xiaolong Xu
Yan Cui
Jintao Wu
机构
[1] Nanjing University of Information Science and Technology,School of Software
[2] Nanjing University of Information Science and Technology,Jiangsu Collaborative Innovation Center of Atmospheric Environment and Equipment Technology (CICAEET)
[3] Nanjing Normal University of Special Education,College of Mathematics and Information Science
来源
Neural Processing Letters | 2023年 / 55卷
关键词
Image classification; Swin transformer; Fusion attention; Residual convolution;
D O I
暂无
中图分类号
学科分类号
摘要
The receptive field of convolutional neural networks (CNNs) is focused on the local context, while the transformer receptive field is concerned with the global context. Transformers are the new backbone of computer vision due to their powerful ability to extract global features, which is supported by pre-training on extensive amounts of data. However, it is challenging to collect a large number of high-quality labeled images for the pre-training phase. Therefore, this paper proposes a classification network (CofaNet) that combines CNNs and transformer-based fused attention to address the limitations of transformers without pre-training, such as low accuracy. CofaNet introduces patch sequence dimension attention to capture the relationship among subsequences and incorporates it into self-attention to construct a new attention feature extraction layer. Then, a residual convolution block is used instead of multi-layer perception after the fusion attention layer to compensate for the limited feature extraction of the attention layer on small datasets. The experimental results on three benchmark datasets demonstrate that CofaNet achieves excellent classification accuracy when compared to some transformer-based networks without pre-traning.
引用
收藏
页码:11905 / 11919
页数:14
相关论文
共 50 条
  • [21] Hyperspectral Image Classification Based on Multibranch Attention Transformer Networks
    Bai, Jing
    Wen, Zheng
    Xiao, Zhu
    Ye, Fawang
    Zhu, Yongdong
    Alazab, Mamoun
    Jiao, Licheng
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [22] Attention Fusion of Transformer-Based and Scale-Based Method for Hyperspectral and LiDAR Joint Classification
    Zhang, Maqun
    Gao, Feng
    Zhang, Tiange
    Gan, Yanhai
    Dong, Junyu
    Yu, Hui
    [J]. REMOTE SENSING, 2023, 15 (03)
  • [23] TransPath: Transformer-Based Self-supervised Learning for Histopathological Image Classification
    Wang, Xiyue
    Yang, Sen
    Zhang, Jun
    Wang, Minghui
    Zhang, Jing
    Huang, Junzhou
    Yang, Wei
    Han, Xiao
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII, 2021, 12908 : 186 - 195
  • [24] HyperSFormer: A Transformer-Based End-to-End Hyperspectral Image Classification Method for Crop Classification
    Xie, Jiaxing
    Hua, Jiajun
    Chen, Shaonan
    Wu, Peiwen
    Gao, Peng
    Sun, Daozong
    Lyu, Zhendong
    Lyu, Shilei
    Xue, Xiuyun
    Lu, Jianqiang
    [J]. REMOTE SENSING, 2023, 15 (14)
  • [25] Transformer-based Hierarchical Encoder for Document Classification
    Sakhrani, Harsh
    Parekh, Saloni
    Ratadiya, Pratik
    [J]. 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 852 - 858
  • [26] Practical Transformer-based Multilingual Text Classification
    Wang, Cindy
    Banko, Michele
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2021, 2021, : 121 - 129
  • [27] BertSRC: transformer-based semantic relation classification
    Lee, Yeawon
    Son, Jinseok
    Song, Min
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)
  • [28] A transformer-based architecture for fake news classification
    Divyam Mehta
    Aniket Dwivedi
    Arunabha Patra
    M. Anand Kumar
    [J]. Social Network Analysis and Mining, 2021, 11
  • [29] A transformer-based architecture for fake news classification
    Mehta, Divyam
    Dwivedi, Aniket
    Patra, Arunabha
    Anand Kumar, M.
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2021, 11 (01)
  • [30] Transformer-based Neural Network for Electrocardiogram Classification
    Atiea, Mohammed A.
    Adel, Mark
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (11) : 357 - 363