MSANet: Multi-scale attention networks for image classification

被引:3
|
作者
Cao, Ping [1 ,2 ]
Xie, Fangxin [1 ]
Zhang, Shichao [1 ]
Zhang, Zuping [1 ]
Zhang, Jianfeng [3 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China
[2] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing, Peoples R China
[3] Natl Univ Def Technol, Coll Comp Sci, Changsha, Peoples R China
基金
中国国家自然科学基金;
关键词
Image classification; Convolutional neural network; Multi-scale feature; Channel attention; Spatial attention; TEXTURE; SCALE;
D O I
10.1007/s11042-022-12792-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The classification of images based on the principles of human vision is a major task in the field of computer vision. It is a common method to use multi-scale information and attention mechanism to obtain better classification performance. The methods based on multi-scale can obtain more accurate feature description by fusing different levels of information, and the methods based on attention can make the deep learning models focus on more valuable information in the image. However, the current methods usually treat the acquisition of multi-scale feature maps and the acquisition of attention weights as two separate steps in sequence. Since human eyes usually use these two methods at the same time when observing objects, we propose a multi-scale attention (MSA) module. The proposed MSA module directly extracts the attention information of different scales from a feature map, that is, the multi-scale and attention methods are simultaneously completed in one step. In the MSA module, we obtain different scales of channel and spatial attention by controlling the size of the convolution kernel for cross-channel and cross-space information interaction. Our module can be easily integrated into different convolutional neural networks to form Multi-scale attention networks (MSANet) architectures. We demonstrate the performance of MSANet on CIFAR-10 and CIFAR-100 data sets. In particular, the accuracy of our ResNet-110 based model on CIFAR-10 is 94.39%. Compared with the benchmark convolution model, our proposed multi-scale attention module can bring a roughly 3% increase in accuracy rate on CIFAR-100. Experimental results show that the proposed multi-scale attention module is superior in image classification.
引用
收藏
页码:34325 / 34344
页数:20
相关论文
共 50 条
  • [1] MSANet: Multi-scale attention networks for image classification
    Ping Cao
    Fangxin Xie
    Shichao Zhang
    Zuping Zhang
    Jianfeng Zhang
    [J]. Multimedia Tools and Applications, 2022, 81 : 34325 - 34344
  • [2] Multi-scale Contrastive Learning with Attention for Histopathology Image Classification
    Tan, Jing Wei
    Khoa Tuan Nguyen
    Lee, Kyoungbun
    Jeong, Won-Ki
    [J]. MEDICAL IMAGING 2023, 2023, 12471
  • [3] HYPERSPECTRAL IMAGE CLASSIFICATION VIA MULTI-SCALE RESIDUAL ATTENTION NETWORK
    Xie, Wen
    Wu, Qinzhe
    Ren, Wen
    Zhang, Yuzhuo
    [J]. IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 7649 - 7652
  • [4] MSANet: an improved semantic segmentation method using multi-scale attention for remote sensing images
    Zhang, Xiaolu
    Wang, Zhaoshun
    Zhang, Jianheng
    Wei, Anlei
    [J]. REMOTE SENSING LETTERS, 2022, 13 (12) : 1249 - 1259
  • [5] Dynamic multi-scale image classification
    Salden, A
    Iacob, S
    [J]. INTERNET IMAGING II, 2001, 4311 : 156 - 167
  • [6] Hyperspectral Image Classification Based on Multi-Scale Residual Network with Attention Mechanism
    Qing, Yuhao
    Liu, Wenyi
    [J]. REMOTE SENSING, 2021, 13 (03) : 1 - 18
  • [7] CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
    Chen, Chun-Fu
    Fan, Quanfu
    Panda, Rameswar
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 347 - 356
  • [8] A New Multi-Scale Convolutional Model Based on Multiple Attention for Image Classification
    Yang, Yadong
    Xu, Chengji
    Dong, Feng
    Wang, Xiaofeng
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (01):
  • [9] Multi-Scale Dense Networks for Hyperspectral Remote Sensing Image Classification
    Zhang, Chunju
    Li, Guandong
    Du, Shihong
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (11): : 9201 - 9222
  • [10] Multi-scale hierarchical recurrent neural networks for hyperspectral image classification
    Shi, Cheng
    Pun, Chi-Man
    [J]. NEUROCOMPUTING, 2018, 294 : 82 - 93