MSANet: Multi-scale attention networks for image classification

Cited by: 4
Authors
Cao, Ping [1, 2]
Xie, Fangxin [1]
Zhang, Shichao [1]
Zhang, Zuping [1]
Zhang, Jianfeng [3]
Affiliations
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China
[2] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing, Peoples R China
[3] Natl Univ Def Technol, Coll Comp Sci, Changsha, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Image classification; Convolutional neural network; Multi-scale feature; Channel attention; Spatial attention; TEXTURE; SCALE;
DOI
10.1007/s11042-022-12792-5
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Classifying images according to the principles of human vision is a major task in computer vision. Multi-scale information and attention mechanisms are commonly used to improve classification performance. Multi-scale methods obtain more accurate feature descriptions by fusing information from different levels, and attention-based methods make deep learning models focus on the more valuable information in an image. However, current methods usually treat the acquisition of multi-scale feature maps and the acquisition of attention weights as two separate, sequential steps. Because human eyes usually apply both at the same time when observing objects, we propose a multi-scale attention (MSA) module. The proposed MSA module extracts attention information at different scales directly from a feature map; that is, multi-scale extraction and attention are completed in a single step. In the MSA module, we obtain channel and spatial attention at different scales by controlling the size of the convolution kernel used for cross-channel and cross-space information interaction. Our module can be easily integrated into different convolutional neural networks to form multi-scale attention network (MSANet) architectures. We demonstrate the performance of MSANet on the CIFAR-10 and CIFAR-100 datasets. In particular, our ResNet-110-based model reaches an accuracy of 94.39% on CIFAR-10. Compared with the baseline convolutional model, the proposed multi-scale attention module brings an increase of roughly 3% in accuracy on CIFAR-100. Experimental results show that the proposed multi-scale attention module is superior in image classification.
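The abstract does not give the MSA formulas, so the following is only a rough sketch of the idea that the convolution kernel size controls the scale of cross-channel interaction. It is not the authors' implementation. The function names, the uniform averaging kernels (standing in for learned weights), the kernel sizes 3 and 5, and the fusion of branches by averaging are all assumptions made for illustration, and only the channel branch is shown.

```python
import math

def global_avg_pool(fmap):
    """Collapse each channel (an H x W grid of floats) to its mean value."""
    return [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in fmap]

def conv1d_same(signal, kernel):
    """1D cross-correlation with zero padding; assumes an odd kernel length."""
    pad = len(kernel) // 2
    padded = [0.0] * pad + list(signal) + [0.0] * pad
    return [
        sum(k * padded[i + j] for j, k in enumerate(kernel))
        for i in range(len(signal))
    ]

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def multi_scale_channel_attention(fmap, kernel_sizes=(3, 5)):
    """Hypothetical channel attention computed at several kernel sizes.

    fmap is a list of channels, each a list of rows of floats (C x H x W).
    Returns the reweighted feature map and the per-channel weights.
    """
    descriptor = global_avg_pool(fmap)
    # One branch per kernel size: a larger kernel mixes more neighbouring channels.
    # Uniform kernels are an assumption; a trained model would learn these weights.
    branches = [conv1d_same(descriptor, [1.0 / k] * k) for k in kernel_sizes]
    # Assumed fusion rule: average the branch outputs, then squash to (0, 1).
    weights = [sigmoid(sum(vals) / len(vals)) for vals in zip(*branches)]
    scaled = [[[w * v for v in row] for row in ch] for w, ch in zip(weights, fmap)]
    return scaled, weights
```

Calling `multi_scale_channel_attention(fmap)` on a nested list of shape C x H x W returns a map of the same shape together with one weight per channel. A spatial branch could follow the same pattern with 2D kernels of different sizes, but the abstract does not specify how the two branches are combined.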
Pages: 34325 - 34344
Page count: 20
Related papers
50 in total
  • [41] Lightweight multi-scale aggregated residual attention networks for image super-resolution
    Shurong Pang
    Zhe Chen
    Fuliang Yin
    [J]. Multimedia Tools and Applications, 2022, 81 : 4797 - 4819
  • [42] Image Inpainting with EMMA Attention and Multi-scale Fusion
    Wei, Yun
    Wang, Lulu
    Wu, Kaijun
    Shan, Hongquan
    Tian, Bin
    [J]. Hunan Daxue Xuebao/Journal of Hunan University Natural Sciences, 2024, 51 (12): : 87 - 97
  • [43] Multi-scale attention in attention neural network for single image deblurring
    Lee, Ho Sub
    Cho, Sung In
    [J]. Displays, 2024, 85
  • [44] Radar Signal Classification with Multi-Frequency Multi-Scale Deformable Convolutional Networks and Attention Mechanisms
    Liang, Ruofei
    Cen, Yigang
    [J]. REMOTE SENSING, 2024, 16 (08)
  • [45] Hierarchical Multi-scale Attention Networks for action recognition
    Yan, Shiyang
    Smith, Jeremy S.
    Lu, Wenjin
    Zhang, Bailing
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 61 : 73 - 84
  • [46] Incorporating Attention Mechanism, Dense Connection Blocks, and Multi-Scale Reconstruction Networks for Open-Set Hyperspectral Image Classification
    Zhou, Huaming
    Wu, Haibin
    Wang, Aili
    Iwahori, Yuji
    Yu, Xiaoyu
    [J]. REMOTE SENSING, 2023, 15 (18)
  • [47] Bi-directional LSTM with multi-scale dense attention mechanism for hyperspectral image classification
    Jinxiong Gao
    Xiumei Gao
    Nan Wu
    Hongye Yang
    [J]. Multimedia Tools and Applications, 2022, 81 : 24003 - 24020
  • [48] Remote Sensing Image Scene Classification Method Based on Multi-Scale Cyclic Attention Network
    Ma X.
    Wang L.
    Qi K.
    Zheng G.
    [J]. Diqiu Kexue - Zhongguo Dizhi Daxue Xuebao/Earth Science - Journal of China University of Geosciences, 2021, 46 (10): : 3740 - 3752
  • [49] Recursive Multi-Scale Channel-Spatial Attention for Fine-Grained Image Classification
    Liu, Dichao
    Wang, Yu
    Mase, Kenji
    Kato, Jien
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (03) : 713 - 726
  • [50] A Recurrent Attention Multi-Scale CNN-LSTM Network Based on Hyperspectral Image Classification
    Zhang, Xinyue
    Zuo, Jing
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2023, 32 (11)