Wavelet-Attention CNN for image classification

Authors
Zhao, Xiangyu [1]
Huang, Peng [1]
Shu, Xiangbo [1]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Xiaolingwei St, Nanjing 210094, Jiangsu, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
Convolutional neural network; Wavelet Transform; Wavelet-Attention; Image classification; NEURAL-NETWORK; TRANSFORM
DOI
10.1007/s00530-022-00889-8
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Feature learning methods based on convolutional neural networks (CNNs) have achieved great success in image classification. However, inherent noise and other factors may weaken the effectiveness of convolutional feature statistics. In this paper, we investigate the Discrete Wavelet Transform (DWT) in the frequency domain and design a new Wavelet-Attention (WA) block that applies attention only in the high-frequency domain. Based on this block, we propose a Wavelet-Attention convolutional neural network (WA-CNN) for image classification. Specifically, WA-CNN decomposes the feature maps into low-frequency components, which store the structures of the basic objects, and high-frequency components, which store detailed information and noise. The WA block then captures the detailed information in the high-frequency domain with different attention factors while preserving the basic object structures in the low-frequency domain. Experimental results on the CIFAR-10 and CIFAR-100 datasets show that WA-CNN achieves significant improvements in classification accuracy over related networks. With MobileNetV2 backbones, WA-CNN improves Top-1 accuracy by 1.26% on the CIFAR-10 benchmark and by 1.54% on the CIFAR-100 benchmark.
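The decomposition described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' WA block: the single-level Haar wavelet, the softmax over mean absolute band magnitude, and the function names `haar_dwt2` and `wavelet_attention_sketch` are all assumptions chosen for illustration. It only shows the general idea of splitting a feature map into one low-frequency and three high-frequency sub-bands, weighting the high-frequency bands with attention factors, and leaving the low-frequency band untouched.

```python
import numpy as np

def haar_dwt2(x):
    """Single-level orthonormal 2D Haar DWT of an (H, W) array, H and W even."""
    a = x[0::2, 0::2]  # top-left sample of each 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    ll = (a + b + c + d) / 2.0  # low-frequency approximation
    # Three high-frequency detail bands (LH/HL naming varies between libraries).
    lh = (a + b - c - d) / 2.0
    hl = (a - b + c - d) / 2.0
    hh = (a - b - c + d) / 2.0
    return ll, lh, hl, hh

def softmax(z):
    e = np.exp(z - z.max())  # subtract max for numerical stability
    return e / e.sum()

def wavelet_attention_sketch(x):
    """Hypothetical attention: reweight detail bands, keep LL unchanged."""
    ll, lh, hl, hh = haar_dwt2(x)
    detail = np.stack([lh, hl, hh])  # shape (3, H/2, W/2)
    # Assumed scoring rule: one factor per band from its mean absolute value.
    factors = softmax(np.abs(detail).mean(axis=(1, 2)))
    weighted = (factors[:, None, None] * detail).sum(axis=0)
    return ll + weighted, factors

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feature_map = rng.normal(size=(8, 8))
    out, factors = wavelet_attention_sketch(feature_map)
    print(out.shape, factors)
```

In a real network the attention factors would typically be learned and applied per channel; the fixed scoring rule here merely stands in for that step.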
Pages: 915-924
Page count: 10
Source
Multimedia Systems, 2022, 28: 915-924