Multi-level wavelet network based on CNN-Transformer hybrid attention for single image deraining

被引:0
|
作者
Bin Liu
Siyan Fang
机构
[1] Hubei University,School of Computer and Information Engineering
来源
关键词
Single image deraining; Convolution; Transformer; Attention; Wavelet transform;
D O I
暂无
中图分类号
学科分类号
摘要
Removing rain streaks from rainy images can improve the accuracy of computer vision applications such as object detection. In order to make full use of the frequency domain analysis characteristics of wavelet and combine the advantages of Convolutional Neural Network (CNN) and Transformer, a Multi-level Wavelet Network Based on CNN-Transformer Hybrid Attention (MWN-CTHA) for single image deraining is proposed. MWN-CTHA obtains multi-scale low-frequency and high-frequency images through multi-level non-separable lifting wavelet transform and uses CNN-Transformer Hybrid Attention Block (CTHAB) to learn global structure and detail information from low-frequency and high-frequency, respectively. CTHAB consists of CA-SA Layer (CSL) and Detail-enhanced Attention Feed-forward Layer (DAFL). CSL uses the non-local modeling ability of self-attention to capture long-range rain streaks and uses convolutional attention to enhance the search ability for local rain streaks, where convolution can assist self-attention to achieve better feature representation. DAFL utilizes Depth-wise Convolutional Layer to supplement detailed features and filters the information of feed-forward layer through Dual-branch Attention. The experimental results on the four synthetic datasets demonstrate that the proposed method achieves higher PSNR and SSIM than the state-of-the-art method DANet, with an improvement of 1.07 dB and 0.0098, respectively. The code is available at https://github.com/fashyon/MWN-CTHA.
引用
收藏
页码:22387 / 22404
页数:17
相关论文
共 50 条
  • [1] Multi-level wavelet network based on CNN-Transformer hybrid attention for single image deraining
    Liu, Bin
    Fang, Siyan
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (30): : 22387 - 22404
  • [2] Hybrid CNN-Transformer Feature Fusion for Single Image Deraining
    Chen, Xiang
    Pan, Jinshan
    Lu, Jiyang
    Fan, Zhentao
    Li, Hao
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 378 - 386
  • [3] MLTDNet: an efficient multi-level transformer network for single image deraining
    Gao, Feng
    Mu, Xiangyu
    Ouyang, Chao
    Yang, Kai
    Ji, Shengchang
    Guo, Jie
    Wei, Haokun
    Wang, Nan
    Ma, Lei
    Yang, Biao
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (16): : 14013 - 14027
  • [4] MLTDNet: an efficient multi-level transformer network for single image deraining
    Feng Gao
    Xiangyu Mu
    Chao Ouyang
    Kai Yang
    Shengchang Ji
    Jie Guo
    Haokun Wei
    Nan Wang
    Lei Ma
    Biao Yang
    [J]. Neural Computing and Applications, 2022, 34 : 14013 - 14027
  • [5] Recurrent multi-level residual and global attention network for single image deraining
    Meihua Wang
    Chao Li
    Fanhui Ke
    [J]. Neural Computing and Applications, 2023, 35 : 3697 - 3708
  • [6] Recurrent multi-level residual and global attention network for single image deraining
    Wang, Meihua
    Li, Chao
    Ke, Fanhui
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (05): : 3697 - 3708
  • [7] Image harmonization with Simple Hybrid CNN-Transformer Network
    Li, Guanlin
    Zhao, Bin
    Li, Xuelong
    [J]. NEURAL NETWORKS, 2024, 180
  • [8] Learning a multi-level guided residual network for single image deraining
    Wang, Cong
    Zhang, Man
    Su, Zhixun
    Wu, Yutong
    Yao, Guangle
    Wang, Hongyan
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2019, 78 : 206 - 215
  • [9] TFCNs: A CNN-Transformer Hybrid Network for Medical Image Segmentation
    Li, Zihan
    Li, Dihan
    Xu, Cangbai
    Wang, Weice
    Hong, Qingqi
    Li, Qingde
    Tian, Jie
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 781 - 792
  • [10] Cross Attention Multi Scale CNN-Transformer Hybrid Encoder Is General Medical Image Learner
    Zhou, Rongzhou
    Yao, Junfeng
    Hong, Qingqi
    Li, Xingxin
    Cao, Xianpeng
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XIII, 2024, 14437 : 85 - 97