Multi-level wavelet network based on CNN-Transformer hybrid attention for single image deraining

被引：0

作者：

Bin Liu

Siyan Fang

机构：

[1] Hubei University,School of Computer and Information Engineering

来源：

Neural Computing and Applications | 2023年 / 35卷

关键词：

Single image deraining; Convolution; Transformer; Attention; Wavelet transform;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Removing rain streaks from rainy images can improve the accuracy of computer vision applications such as object detection. In order to make full use of the frequency domain analysis characteristics of wavelet and combine the advantages of Convolutional Neural Network (CNN) and Transformer, a Multi-level Wavelet Network Based on CNN-Transformer Hybrid Attention (MWN-CTHA) for single image deraining is proposed. MWN-CTHA obtains multi-scale low-frequency and high-frequency images through multi-level non-separable lifting wavelet transform and uses CNN-Transformer Hybrid Attention Block (CTHAB) to learn global structure and detail information from low-frequency and high-frequency, respectively. CTHAB consists of CA-SA Layer (CSL) and Detail-enhanced Attention Feed-forward Layer (DAFL). CSL uses the non-local modeling ability of self-attention to capture long-range rain streaks and uses convolutional attention to enhance the search ability for local rain streaks, where convolution can assist self-attention to achieve better feature representation. DAFL utilizes Depth-wise Convolutional Layer to supplement detailed features and filters the information of feed-forward layer through Dual-branch Attention. The experimental results on the four synthetic datasets demonstrate that the proposed method achieves higher PSNR and SSIM than the state-of-the-art method DANet, with an improvement of 1.07 dB and 0.0098, respectively. The code is available at https://github.com/fashyon/MWN-CTHA.

引用

页码：22387 / 22404

页数：17

共 50 条

[1] Multi-level wavelet network based on CNN-Transformer hybrid attention for single image deraining
Liu, Bin
Fang, Siyan
[J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (30): : 22387 - 22404
[2] Hybrid CNN-Transformer Feature Fusion for Single Image Deraining
Chen, Xiang
Pan, Jinshan
Lu, Jiyang
Fan, Zhentao
Li, Hao
[J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 378 - 386
[3] MLTDNet: an efficient multi-level transformer network for single image deraining
Gao, Feng
Mu, Xiangyu
Ouyang, Chao
Yang, Kai
Ji, Shengchang
Guo, Jie
Wei, Haokun
Wang, Nan
Ma, Lei
Yang, Biao
[J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (16): : 14013 - 14027
[4] MLTDNet: an efficient multi-level transformer network for single image deraining
Feng Gao
Xiangyu Mu
Chao Ouyang
Kai Yang
Shengchang Ji
Jie Guo
Haokun Wei
Nan Wang
Lei Ma
Biao Yang
[J]. Neural Computing and Applications, 2022, 34 : 14013 - 14027
[5] Recurrent multi-level residual and global attention network for single image deraining
Meihua Wang
Chao Li
Fanhui Ke
[J]. Neural Computing and Applications, 2023, 35 : 3697 - 3708
[6] Recurrent multi-level residual and global attention network for single image deraining
Wang, Meihua
Li, Chao
Ke, Fanhui
[J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (05): : 3697 - 3708
[7] Image harmonization with Simple Hybrid CNN-Transformer Network
Li, Guanlin
Zhao, Bin
Li, Xuelong
[J]. NEURAL NETWORKS, 2024, 180
[8] Learning a multi-level guided residual network for single image deraining
Wang, Cong
Zhang, Man
Su, Zhixun
Wu, Yutong
Yao, Guangle
Wang, Hongyan
[J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2019, 78 : 206 - 215
[9] TFCNs: A CNN-Transformer Hybrid Network for Medical Image Segmentation
Li, Zihan
Li, Dihan
Xu, Cangbai
Wang, Weice
Hong, Qingqi
Li, Qingde
Tian, Jie
[J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 781 - 792
[10] Cross Attention Multi Scale CNN-Transformer Hybrid Encoder Is General Medical Image Learner
Zhou, Rongzhou
Yao, Junfeng
Hong, Qingqi
Li, Xingxin
Cao, Xianpeng
[J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XIII, 2024, 14437 : 85 - 97

← 1 2 3 4 5 →