UTDM: a universal transformer-based diffusion model for multi-weather-degraded images restoration

被引：0

作者：

Yu, Yongbo ^{[1
]}

Li, Weidong ^{[1
]}

Bai, Linyan ^{[2
,3
]}

Duan, Jinlong ^{[1
]}

Zhang, Xuehai ^{[1
]}

机构：

[1] Henan Univ Technol, Coll Informat Sci & Engn, Zhengzhou 450001, Peoples R China

[2] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Digital Earth Sci, Beijing 100094, Peoples R China

[3] Int Res Ctr Big Data Sustainable Dev Goals, Beijing 100094, Peoples R China

来源：

VISUAL COMPUTER | 2024年

关键词：

Diffusion models; Attention mechanism; Weather-degraded image restoration; Image restoration; Vision transformer; RAINDROP REMOVAL; NETWORK;

D O I：

10.1007/s00371-024-03659-x

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Restoring multi-weather-degraded images is significant for subsequent high-level computer vision tasks. However, most existing image restoration algorithms only target single-weather-degraded images, and there are few general models for multi-weather-degraded image restoration. In this paper, we propose a diffusion model for multi-weather-degraded image restoration, namely a universal transformer-based diffusion model (UTDM) for multi-weather-degraded images restoration, by combining the denoising diffusion probability model and Vision Transformer (ViT). First, UTDM uses weather-degraded images as conditions to guide the diffusion model to generate clean background images through reverse sampling. Secondly, we propose a Cascaded Fusion Noise Estimation Transformer (CFNET) based on ViT, which utilizes degraded and noisy images for noise estimation. By introducing cascaded contextual fusion attention in a cascaded manner to compute contextual fusion attention mechanisms for different heads, CFNET explores the commonalities and characteristics of multi-weather-degraded images, fully capturing global and local feature information to improve the model's generalization ability on various weather-degraded images. UTDM outperformed the existing algorithm by 0.14-4.55,dB on the Raindrop-A test set, and improved by 0.99 dB and 1.24 dB compared with Transweather on the Snow100K-L and Test1 test sets. Experimental results show that our method outperforms general and specific restoration task algorithms on synthetic and real-world degraded image datasets. Code and dataset are available at: https://github.com/RHEPI/UTDM.

引用

页数：17

共 50 条

[1] TransWeather: Transformer-based Restoration of Images Degraded by Adverse Weather Conditions
Valanarasu, Jeya Maria Jose
Yasarla, Rajeev
Patel, Vishal M.
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2343 - 2353
[2] A Transformer-Based Diffusion Model for All-in-One Weather-Degraded Image Restoration
Qin, Jing
Wen, Yuanbo
Gao, Tao
Liu, Yao
Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2024, 58 (10): : 1606 - 1617
[3] TransDDPM: Transformer-Based Denoising Diffusion Probabilistic Model for Image Restoration
Wei, Pan
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XI, 2024, 14435 : 250 - 263
[4] TUFusion: A Transformer-Based Universal Fusion Algorithm for Multimodal Images
Zhao, Yangyang
Zheng, Qingchun
Zhu, Peihao
Zhang, Xu
Ma, Wenpeng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1712 - 1725
[5] LayoutDM: Transformer-based Diffusion Model for Layout Generation
Chai, Shang
Zhuang, Liansheng
Yan, Fengying
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18349 - 18358
[6] TransDiffSeg: Transformer-Based Conditional Diffusion Segmentation Model for Abdominal Multi-Objective
Gu, WenWen
Zhang, GuoDong
Ju, RongHui
Wang, SuRan
Li, YanLin
Liang, TingYu
Guo, Wei
Gong, ZhaoXuan
JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024, : 262 - 280
[7] Adaptive shock-diffusion model for restoration of degraded document images
Guo, Jiebin
He, Chuanjiang
APPLIED MATHEMATICAL MODELLING, 2020, 79 (79) : 555 - 565
[8] A weighted logarithmic model based enhancement of weather degraded images
1600, Science and Engineering Research Support Society, 20 Virginia Court, Sandy Bay, Tasmania, Australia (06):
[9] A Transformer-based Semantic Segmentation Model for Street Fashion Images
Peng, Dingjie
Kameyama, Wataru
INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2023, 2023, 12592
[10] Research on Restoration of Murals Based on Diffusion Model and Transformer
Wang, Yaoyao
Xiao, Mansheng
Hu, Yuqing
Yan, Jin
Zhu, Zeyu
CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (03): : 4433 - 4449

← 1 2 3 4 5 →