UTDM: a universal transformer-based diffusion model for multi-weather-degraded images restoration

被引:0
|
作者
Yu, Yongbo [1 ]
Li, Weidong [1 ]
Bai, Linyan [2 ,3 ]
Duan, Jinlong [1 ]
Zhang, Xuehai [1 ]
机构
[1] Henan Univ Technol, Coll Informat Sci & Engn, Zhengzhou 450001, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Digital Earth Sci, Beijing 100094, Peoples R China
[3] Int Res Ctr Big Data Sustainable Dev Goals, Beijing 100094, Peoples R China
来源
关键词
Diffusion models; Attention mechanism; Weather-degraded image restoration; Image restoration; Vision transformer; RAINDROP REMOVAL; NETWORK;
D O I
10.1007/s00371-024-03659-x
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Restoring multi-weather-degraded images is significant for subsequent high-level computer vision tasks. However, most existing image restoration algorithms only target single-weather-degraded images, and there are few general models for multi-weather-degraded image restoration. In this paper, we propose a diffusion model for multi-weather-degraded image restoration, namely a universal transformer-based diffusion model (UTDM) for multi-weather-degraded images restoration, by combining the denoising diffusion probability model and Vision Transformer (ViT). First, UTDM uses weather-degraded images as conditions to guide the diffusion model to generate clean background images through reverse sampling. Secondly, we propose a Cascaded Fusion Noise Estimation Transformer (CFNET) based on ViT, which utilizes degraded and noisy images for noise estimation. By introducing cascaded contextual fusion attention in a cascaded manner to compute contextual fusion attention mechanisms for different heads, CFNET explores the commonalities and characteristics of multi-weather-degraded images, fully capturing global and local feature information to improve the model's generalization ability on various weather-degraded images. UTDM outperformed the existing algorithm by 0.14-4.55,dB on the Raindrop-A test set, and improved by 0.99 dB and 1.24 dB compared with Transweather on the Snow100K-L and Test1 test sets. Experimental results show that our method outperforms general and specific restoration task algorithms on synthetic and real-world degraded image datasets. Code and dataset are available at: https://github.com/RHEPI/UTDM.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] TransWeather: Transformer-based Restoration of Images Degraded by Adverse Weather Conditions
    Valanarasu, Jeya Maria Jose
    Yasarla, Rajeev
    Patel, Vishal M.
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2343 - 2353
  • [2] A Transformer-Based Diffusion Model for All-in-One Weather-Degraded Image Restoration
    Qin, Jing
    Wen, Yuanbo
    Gao, Tao
    Liu, Yao
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2024, 58 (10): : 1606 - 1617
  • [3] TransDDPM: Transformer-Based Denoising Diffusion Probabilistic Model for Image Restoration
    Wei, Pan
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XI, 2024, 14435 : 250 - 263
  • [4] TUFusion: A Transformer-Based Universal Fusion Algorithm for Multimodal Images
    Zhao, Yangyang
    Zheng, Qingchun
    Zhu, Peihao
    Zhang, Xu
    Ma, Wenpeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1712 - 1725
  • [5] LayoutDM: Transformer-based Diffusion Model for Layout Generation
    Chai, Shang
    Zhuang, Liansheng
    Yan, Fengying
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18349 - 18358
  • [6] TransDiffSeg: Transformer-Based Conditional Diffusion Segmentation Model for Abdominal Multi-Objective
    Gu, WenWen
    Zhang, GuoDong
    Ju, RongHui
    Wang, SuRan
    Li, YanLin
    Liang, TingYu
    Guo, Wei
    Gong, ZhaoXuan
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024, : 262 - 280
  • [7] Adaptive shock-diffusion model for restoration of degraded document images
    Guo, Jiebin
    He, Chuanjiang
    APPLIED MATHEMATICAL MODELLING, 2020, 79 (79) : 555 - 565
  • [8] A weighted logarithmic model based enhancement of weather degraded images
    1600, Science and Engineering Research Support Society, 20 Virginia Court, Sandy Bay, Tasmania, Australia (06):
  • [9] A Transformer-based Semantic Segmentation Model for Street Fashion Images
    Peng, Dingjie
    Kameyama, Wataru
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2023, 2023, 12592
  • [10] Research on Restoration of Murals Based on Diffusion Model and Transformer
    Wang, Yaoyao
    Xiao, Mansheng
    Hu, Yuqing
    Yan, Jin
    Zhu, Zeyu
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (03): : 4433 - 4449