UTDM: a universal transformer-based diffusion model for multi-weather-degraded images restoration

被引:0
|
作者
Yu, Yongbo [1 ]
Li, Weidong [1 ]
Bai, Linyan [2 ,3 ]
Duan, Jinlong [1 ]
Zhang, Xuehai [1 ]
机构
[1] Henan Univ Technol, Coll Informat Sci & Engn, Zhengzhou 450001, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Digital Earth Sci, Beijing 100094, Peoples R China
[3] Int Res Ctr Big Data Sustainable Dev Goals, Beijing 100094, Peoples R China
来源
关键词
Diffusion models; Attention mechanism; Weather-degraded image restoration; Image restoration; Vision transformer; RAINDROP REMOVAL; NETWORK;
D O I
10.1007/s00371-024-03659-x
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Restoring multi-weather-degraded images is significant for subsequent high-level computer vision tasks. However, most existing image restoration algorithms only target single-weather-degraded images, and there are few general models for multi-weather-degraded image restoration. In this paper, we propose a diffusion model for multi-weather-degraded image restoration, namely a universal transformer-based diffusion model (UTDM) for multi-weather-degraded images restoration, by combining the denoising diffusion probability model and Vision Transformer (ViT). First, UTDM uses weather-degraded images as conditions to guide the diffusion model to generate clean background images through reverse sampling. Secondly, we propose a Cascaded Fusion Noise Estimation Transformer (CFNET) based on ViT, which utilizes degraded and noisy images for noise estimation. By introducing cascaded contextual fusion attention in a cascaded manner to compute contextual fusion attention mechanisms for different heads, CFNET explores the commonalities and characteristics of multi-weather-degraded images, fully capturing global and local feature information to improve the model's generalization ability on various weather-degraded images. UTDM outperformed the existing algorithm by 0.14-4.55,dB on the Raindrop-A test set, and improved by 0.99 dB and 1.24 dB compared with Transweather on the Snow100K-L and Test1 test sets. Experimental results show that our method outperforms general and specific restoration task algorithms on synthetic and real-world degraded image datasets. Code and dataset are available at: https://github.com/RHEPI/UTDM.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Histology Image Artifact Restoration with Lightweight Transformer Based Diffusion Model
    Wang, Chong
    He, Zhenqi
    He, Junjun
    Ye, Jin
    Shen, Yiqing
    ARTIFICIAL INTELLIGENCE IN MEDICINE, PT II, AIME 2024, 2024, 14845 : 81 - 89
  • [22] SRT: Improved transformer-based model for classification of 2D heartbeat images
    Wu, Wenwen
    Huang, Yanqi
    Wu, Xiaomei
    Biomedical Signal Processing and Control, 2024, 88
  • [23] Multi-Modal Pedestrian Crossing Intention Prediction with Transformer-Based Model
    Wang, Ting-Wei
    Lai, Shang-Hong
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2024, 13 (05)
  • [24] TransMF: Transformer-Based Multi-Scale Fusion Model for Crack Detection
    Ju, Xiaochen
    Zhao, Xinxin
    Qian, Shengsheng
    MATHEMATICS, 2022, 10 (13)
  • [25] SRT: Improved transformer-based model for classification of 2D heartbeat images
    Wu, Wenwen
    Huang, Yanqi
    Wu, Xiaomei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 88
  • [26] Pedestrian Crossing Intention Prediction with Multi-Modal Transformer-Based Model
    Wang, Ting Wei
    Lai, Shang-Hong
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1349 - 1356
  • [27] Research on Multi-Scale CNN and Transformer-Based Multi-Level Multi-Classification Method for Images
    Gou, Quandeng
    Ren, Yuheng
    IEEE ACCESS, 2024, 12 : 103049 - 103059
  • [28] Transformer-based multi-task learning for classification and segmentation of gastrointestinal tract endoscopic images
    Tang, Suigu
    Yu, Xiaoyuan
    Cheang, Chak Fong
    Liang, Yanyan
    Zhao, Penghui
    Yu, Hon Ho
    Choi, I. Cheong
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 157
  • [29] MelodyDiffusion: Chord-Conditioned Melody Generation Using a Transformer-Based Diffusion Model
    Li, Shuyu
    Sung, Yunsick
    MATHEMATICS, 2023, 11 (08)
  • [30] Transformer-based framework for multi-class segmentation of skin cancer from histopathology images
    Imran, Muhammad
    Tiwana, Mohsin Islam
    Mohsan, Mashood Mohammad
    Alghamdi, Norah Saleh
    Akram, Muhammad Usman
    FRONTIERS IN MEDICINE, 2024, 11