DIRformer: A Novel Image Restoration Approach Based on U-shaped Transformer and Diffusion Models

Authors
Hu, Cong [1]
Wei, Xiao-zhong [1]
Wu, Xiao-jun [1]
Affiliations
[1] Jiangnan University, School of Artificial Intelligence and Computer Science, Wuxi, People's Republic of China
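Funding
China Postdoctoral Science Foundation; National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
Diffusion models; Image restoration; Transformer; Quality assessment; CNN
DOI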
10.1145/3703632
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Image restoration (IR) recovers missing or damaged image information and is a significant challenge in visual reconstruction. U-Net-based diffusion models (DMs) currently deliver favorable results on IR tasks, but they fall short in capturing global context. To address this issue, we propose DIRformer, a novel image restoration approach based on a U-shaped transformer and DMs, which enhances the modeling of long-range dependencies within DMs. In particular, DIRformer replaces the traditional U-Net downsampling with Patch merging to improve detail preservation, and replaces upsampling with Dual up-sample to alleviate checkerboard artifacts. In addition, as a lightweight and versatile transformer-based solution for IR, DIRformer incorporates time and degradation mapping into the transformer design while preserving the fundamental U-shaped structure. We evaluate DIRformer in a multi-task IR setting across four datasets. The experiments show that DIRformer achieves competitive performance on distortion metrics, including PSNR and SSIM. Notably, the proposed approach is almost 25x smaller and 2x faster than existing methods while achieving comparably high performance.
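As a rough illustration of two ideas the abstract names, the sketch below shows a patch-merging style downsampling step and the PSNR distortion metric in plain Python. It is not the authors' implementation: the function names `patch_merge` and `psnr` are hypothetical, the merging follows the common space-to-depth form (each 2x2 block of pixels is stacked along the channel axis, so no pixel values are discarded), and any learned projection that would normally follow the merge is omitted.

```python
import math


def patch_merge(x):
    """Space-to-depth sketch of patch merging (assumed form, not the paper's code).

    x is a nested list shaped (H, W, C) with even H and W. Each 2x2 block of
    pixels is concatenated along the channel axis, giving (H/2, W/2, 4C).
    Resolution halves, but every input value is kept.
    """
    h, w = len(x), len(x[0])
    if h % 2 or w % 2:
        raise ValueError("H and W must be even")
    return [
        # Channel lists are joined in the order: top-left, bottom-left,
        # top-right, bottom-right.
        [x[i][j] + x[i + 1][j] + x[i][j + 1] + x[i + 1][j + 1]
         for j in range(0, w, 2)]
        for i in range(0, h, 2)
    ]


def psnr(reference, restored, max_value=1.0):
    """Peak signal-to-noise ratio in dB over two equal-length flat sequences."""
    if len(reference) != len(restored):
        raise ValueError("inputs must have the same length")
    mse = sum((a - b) ** 2 for a, b in zip(reference, restored)) / len(reference)
    if mse == 0:
        return float("inf")  # identical signals
    return 10.0 * math.log10(max_value ** 2 / mse)


# Usage: a 4x4 single-channel map whose pixel at (i, j) holds 10*i + j.
feature_map = [[[10 * i + j] for j in range(4)] for i in range(4)]
merged = patch_merge(feature_map)  # shape (2, 2, 4)
```

Because the merge only rearranges values, fine detail remains available to later layers, which is one plausible reading of the "detail preservation" claim; a strided pooling step, by contrast, would drop or average pixels.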
Pages: 23