Restormer: Efficient Transformer for High-Resolution Image Restoration

被引:912
|
作者
Zamir, Syed Waqas [1 ]
Arora, Aditya [1 ]
Khan, Salman [2 ]
Hayat, Munawar [2 ,3 ]
Khan, Fahad Shahbaz [2 ,4 ]
Yang, Ming-Hsuan [5 ,6 ,7 ]
机构
[1] Incept Inst AI, Abu Dhabi, U Arab Emirates
[2] Mohamed Bin Zayed Univ AI, Abu Dhabi, U Arab Emirates
[3] Monash Univ, Clayton, Vic, Australia
[4] Linkoping Univ, Linkoping, Sweden
[5] Univ Calif Merced, Merced, CA USA
[6] Yonsei Univ, Seoul, South Korea
[7] Google Res, Mountain View, CA USA
基金
澳大利亚研究理事会;
关键词
D O I
10.1109/CVPR52688.2022.00564
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since convolutional neural networks (CNNs) perform well at learning generalizable image priors from largescale data, these models have been extensively applied to image restoration and related tasks. Recently, another class of neural architectures, Transformers, have shown significant performance gains on natural language and high-level vision tasks. While the Transformer model mitigates the shortcomings of CNNs (i.e., limited receptive field and inadaptability to input content), its computational complexity grows quadratically with the spatial resolution, therefore making it infeasible to apply to most image restoration tasks involving high-resolution images. In this work, we propose an efficient Transformer model by making several key designs in the building blocks (multi-head attention and feed-forward network) such that it can capture long-range pixel interactions, while still remaining applicable to large images. Our model, named Restoration Transformer (Restormer), achieves state-of-the-art results on several image restoration tasks, including image deraining, single-image motion deblurring, defocus deblurring (single-image and dual-pixel data), and image denoising (Gaussian grayscale/color denoising, and real image denoising). The source code and pre-trained models are available at https://github.com/swz30/Restormer.
引用
收藏
页码:5718 / 5729
页数:12
相关论文
共 50 条
  • [1] HIGH-RESOLUTION RESTORATION OF DYNAMIC IMAGE SEQUENCES
    WENYUSU
    KIM, SP
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 1994, 5 (04) : 330 - 339
  • [2] Blind Blur Image Restoration For High-resolution Images
    Teranishi, Ryohei
    Nagata, Takahiro
    Goto, Tomio
    Hirano, Satoshi
    2018 IEEE 7TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE 2018), 2018, : 533 - 534
  • [3] High-resolution terahertz reflective imaging and image restoration
    Ding, Sheng-Hui
    Li, Qi
    Yao, Rui
    Wang, Qi
    APPLIED OPTICS, 2010, 49 (36) : 6834 - 6839
  • [4] High-Resolution Swin Transformer for Automatic Medical Image Segmentation
    Wei, Chen
    Ren, Shenghan
    Guo, Kaitai
    Hu, Haihong
    Liang, Jimin
    SENSORS, 2023, 23 (07)
  • [5] Comprehensive and Delicate: An Efficient Transformer for Image Restoration
    Zhao, Haiyu
    Gou, Yuanbiao
    Li, Boyun
    Peng, Dezhong
    Lv, Jiancheng
    Peng, Xi
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14122 - 14132
  • [6] SegEM: Efficient Image Analysis for High-Resolution Connectomics
    Berning, Manuel
    Boergens, Kevin M.
    Helmstaedter, Moritz
    NEURON, 2015, 87 (06) : 1193 - 1206
  • [7] Style-Guided Inference of Transformer for High-resolution Image Synthesis
    Yim, Jonghwa
    Kim, Minjae
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 1745 - 1755
  • [8] StyleSwin: Transformer-based GAN for High-resolution Image Generation
    Zhang, Bowen
    Gu, Shuyang
    Zhang, Bo
    Bao, Jianmin
    Chen, Dong
    Wen, Fang
    Wang, Yong
    Guo, Baining
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11294 - 11304
  • [9] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
    Chen, Xuanyao
    Liu, Zhijian
    Tang, Haotian
    Yi, Li
    Zhao, Hang
    Han, Song
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2061 - 2070
  • [10] Improved Transformer for High-Resolution GANs
    Zhao, Long
    Zhang, Zizhao
    Chen, Ting
    Metaxas, Dimitris N.
    Zhang, Han
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34