CUR Transformer: A Convolutional Unbiased Regional Transformer for Image Denoising

被引:10
|
作者
Xu, Kang [1 ]
Li, Weixin [2 ,3 ]
Wang, Xia [1 ]
Hu, Xiaoyan [1 ]
Yan, Ke [4 ]
Wang, Xiaojie [1 ]
Dong, Xuan [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] Beihang Univ, Beijing, Peoples R China
[3] Zhongguancun Lab, Beijing, Peoples R China
[4] DAMO Acad, Beijing, Peoples R China
关键词
Image denoising; non-local filters; regional self-attention; computer vision; local non-overlapped windows; NETWORK;
D O I
10.1145/3566125
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image denoising is a fundamental problem in computer vision and multimedia computation. Non-local filters are effective for image denoising. But existing deep learning methods that use non-local computation structures are mostly designed for high-level tasks, and global self-attention is usually adopted. For the task of image denoising, they have high computational complexity and have a lot of redundant computation of uncorrelated pixels. To solve this problem and combine the marvelous advantages of non-local filter and deep learning, we propose a Convolutional Unbiased Regional (CUR) transformer. Based on the prior that, for each pixel, its similar pixels are usually spatially close, our insights are that (1) we partition the image into non-overlapped windows and perform regional self-attention to reduce the search range of each pixel, and (2) we encourage pixels across different windows to communicate with each other. Based on our insights, the CUR transformer is cascaded by a series of convolutional regional self-attention (CRSA) blocks with U-style short connections. In each CRSA block, we use convolutional layers to extract the query, key, and value features, namely Q, K, and V, of the input feature. Then, we partition the Q, K, and V features into local non-overlapped windows and perform regional self-attention within each window to obtain the output feature of this CRSA block. Among different CRSA blocks, we perform the unbiased window partition by changing the partition positions of the windows. Experimental results show that the CUR transformer outperforms the state-of-the-art methods significantly on four low-level vision tasks, including real and synthetic image denoising, JPEG compression artifact reduction, and low-light image enhancement.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] A cross Transformer for image denoising
    Tian, Chunwei
    Zheng, Menghua
    Zuo, Wangmeng
    Zhang, Shichao
    Zhang, Yanning
    Lin, Chia-Wen
    INFORMATION FUSION, 2024, 102
  • [2] A Dynamic Network with Transformer for Image Denoising
    Song, Mingjian
    Wang, Wenbo
    Zhao, Yue
    ELECTRONICS, 2024, 13 (09)
  • [3] Heterogeneous Window Transformer for Image Denoising
    Tian, Chunwei
    Zheng, Menghua
    Lin, Chia-Wen
    Li, Zhiwu
    Zhang, David
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024,
  • [4] Dense Residual Transformer for Image Denoising
    Yao, Chao
    Jin, Shuo
    Liu, Meiqin
    Ban, Xiaojuan
    ELECTRONICS, 2022, 11 (03)
  • [5] Progressive convolutional transformer for image restoration
    Wan, Yecong
    Shao, Mingwen
    Cheng, Yuanshuo
    Meng, Deyu
    Zuo, Wangmeng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
  • [6] IMAGE STEGANALYSIS WITH CONVOLUTIONAL VISION TRANSFORMER
    Luo, Ge
    Wei, Ping
    Zhu, Shuwen
    Zhang, Xinpeng
    Qian, Zhenxing
    Li, Sheng
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3089 - 3093
  • [7] GLUformer: An Efficient Transformer Network for Image Denoising
    Xue, Chenghao
    Qian, Pengjiang
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V, 2023, 14090 : 797 - 807
  • [8] SUNet: Swin Transformer UNet for Image Denoising
    Fan, Chi-Mao
    Liu, Tsung-Jung
    Liu, Kuan-Hsien
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 2333 - 2337
  • [9] An effective masked transformer network for image denoising
    Xu, Shaoping
    Xiao, Nan
    Tao, Wuyong
    Zhou, Changfei
    Xiong, Minghai
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (6-7) : 4997 - 5010
  • [10] Convolutional Transformer Network for Hyperspectral Image Classification
    Zhao, Zhengang
    Hu, Dan
    Wang, Hao
    Yu, Xianchuan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19