Attention-guided video super-resolution with recurrent multi-scale spatial–temporal transformer

被引:0
|
作者
Wei Sun
Xianguang Kong
Yanning Zhang
机构
[1] Xi’an University of Posts and Telecommunications,School of Computer Science and Technology
[2] Shaanxi Key Laboratory of Network Data Analysis and Intelligent Processing,School of Mechano
[3] Xidian University,Electronic Engineering
[4] Northwestern Polytechnical University,School of Computer Science and Engineering
来源
关键词
Video super-resolution; Spatial-temporal transformer; Attention mechanism; Motion compensation;
D O I
暂无
中图分类号
学科分类号
摘要
Video super-resolution (VSR) aims to recover the high-resolution (HR) contents from the low-resolution (LR) observations relying on compositing the spatial–temporal information in the LR frames. It is crucial to propagate and aggregate spatial–temporal information. Recently, while transformers show impressive performance on high-level vision tasks, few attempts have been made on image restoration, especially on VSR. In addition, previous transformers simultaneously process spatial–temporal information, easily synthesizing confused textures and high computational cost limit its development. Towards this end, we construct a novel bidirectional recurrent VSR architecture. Our model disentangles the task of learning spatial–temporal information into two easier sub-tasks, each sub-task focuses on propagating and aggregating specific information with a multi-scale transformer-based design, which alleviates the difficulty of learning. Additionally, an attention-guided motion compensation module is applied to get rid of the influence of misalignment between frames. Experiments on three widely used benchmark datasets show that, relying on superior feature correlation learning, the proposed network can outperform previous state-of-the-art methods, especially for recovering the fine details.
引用
收藏
页码:3989 / 4002
页数:13
相关论文
共 50 条
  • [1] Attention-guided video super-resolution with recurrent multi-scale spatial-temporal transformer
    Sun, Wei
    Kong, Xianguang
    Zhang, Yanning
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 3989 - 4002
  • [2] Multi-Scale Video Super-Resolution Transformer With Polynomial Approximation
    Zhang, Fan
    Chen, Gongguan
    Wang, Hua
    Li, Jinjiang
    Zhang, Caiming
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 4496 - 4506
  • [3] Bidirectional Multi-scale Deformable Attention for Video Super-Resolution
    Zhou, Zhenghua
    Xue, Boxiang
    Wang, Hai
    Zhao, Jianwei
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 27809 - 27830
  • [4] Bidirectional Multi-scale Deformable Attention for Video Super-Resolution
    Zhenghua Zhou
    Boxiang Xue
    Hai Wang
    Jianwei Zhao
    [J]. Multimedia Tools and Applications, 2024, 83 : 27809 - 27830
  • [5] MAPANet: A Multi-Scale Attention-Guided Progressive Aggregation Network for Multi-Contrast MRI Super-Resolution
    Liu, Licheng
    Liu, Tao
    Zhou, Wei
    Wang, Yaonan
    Liu, Min
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2024, 10 : 928 - 940
  • [6] Efficient Multi-Scale Cosine Attention Transformer for Image Super-Resolution
    Chen, Yuzhen
    Wang, Gencheng
    Chen, Rong
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1442 - 1446
  • [7] Space-time super-resolution for satellite video: A joint framework based on multi-scale spatial-temporal transformer
    Xiao, Yi
    Yuan, Qiangqiang
    He, Jiang
    Zhang, Qiang
    Sun, Jing
    Su, Xin
    Wu, Jialian
    Zhang, Liangpei
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 108
  • [8] Global attention guided multi-scale network for face image super-resolution
    Jinlu Zhang
    Mingliang Liu
    Xiaohang Wang
    [J]. Machine Vision and Applications, 2023, 34
  • [9] Global attention guided multi-scale network for face image super-resolution
    Zhang, Jinlu
    Liu, Mingliang
    Wang, Xiaohang
    [J]. MACHINE VISION AND APPLICATIONS, 2023, 34 (06)
  • [10] Multi-scale attention network for image super-resolution
    Wang, Li
    Shen, Jie
    Tang, E.
    Zheng, Shengnan
    Xu, Lizhong
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 80