Dual Attention with the Self-Attention Alignment for Efficient Video Super-resolution

被引:0
|
作者
Yuezhong Chu
Yunan Qiao
Heng Liu
Jungong Han
机构
[1] Anhui University of Technology,School of Computer Science and Technology
[2] Aberystwyth University,Department of Computer Science
来源
Cognitive Computation | 2022年 / 14卷
关键词
Video super-resolution; Dual attention; Self-attention alignment; FLOPS;
D O I
暂无
中图分类号
学科分类号
摘要
By selectively enhancing the features extracted from convolution networks, the attention mechanism has shown its effectiveness for low-level visual tasks, especially for image super-resolution (SR). However, due to the spatiotemporal continuity of video sequences, simply applying image attention to a video does not seem to obtain good SR results. At present, there is still a lack of suitable attention structure to achieve efficient video SR. In this work, building upon the dual attention, i.e., position attention and channel attention, we proposed deep dual attention, underpinned by self-attention alignment (DASAA), for video SR. Specifically, we start by constructing a dual attention module (DAM) to strengthen the acquired spatiotemporal features and adopt a self-attention structure with the morphological mask to achieve attention alignment. Then, on top of the attention features, we utilize the up-sampling operation to reconstruct the super-resolved video images and introduce the LSTM (long short-time memory) network to guarantee the coherent consistency of the generated video frames both temporally and spatially. Experimental results and comparisons on the actual Youku-VESR dataset and the typical benchmark dataset-Vimeo-90 k demonstrate that our proposed approach achieves the best video SR effect while taking the least amount of computation. Specifically, in the Youku-VESR dataset, our proposed approach achieves a test PSNR of 35.290db and a SSIM of 0.939, respectively. In the Vimeo-90 k dataset, the PSNR/SSIM indexes of our approach are 32.878db and 0.774. Moreover, the FLOPS (float-point operations per second) of our approach is as low as 6.39G. The proposed DASAA method surpasses all video SR algorithms in the comparison. It is also revealed that there is no linear relationship between positional attention and channel attention. It suggests that our DASAA with LSTM coherent consistency architecture may have great potential for many low-level vision video applications.
引用
收藏
页码:1140 / 1151
页数:11
相关论文
共 50 条
  • [41] From Local to Global: Efficient Dual Attention Mechanism for Single Image Super-Resolution
    Zhang, Pei
    Lam, Edmund Y.
    IEEE ACCESS, 2021, 9 : 114957 - 114964
  • [42] Dynamic dual attention iterative network for image super-resolution
    Feng, Hao
    Wang, Liejun
    Cheng, Shuli
    Du, Anyu
    Li, Yongming
    APPLIED INTELLIGENCE, 2022, 52 (07) : 8189 - 8208
  • [43] Dynamic dual attention iterative network for image super-resolution
    Hao Feng
    Liejun Wang
    Shuli Cheng
    Anyu Du
    Yongming Li
    Applied Intelligence, 2022, 52 : 8189 - 8208
  • [44] Dual-Camera Super-Resolution with Aligned Attention Modules
    Wang, Tengfei
    Xie, Jiaxin
    Sun, Wenxiu
    Yan, Qiong
    Chen, Qifeng
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1981 - 1990
  • [45] Efficient residual attention network for single image super-resolution
    Fangwei Hao
    Taiping Zhang
    Linchang Zhao
    Yuanyan Tang
    Applied Intelligence, 2022, 52 : 652 - 661
  • [46] Efficient residual attention network for single image super-resolution
    Hao, Fangwei
    Zhang, Taiping
    Zhao, Linchang
    Tang, Yuanyan
    APPLIED INTELLIGENCE, 2022, 52 (01) : 652 - 661
  • [47] Efficient Global Attention Networks for Image Super-Resolution Reconstruction
    Wang Qingqing
    Xin Yuelan
    Zhao Jia
    Guo Jiang
    Wang Haochen
    LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (10)
  • [48] A generative adversarial network model fused with a self-attention mechanism for the super-resolution reconstruction of ancient murals
    Cao, Jianfang
    Hu, Xiaohui
    Cui, Hongyan
    Liang, Yunchuan
    Chen, Zeyu
    IET IMAGE PROCESSING, 2023, 17 (08) : 2336 - 2349
  • [49] Self-Attention Based Video Summarization
    Li Y.
    Wang J.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2020, 32 (04): : 652 - 659
  • [50] Bidirectional Multi-scale Deformable Attention for Video Super-Resolution
    Zhou, Zhenghua
    Xue, Boxiang
    Wang, Hai
    Zhao, Jianwei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 27809 - 27830