Dual Attention with the Self-Attention Alignment for Efficient Video Super-resolution

Authors
Yuezhong Chu
Yunan Qiao
Heng Liu
Jungong Han
Affiliations
[1] Anhui University of Technology, School of Computer Science and Technology
[2] Aberystwyth University, Department of Computer Science
Source
Cognitive Computation | 2022 / Vol. 14
Keywords
Video super-resolution; Dual attention; Self-attention alignment; FLOPS;
DOI
Not available
Abstract
By selectively enhancing the features extracted from convolution networks, the attention mechanism has shown its effectiveness for low-level visual tasks, especially for image super-resolution (SR). However, due to the spatiotemporal continuity of video sequences, simply applying image attention to a video does not seem to obtain good SR results. At present, there is still a lack of a suitable attention structure to achieve efficient video SR. In this work, building upon dual attention, i.e., position attention and channel attention, we propose deep dual attention underpinned by self-attention alignment (DASAA) for video SR. Specifically, we start by constructing a dual attention module (DAM) to strengthen the acquired spatiotemporal features and adopt a self-attention structure with a morphological mask to achieve attention alignment. Then, on top of the attention features, we use an up-sampling operation to reconstruct the super-resolved video images and introduce a long short-term memory (LSTM) network to guarantee the coherent consistency of the generated video frames both temporally and spatially. Experimental results and comparisons on the real-world Youku-VESR dataset and the typical benchmark dataset Vimeo-90K demonstrate that our proposed approach achieves the best video SR effect while requiring the least amount of computation. Specifically, on the Youku-VESR dataset, our approach achieves a test PSNR of 35.290 dB and an SSIM of 0.939. On the Vimeo-90K dataset, the PSNR and SSIM of our approach are 32.878 dB and 0.774. Moreover, the FLOPS (floating-point operations per second) of our approach is as low as 6.39G. The proposed DASAA method surpasses all video SR algorithms in the comparison. The results also reveal that there is no linear relationship between positional attention and channel attention. This suggests that our DASAA with the LSTM coherent consistency architecture may have great potential for many low-level vision video applications.
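The PSNR values quoted in the abstract follow the standard peak signal-to-noise ratio definition, 10 * log10(MAX^2 / MSE). The sketch below illustrates that formula only and is not the authors' evaluation code. The function name `psnr`, the nested-list image representation, and the 8-bit peak value of 255 are assumptions made for illustration; the abstract does not state the peak value, color space, or cropping used in the reported results.

```python
import math


def psnr(reference, restored, max_value=255.0):
    """Return PSNR in dB between two equally sized images.

    Images are nested lists of pixel rows (a hypothetical representation
    chosen to keep this sketch dependency-free). max_value=255.0 assumes
    8-bit pixel intensities.
    """
    # Flatten both images into one-dimensional pixel lists.
    ref_flat = [p for row in reference for p in row]
    out_flat = [p for row in restored for p in row]
    if not ref_flat or len(ref_flat) != len(out_flat):
        raise ValueError("images must be non-empty and the same size")

    # Mean squared error over all pixels.
    mse = sum((a - b) ** 2 for a, b in zip(ref_flat, out_flat)) / len(ref_flat)
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded

    return 10.0 * math.log10(max_value ** 2 / mse)
```

Higher values mean the restored frame is closer to the reference. For video, the per-frame PSNR is commonly averaged over all frames, though the abstract does not say which aggregation was used.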
Pages: 1140-1151
Page count: 11