Bidirectional Multi-scale Deformable Attention for Video Super-Resolution

被引:0
|
作者
Zhou, Zhenghua [1 ]
Xue, Boxiang [2 ]
Wang, Hai [3 ]
Zhao, Jianwei [4 ]
机构
[1] Zhejiang Univ Finance & Econ, Sch Data Sci, Hangzhou 310018, Peoples R China
[2] China Jiliang Univ, Coll Sci, Dept Data Sci, Hangzhou 310018, Zhejiang, Peoples R China
[3] Murdoch Univ, Discipline Engn & Energy, Perth, WA 6150, Australia
[4] China Jiliang Univ, Coll Informat Engn, Hangzhou 310018, Zhejiang, Peoples R China
关键词
Video super-resolution; Multi-scale deformable convolution; Multi-scale attention; Bidirectional propagation;
D O I
10.1007/s11042-023-16072-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video super-resolution aims to generate a high-resolution video frame from its low-resolution video sequences. Video super-resolution is still a challenging problem due to performing the temporal frame alignment and spatial feature fusion during the process of spatial-temporal modeling. Existing deep learning based methods have limitations in handling accurate alignment and effective fusion of frames with multi-scale feature information. In this paper, we propose Bidirectional Multi-scale Deformable Attention (BMDA) for video Super-Resolution in terms of propagation, alignment and fusion. More specifically, the developed Deformable Alignment Module (DAM) in BMDA contains two kinds of modules: Multi-scale Deformable Convolution Module (MDCM) and Multi-scale Attention Module (MAM). MDCM is leveraged to deal with the offset information in different scales and align adjacent frames at the feature level, improving the robustness of the alignment among adjacent frames. MAM is designed to extract the local and global features of the aligned features for aggregation, such that the feature information compensation between pixels is achieved. Additionally, in order to make full use of shallow features, dense connection structure between each layer is adopted in the framework of bidirectional propagation to achieve better visual performance on video super-resolution. In particular, our proposed BDAM outperforms BasicVSR by up to 1.28dB in PSNR when batch size is set to 2. Experimental results on public video benchmark datasets demonstrate that the proposed method can achieve superior performance on large motion videos as compared with the state-of-the art methods.
引用
收藏
页码:27809 / 27830
页数:22
相关论文
共 50 条
  • [1] Bidirectional Multi-scale Deformable Attention for Video Super-Resolution
    Zhenghua Zhou
    Boxiang Xue
    Hai Wang
    Jianwei Zhao
    [J]. Multimedia Tools and Applications, 2024, 83 : 27809 - 27830
  • [2] Multi-scale attention network for image super-resolution
    Wang, Li
    Shen, Jie
    Tang, E.
    Zheng, Shengnan
    Xu, Lizhong
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 80
  • [3] Accurate and lightweight MRI super-resolution via multi-scale bidirectional fusion attention network
    Xu, Ling
    Li, Guanyao
    Chen, Qiaochuan
    [J]. PLOS ONE, 2022, 17 (12):
  • [4] Accurate and Lightweight MRI Super-Resolution Via Multi-Scale Bidirectional Fusion Attention Network
    Xu, Ling
    Li, Guangyao
    Chen, Qiaochuan
    [J]. SSRN, 2022,
  • [5] TBNet: Stereo Image Super-Resolution with Multi-Scale Attention
    Zhu, Jiyang
    Han, Xue
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2023, 32 (18)
  • [6] Image super-resolution reconstruction with multi-scale attention fusion
    Chen, Chun-yi
    Wu, Xin-yi
    Hu, Xiao-juan
    Yu, Hai-yang
    [J]. CHINESE OPTICS, 2023, 16 (05) : 1034 - 1044
  • [7] Multi-Scale Video Super-Resolution Transformer With Polynomial Approximation
    Zhang, Fan
    Chen, Gongguan
    Wang, Hua
    Li, Jinjiang
    Zhang, Caiming
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 4496 - 4506
  • [8] Multi-scale Residual Dense Block for Video Super-Resolution
    Cui, Hetao
    Sun, Quansen
    [J]. INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: VISUAL DATA ENGINEERING, PT I, 2019, 11935 : 424 - 434
  • [9] Attention-guided video super-resolution with recurrent multi-scale spatial–temporal transformer
    Wei Sun
    Xianguang Kong
    Yanning Zhang
    [J]. Complex & Intelligent Systems, 2023, 9 : 3989 - 4002
  • [10] Multi-scale deformable transformer for multi-contrast knee MRI super-resolution
    Zou, Beiji
    Ji, Zexin
    Zhu, Chengzhang
    Dai, Yulan
    Zhang, Wensheng
    Kui, Xiaoyan
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79