Fast Online Video Super-Resolution with Deformable Attention Pyramid

被引:15
|
作者
Fuoli, Dario [1 ]
Danelljan, Martin [1 ]
Timofte, Radu [1 ,2 ]
Van Gool, Luc [1 ,3 ]
机构
[1] Swiss Fed Inst Technol, Comp Vis Lab, Zurich, Switzerland
[2] Univ Wurzburg, CAIDAS, Wurzburg, Germany
[3] Katholieke Univ Leuven, Leuven, Belgium
关键词
D O I
10.1109/WACV56688.2023.00178
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video super-resolution (VSR) has many applications that pose strict causal, real-time, and latency constraints, including video streaming and TV. We address the VSR problem under these settings, which poses additional important challenges since information from future frames is unavailable. Importantly, designing efficient, yet effective frame alignment and fusion modules remain central problems. In this work, we propose a recurrent VSR architecture based on a deformable attention pyramid (DAP). Our DAP aligns and integrates information from the recurrent state into the current frame prediction. To circumvent the computational cost of traditional attention-based methods, we only attend to a limited number of spatial locations, which are dynamically predicted by the DAP. Comprehensive experiments and analysis of the proposed key innovations show the effectiveness of our approach. We significantly reduce processing time and computational complexity in comparison to state-of-the-art methods, while maintaining a high performance. We surpass state-of-the-art method EDVR-M on two standard benchmarks with a speed-up of over 3x.
引用
收藏
页码:1735 / 1744
页数:10
相关论文
共 50 条
  • [1] FLOW-GUIDED DEFORMABLE ATTENTION NETWORK FOR FAST ONLINE VIDEO SUPER-RESOLUTION
    Yang, Xi
    Zhang, Xindong
    Zhang, Lei
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 390 - 394
  • [2] Bidirectional Multi-scale Deformable Attention for Video Super-Resolution
    Zhou, Zhenghua
    Xue, Boxiang
    Wang, Hai
    Zhao, Jianwei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 27809 - 27830
  • [3] Bidirectional Multi-scale Deformable Attention for Video Super-Resolution
    Zhou, Zhenghua
    Xue, Boxiang
    Wang, Hai
    Zhao, Jianwei
    Multimedia Tools and Applications, 83 (09): : 27809 - 27830
  • [4] Deformable Spatial-Temporal Attention for Lightweight Video Super-Resolution
    Xue, Tong
    Huang, Xinyi
    Li, Dengshi
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT X, 2024, 14434 : 482 - 493
  • [5] Bidirectional Multi-scale Deformable Attention for Video Super-Resolution
    Zhenghua Zhou
    Boxiang Xue
    Hai Wang
    Jianwei Zhao
    Multimedia Tools and Applications, 2024, 83 : 27809 - 27830
  • [6] STDAN: Deformable Attention Network for Space-Time Video Super-Resolution
    Wang, Hai
    Xiang, Xiaoyu
    Tian, Yapeng
    Yang, Wenming
    Liao, Qingmin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 10606 - 10616
  • [7] STDAN: Deformable Attention Network for Space-Time Video Super-Resolution
    Wang, Hai
    Xiang, Xiaoyu
    Tian, Yapeng
    Yang, Wenming
    Liao, Qingmin
    arXiv, 2022,
  • [8] Deformable Attention Network for Efficient Space-Time Video Super-Resolution
    Wang, Hua
    Chamchong, Rapeeporn
    Chomphuwiset, Phatthanaphong
    Pawara, Pornntiwa
    IET IMAGE PROCESSING, 2025, 19 (01)
  • [9] Understanding Deformable Alignment in Video Super-Resolution
    Chan, Kelvin C. K.
    Wang, Xintao
    Yu, Ke
    Dong, Chao
    Loy, Chen Change
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 973 - 981
  • [10] Deformable transformer for endoscopic video super-resolution
    Song, Xiaowei
    Tang, Hui
    Yang, Chunfeng
    Zhou, Guangquan
    Wang, Yangang
    Huang, Xinjun
    Hua, Jie
    Coatrieux, Gouenou
    He, Xiaopu
    Chen, Yang
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 77