Fast Online Video Super-Resolution with Deformable Attention Pyramid

Cited by: 15
Authors
Fuoli, Dario [1]
Danelljan, Martin [1]
Timofte, Radu [1, 2]
Van Gool, Luc [1, 3]
Affiliations
[1] Swiss Fed Inst Technol, Comp Vis Lab, Zurich, Switzerland
[2] Univ Wurzburg, CAIDAS, Wurzburg, Germany
[3] Katholieke Univ Leuven, Leuven, Belgium
Source
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2023
Keywords
DOI
10.1109/WACV56688.2023.00178
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
Video super-resolution (VSR) has many applications that pose strict causal, real-time, and latency constraints, including video streaming and TV. We address the VSR problem under these settings, which poses additional important challenges since information from future frames is unavailable. Importantly, designing efficient yet effective frame alignment and fusion modules remains a central problem. In this work, we propose a recurrent VSR architecture based on a deformable attention pyramid (DAP). Our DAP aligns and integrates information from the recurrent state into the current frame prediction. To circumvent the computational cost of traditional attention-based methods, we only attend to a limited number of spatial locations, which are dynamically predicted by the DAP. Comprehensive experiments and analysis of the proposed key innovations show the effectiveness of our approach. We significantly reduce processing time and computational complexity in comparison to state-of-the-art methods, while maintaining high performance. We surpass the state-of-the-art method EDVR-M on two standard benchmarks with a speed-up of over 3x.
Pages: 1735 - 1744
Number of pages: 10
Related papers
50 in total
  • [21] Super-resolution image pyramid
    Lu, Y
    Inamura, M
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (08) : 1436 - 1446
  • [22] Wavelet Attention Embedding Networks for Video Super-Resolution
    Choi, Young-Ju
    Lee, Young-Woon
    Kim, Byung-Gyu
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021 : 7314 - 7320
  • [23] Pyramid Separable Channel Attention Network for Single Image Super-Resolution
    Ma, Congcong
    Mi, Jiaqi
    Gao, Wanlin
    Tao, Sha
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (03) : 4687 - 4701
  • [24] A Fast Kernel Regression Framework for Video Super-Resolution
    Yu, Wen-sen
    Wang, Ming-hui
    Chang, Hua-wen
    Chen, Shu-qing
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2014, 8 (01) : 232 - 248
  • [25] Fast Video Super-Resolution via Sparse Coding
    Dong, Jiaquan
    Zhang, Hong
    Yuan, Ding
    Chen, Hao
    You, Yuhu
    SIXTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2014), 2015, 9443
  • [26] Reference-Based Image Super-Resolution with Deformable Attention Transformer
    Cao, Jiezhang
    Liang, Jingyun
    Zhang, Kai
    Li, Yawei
    Zhang, Yulun
    Wang, Wenguan
    Van Gool, Luc
    COMPUTER VISION - ECCV 2022, PT XVIII, 2022, 13678 : 325 - 342
  • [27] Video super-resolution with phase-aided deformable alignment network
    Cai, Zhuojun
    Chen, Yaowu
    Tian, Xiang
    Jiang, Rongxin
    JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (03)
  • [28] TDAN: Temporally-Deformable Alignment Network for Video Super-Resolution
    Tian, Yapeng
    Zhang, Yulun
    Fu, Yun
    Xu, Chenliang
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020 : 3357 - 3366
  • [29] Fast and Accurate Image Super-Resolution with Deep Laplacian Pyramid Networks
    Lai, Wei-Sheng
    Huang, Jia-Bin
    Ahuja, Narendra
    Yang, Ming-Hsuan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (11) : 2599 - 2613
  • [30] Learning a Deep Dual Attention Network for Video Super-Resolution
    Li, Feng
    Bai, Huihui
    Zhao, Yao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 4474 - 4488