Fast Online Video Super-Resolution with Deformable Attention Pyramid

Cited by: 15

Authors:

Fuoli, Dario [1]

Danelljan, Martin [1]

Timofte, Radu [1,2]

Van Gool, Luc [1,3]

Affiliations:

[1] Swiss Fed Inst Technol, Comp Vis Lab, Zurich, Switzerland

[2] Univ Wurzburg, CAIDAS, Wurzburg, Germany

[3] Katholieke Univ Leuven, Leuven, Belgium

Source:

2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) | 2023

DOI:

10.1109/WACV56688.2023.00178

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Video super-resolution (VSR) has many applications that pose strict causal, real-time, and latency constraints, including video streaming and TV. We address the VSR problem under these settings, which poses additional important challenges since information from future frames is unavailable. Importantly, designing efficient yet effective frame alignment and fusion modules remains a central problem. In this work, we propose a recurrent VSR architecture based on a deformable attention pyramid (DAP). Our DAP aligns and integrates information from the recurrent state into the current frame prediction. To circumvent the computational cost of traditional attention-based methods, we only attend to a limited number of spatial locations, which are dynamically predicted by the DAP. Comprehensive experiments and analysis of the proposed key innovations show the effectiveness of our approach. We significantly reduce processing time and computational complexity in comparison to state-of-the-art methods, while maintaining high performance. We surpass the state-of-the-art method EDVR-M on two standard benchmarks with a speed-up of over 3x.
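
The idea of attending to only a limited number of dynamically chosen locations can be illustrated with a minimal sketch. This is not the authors' DAP: it uses single-channel scalar features, takes the sampling offsets as a plain input (in the paper they are predicted by the network), and scores each sample with a simple product. The names `bilinear_sample`, `softmax`, and `sparse_attend` are hypothetical and chosen only for this illustration.

```python
import math


def bilinear_sample(feat, y, x):
    """Sample a 2D grid (list of rows of floats) at a fractional (y, x),
    clamping coordinates to the grid border."""
    h, w = len(feat), len(feat[0])
    y = min(max(y, 0.0), h - 1.0)
    x = min(max(x, 0.0), w - 1.0)
    y0, x0 = int(math.floor(y)), int(math.floor(x))
    y1, x1 = min(y0 + 1, h - 1), min(x0 + 1, w - 1)
    wy, wx = y - y0, x - x0
    top = feat[y0][x0] * (1 - wx) + feat[y0][x1] * wx
    bottom = feat[y1][x0] * (1 - wx) + feat[y1][x1] * wx
    return top * (1 - wy) + bottom * wy


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sparse_attend(state, query, y, x, offsets):
    """Attend from position (y, x) to len(offsets) sampled locations in
    `state` instead of to every location in the map.

    Assumption: `offsets` is given; a real model would predict it."""
    values = [bilinear_sample(state, y + dy, x + dx) for dy, dx in offsets]
    scores = [query * v for v in values]  # scalar stand-in for a dot product
    weights = softmax(scores)
    return sum(wt * v for wt, v in zip(weights, values))


if __name__ == "__main__":
    state = [[0.0, 1.0], [2.0, 3.0]]  # toy recurrent-state feature map
    print(sparse_attend(state, 0.5, 0, 0, [(0.0, 0.0), (0.5, 0.5), (1.0, 1.0)]))
```

With K offsets per position, the work per output pixel scales with K rather than with the number of pixels in the feature map, which is the cost argument the abstract makes against dense attention.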


Pages: 1735-1744

Page count: 10

