Fast Online Video Super-Resolution with Deformable Attention Pyramid

被引：15

作者：

Fuoli, Dario ^{[1
]}

Danelljan, Martin ^{[1
]}

Timofte, Radu ^{[1
,2
]}

Van Gool, Luc ^{[1
,3
]}

机构：

[1] Swiss Fed Inst Technol, Comp Vis Lab, Zurich, Switzerland

[2] Univ Wurzburg, CAIDAS, Wurzburg, Germany

[3] Katholieke Univ Leuven, Leuven, Belgium

来源：

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2023年

关键词：

D O I：

10.1109/WACV56688.2023.00178

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Video super-resolution (VSR) has many applications that pose strict causal, real-time, and latency constraints, including video streaming and TV. We address the VSR problem under these settings, which poses additional important challenges since information from future frames is unavailable. Importantly, designing efficient, yet effective frame alignment and fusion modules remain central problems. In this work, we propose a recurrent VSR architecture based on a deformable attention pyramid (DAP). Our DAP aligns and integrates information from the recurrent state into the current frame prediction. To circumvent the computational cost of traditional attention-based methods, we only attend to a limited number of spatial locations, which are dynamically predicted by the DAP. Comprehensive experiments and analysis of the proposed key innovations show the effectiveness of our approach. We significantly reduce processing time and computational complexity in comparison to state-of-the-art methods, while maintaining a high performance. We surpass state-of-the-art method EDVR-M on two standard benchmarks with a speed-up of over 3x.

引用

页码：1735 / 1744

页数：10

共 50 条

[21] Super-resolution image pyramid
Lu, Y
Inamura, M
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (08) : 1436 - 1446
[22] Wavelet Attention Embedding Networks for Video Super-Resolution
Choi, Young-Ju
Lee, Young-Woon
Kim, Byung-Gyu
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7314 - 7320
[23] Pyramid Separable Channel Attention Network for Single Image Super-Resolution
Ma, Congcong
Mi, Jiaqi
Gao, Wanlin
Tao, Sha
CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (03): : 4687 - 4701
[24] A Fast Kernel Regression Framework for Video Super-Resolution
Yu, Wen-sen
Wang, Ming-hui
Chang, Hua-wen
Chen, Shu-qing
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2014, 8 (01): : 232 - 248
[25] Fast Video Super-Resolution via Sparse Coding
Dong, Jiaquan
Zhang, Hong
Yuan, Ding
Chen, Hao
You, Yuhu
SIXTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2014), 2015, 9443
[26] Reference-Based Image Super-Resolution with Deformable Attention Transformer
Cao, Jiezhang
Liang, Jingyun
Zhang, Kai
Li, Yawei
Zhang, Yulun
Wang, Wenguan
Van Gool, Luc
COMPUTER VISION - ECCV 2022, PT XVIII, 2022, 13678 : 325 - 342
[27] Video super-resolution with phase-aided deformable alignment network
Cai, Zhuojun
Chen, Yaowu
Tian, Xiang
Jiang, Rongxin
JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (03)
[28] TDAN: Temporally-Deformable Alignment Network for Video Super-Resolution
Tian, Yapeng
Zhang, Yulun
Fu, Yun
Xu, Chenliang
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3357 - 3366
[29] Fast and Accurate Image Super-Resolution with Deep Laplacian Pyramid Networks
Lai, Wei-Sheng
Huang, Jia-Bin
Ahuja, Narendra
Yang, Ming-Hsuan
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (11) : 2599 - 2613
[30] Learning a Deep Dual Attention Network for Video Super-Resolution
Li, Feng
Bai, Huihui
Zhao, Yao
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 4474 - 4488

← 1 2 3 4 5 →