DSTnet: Deformable Spatio-Temporal Convolutional Residual Network for Video Super-Resolution

被引：2

作者：

Khan, Anusha ^{[1
]}

Sargano, Allah Bux ^{[1
]}

Habib, Zulfiqar ^{[1
]}

机构：

[1] COMSATS Univ Islamabad, Dept Comp Sci, Lahore 54000, Pakistan

来源：

MATHEMATICS | 2021年 / 9卷 / 22期

基金：

欧盟地平线“2020”;

关键词：

video super-resolution; deformable convolution; 3D convolution; spatio-temporal; residual neural network; deep learning; IMAGE SUPERRESOLUTION; ENHANCEMENT;

D O I：

10.3390/math9222873

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

Video super-resolution (VSR) aims at generating high-resolution (HR) video frames with plausible and temporally consistent details using their low-resolution (LR) counterparts, and neighboring frames. The key challenge for VSR lies in the effective exploitation of intra-frame spatial relation and temporal dependency between consecutive frames. Many existing techniques utilize spatial and temporal information separately and compensate motion via alignment. These methods cannot fully exploit the spatio-temporal information that significantly affects the quality of resultant HR videos. In this work, a novel deformable spatio-temporal convolutional residual network (DSTnet) is proposed to overcome the issues of separate motion estimation and compensation methods for VSR. The proposed framework consists of 3D convolutional residual blocks decomposed into spatial and temporal (2+1) D streams. This decomposition can simultaneously utilize input video's spatial and temporal features without a separate motion estimation and compensation module. Furthermore, the deformable convolution layers have been used in the proposed model that enhances its motion-awareness capability. Our contribution is twofold; firstly, the proposed approach can overcome the challenges in modeling complex motions by efficiently using spatio-temporal information. Secondly, the proposed model has fewer parameters to learn than state-of-the-art methods, making it a computationally lean and efficient framework for VSR. Experiments are conducted on a benchmark Vid4 dataset to evaluate the efficacy of the proposed approach. The results demonstrate that the proposed approach achieves superior quantitative and qualitative performance compared to the state-of-the-art methods.

引用

页数：15

共 50 条

[1] Residual Invertible Spatio-Temporal Network for Video Super-Resolution
Zhu, Xiaobin
Li, Zhuangzi
Zhang, Xiao-Yu
Li, Changsheng
Liu, Yaqi
Xue, Ziyu
[J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 5981 - 5988
[2] Fast Spatio-Temporal Residual Network for Video Super-Resolution
Li, Sheng
He, Fengxiang
Du, Bo
Zhang, Lefei
Xu, Yonghao
Tao, Dacheng
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10514 - 10523
[3] Spatio-Temporal Fusion Network for Video Super-Resolution
Li, Huabin
Zhang, Pingjian
[J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[4] Video super-resolution based on a spatio-temporal matching network
Zhu, Xiaobin
Li, Zhuangzi
Lou, Jungang
Shen, Qing
[J]. PATTERN RECOGNITION, 2021, 110
[5] Grouped Spatio-Temporal Alignment Network for Video Super-Resolution
Lu, Mingxuan
Zhang, Peng
[J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2193 - 2197
[6] Video Super-Resolution via a Spatio-Temporal Alignment Network
Wen, Weilei
Ren, Wenqi
Shi, Yinghuan
Nie, Yunfeng
Zhang, Jingang
Cao, Xiaochun
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1761 - 1773
[7] Video super-resolution reconstruction based on deep convolutional neural network and spatio-temporal similarity
Li Linghui
Du Junping
Liang Meiyu
Ren Nan
Fan Dan
[J]. The Journal of China Universities of Posts and Telecommunications, 2016, 23 (05) : 68 - 81
[8] Video super-resolution reconstruction based on deep convolutional neural network and spatio-temporal similarity
Li Linghui
Du Junping
Liang Meiyu
Ren Nan
Fan Dan
[J]. The Journal of China Universities of Posts and Telecommunications, 2016, (05) : 68 - 81
[9] Deformable and residual convolutional network for image super-resolution
Zhang, Yan
Sun, Yemei
Liu, Shudong
[J]. APPLIED INTELLIGENCE, 2022, 52 (01) : 295 - 304
[10] Deformable and residual convolutional network for image super-resolution
Yan Zhang
Yemei Sun
Shudong Liu
[J]. Applied Intelligence, 2022, 52 : 295 - 304

← 1 2 3 4 5 →