DSTnet: Deformable Spatio-Temporal Convolutional Residual Network for Video Super-Resolution

被引:2
|
作者
Khan, Anusha [1 ]
Sargano, Allah Bux [1 ]
Habib, Zulfiqar [1 ]
机构
[1] COMSATS Univ Islamabad, Dept Comp Sci, Lahore 54000, Pakistan
基金
欧盟地平线“2020”;
关键词
video super-resolution; deformable convolution; 3D convolution; spatio-temporal; residual neural network; deep learning; IMAGE SUPERRESOLUTION; ENHANCEMENT;
D O I
10.3390/math9222873
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Video super-resolution (VSR) aims at generating high-resolution (HR) video frames with plausible and temporally consistent details using their low-resolution (LR) counterparts, and neighboring frames. The key challenge for VSR lies in the effective exploitation of intra-frame spatial relation and temporal dependency between consecutive frames. Many existing techniques utilize spatial and temporal information separately and compensate motion via alignment. These methods cannot fully exploit the spatio-temporal information that significantly affects the quality of resultant HR videos. In this work, a novel deformable spatio-temporal convolutional residual network (DSTnet) is proposed to overcome the issues of separate motion estimation and compensation methods for VSR. The proposed framework consists of 3D convolutional residual blocks decomposed into spatial and temporal (2+1) D streams. This decomposition can simultaneously utilize input video's spatial and temporal features without a separate motion estimation and compensation module. Furthermore, the deformable convolution layers have been used in the proposed model that enhances its motion-awareness capability. Our contribution is twofold; firstly, the proposed approach can overcome the challenges in modeling complex motions by efficiently using spatio-temporal information. Secondly, the proposed model has fewer parameters to learn than state-of-the-art methods, making it a computationally lean and efficient framework for VSR. Experiments are conducted on a benchmark Vid4 dataset to evaluate the efficacy of the proposed approach. The results demonstrate that the proposed approach achieves superior quantitative and qualitative performance compared to the state-of-the-art methods.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Residual Invertible Spatio-Temporal Network for Video Super-Resolution
    Zhu, Xiaobin
    Li, Zhuangzi
    Zhang, Xiao-Yu
    Li, Changsheng
    Liu, Yaqi
    Xue, Ziyu
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 5981 - 5988
  • [2] Fast Spatio-Temporal Residual Network for Video Super-Resolution
    Li, Sheng
    He, Fengxiang
    Du, Bo
    Zhang, Lefei
    Xu, Yonghao
    Tao, Dacheng
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10514 - 10523
  • [3] Spatio-Temporal Fusion Network for Video Super-Resolution
    Li, Huabin
    Zhang, Pingjian
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [4] Video super-resolution based on a spatio-temporal matching network
    Zhu, Xiaobin
    Li, Zhuangzi
    Lou, Jungang
    Shen, Qing
    [J]. PATTERN RECOGNITION, 2021, 110
  • [5] Grouped Spatio-Temporal Alignment Network for Video Super-Resolution
    Lu, Mingxuan
    Zhang, Peng
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2193 - 2197
  • [6] Video Super-Resolution via a Spatio-Temporal Alignment Network
    Wen, Weilei
    Ren, Wenqi
    Shi, Yinghuan
    Nie, Yunfeng
    Zhang, Jingang
    Cao, Xiaochun
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1761 - 1773
  • [7] Video super-resolution reconstruction based on deep convolutional neural network and spatio-temporal similarity
    Li Linghui
    Du Junping
    Liang Meiyu
    Ren Nan
    Fan Dan
    [J]. The Journal of China Universities of Posts and Telecommunications, 2016, 23 (05) - 81
  • [8] Video super-resolution reconstruction based on deep convolutional neural network and spatio-temporal similarity
    Li Linghui
    Du Junping
    Liang Meiyu
    Ren Nan
    Fan Dan
    [J]. The Journal of China Universities of Posts and Telecommunications, 2016, (05) : 68 - 81
  • [9] Deformable and residual convolutional network for image super-resolution
    Zhang, Yan
    Sun, Yemei
    Liu, Shudong
    [J]. APPLIED INTELLIGENCE, 2022, 52 (01) : 295 - 304
  • [10] Deformable and residual convolutional network for image super-resolution
    Yan Zhang
    Yemei Sun
    Shudong Liu
    [J]. Applied Intelligence, 2022, 52 : 295 - 304