Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation

被引:468
|
作者
Caballero, Jose [1 ]
Ledig, Christian [1 ]
Aitken, Andrew [1 ]
Acosta, Alejandro [1 ]
Totz, Johannes [1 ]
Wang, Zehan [1 ]
Shi, Wenzhe [1 ]
机构
[1] Twitter, San Francisco, CA 94103 USA
关键词
IMAGE SUPERRESOLUTION; QUALITY ASSESSMENT;
D O I
10.1109/CVPR.2017.304
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional neural networks have enabled accurate image super-resolution in real-time. However, recent attempts to benefit from temporal correlations in video super-resolution have been limited to naive or inefficient architectures. In this paper, we introduce spatio-temporal sub-pixel convolution networks that effectively exploit temporal redundancies and improve reconstruction accuracy while maintaining real-time speed. Specifically, we discuss the use of early fusion, slow fusion and 3D convolutions for the joint processing of multiple consecutive video frames. We also propose a novel joint motion compensation and video super-resolution algorithm that is orders of magnitude more efficient than competing methods, relying on a fast multi-resolution spatial transformer module that is end-to-end trainable. These contributions provide both higher accuracy and temporally more consistent videos, which we confirm qualitatively and quantitatively. Relative to single-frame models, spatio-temporal networks can either reduce the computational cost by 30% whilst maintaining the same quality or provide a 0.2dB gain for a similar computational cost. Results on publicly available datasets demonstrate that the proposed algorithms surpass current state-of-the-art performance in both accuracy and efficiency.
引用
收藏
页码:2848 / 2857
页数:10
相关论文
共 50 条
  • [1] Real-Time Video Super-Resolution with Spatio-Temporal Modeling and Redundancy-Aware Inference
    Wang, Wenhao
    Liu, Zhenbing
    Lu, Haoxiang
    Lan, Rushi
    Zhang, Zhaoyuan
    [J]. SENSORS, 2023, 23 (18)
  • [2] Spatio-Temporal Fusion Network for Video Super-Resolution
    Li, Huabin
    Zhang, Pingjian
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [3] Spatio-Temporal Super-Resolution from Compressed Video Employing Global and Local Motion
    Chen, Yue-Meng
    Bajic, Ivan V.
    [J]. 2011 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING (PACRIM), 2011, : 907 - 912
  • [4] Super-Resolution Reconstruction for Spatio-Temporal Resolution Enhancement of Video Sequences
    Haseyama, Miki
    Izumi, Daisuke
    Takizawa, Makoto
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (09): : 2355 - 2358
  • [5] Residual Invertible Spatio-Temporal Network for Video Super-Resolution
    Zhu, Xiaobin
    Li, Zhuangzi
    Zhang, Xiao-Yu
    Li, Changsheng
    Liu, Yaqi
    Xue, Ziyu
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 5981 - 5988
  • [6] Video super-resolution based on a spatio-temporal matching network
    Zhu, Xiaobin
    Li, Zhuangzi
    Lou, Jungang
    Shen, Qing
    [J]. PATTERN RECOGNITION, 2021, 110
  • [7] Grouped Spatio-Temporal Alignment Network for Video Super-Resolution
    Lu, Mingxuan
    Zhang, Peng
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2193 - 2197
  • [8] Video Super-Resolution via a Spatio-Temporal Alignment Network
    Wen, Weilei
    Ren, Wenqi
    Shi, Yinghuan
    Nie, Yunfeng
    Zhang, Jingang
    Cao, Xiaochun
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1761 - 1773
  • [9] Fast Spatio-Temporal Residual Network for Video Super-Resolution
    Li, Sheng
    He, Fengxiang
    Du, Bo
    Zhang, Lefei
    Xu, Yonghao
    Tao, Dacheng
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10514 - 10523
  • [10] Patch-based spatio-temporal super-resolution for video with non-rigid motion
    Salvador, Jordi
    Kochale, Axel
    Schweidler, Siegfried
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2013, 28 (05) : 483 - 493