Temporal Consistency Learning of Inter-Frames for Video Super-Resolution

被引:20
|
作者
Liu, Meiqin [1 ,2 ]
Jin, Shuo [1 ,2 ]
Yao, Chao [3 ]
Lin, Chunyu [1 ,2 ]
Zhao, Yao [1 ,2 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Beijing Key Lab Adv Informat Sci & Network Technol, Beijing 100044, Peoples R China
[3] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Superresolution; Circuit stability; Optical flow; Image restoration; Image reconstruction; Degradation; Convolution; Bidirectional motion estimation; temporal consistency; self-alignment; video super-resolution; FUSION NETWORK;
D O I
10.1109/TCSVT.2022.3214538
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Video super-resolution (VSR) is a task that aims to reconstruct high-resolution (HR) frames from the low-resolution (LR) reference frame and multiple neighboring frames. The vital operation is to utilize the relative misaligned frames for the current frame reconstruction and preserve the consistency of the results. Existing methods generally explore information propagation and frame alignment to improve the performance of VSR. However, few studies focus on the temporal consistency of inter-frames. In this paper, we propose a Temporal Consistency learning Network (TCNet) for VSR in an end-to-end manner, to enhance the consistency of the reconstructed videos. A spatio-temporal stability module is designed to learn the self-alignment from inter-frames. Especially, the correlative matching is employed to exploit the spatial dependency from each frame to maintain structural stability. Moreover, a self-attention mechanism is utilized to learn the temporal correspondence to implement an adaptive warping operation for temporal consistency among multi-frames. Besides, a hybrid recurrent architecture is designed to leverage short-term and long-term information. We further present a progressive fusion module to perform a multistage fusion of spatio-temporal features. And the final reconstructed frames are refined by these fused features. Objective and subjective results of various experiments demonstrate that TCNet has superior performance on different benchmark datasets, compared to several state-of-the-art methods.
引用
收藏
页码:1507 / 1520
页数:14
相关论文
共 50 条
  • [1] ADAPTIVE INCREMENTAL VIDEO SUPER-RESOLUTION WITH TEMPORAL CONSISTENCY
    Su, Heng
    Wu, Ying
    Zhou, Jie
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 1149 - 1152
  • [2] Temporal Kernel Consistency for Blind Video Super-Resolution
    Xiang, Lichuan
    Lee, Royson
    Abdelfattah, Mohamed S.
    Lane, Nicholas D.
    Wen, Hongkai
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3470 - 3479
  • [3] VSRDiff: Learning Inter-Frame Temporal Coherence in Diffusion Model for Video Super-Resolution
    Liu, Linlin
    Niu, Lele
    Tang, Jun
    Ding, Yong
    IEEE ACCESS, 2025, 13 : 11447 - 11462
  • [4] VSRDiff: Learning Inter-Frame Temporal Coherence in Diffusion Model for Video Super-Resolution
    Liu, Linlin
    Niu, Lele
    Tang, Jun
    Ding, Yong
    IEEE ACCESS, 2025, 13 : 11447 - 11462
  • [5] Learning Temporal Dynamics for Video Super-Resolution: A Deep Learning Approach
    Liu, Ding
    Wang, Zhaowen
    Fan, Yuchen
    Liu, Xianming
    Wang, Zhangyang
    Chang, Shiyu
    Wang, Xinchao
    Huang, Thomas S.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (07) : 3432 - 3445
  • [6] Video Coding With Key Frames Guided Super-Resolution
    Zhou, Qiang
    Song, Li
    Zhang, Wenjun
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING-PCM 2010, PT II, 2010, 6298 : 309 - 318
  • [7] Learning a spatial-temporal symmetry network for video super-resolution
    Xiaohang Wang
    Mingliang Liu
    Pengying Wei
    Applied Intelligence, 2023, 53 : 3530 - 3544
  • [8] Learning a spatial-temporal symmetry network for video super-resolution
    Wang, Xiaohang
    Liu, Mingliang
    Wei, Pengying
    APPLIED INTELLIGENCE, 2023, 53 (03) : 3530 - 3544
  • [9] Abnormal Video Sections Detection Based on Inter-Frames Information
    Wang, Wei
    Zhang, Peng
    Wang, Runsheng
    THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA AND UBIQUITOUS ENGINEERING (MUE 2009), 2009, : 529 - +
  • [10] SUPER-RESOLUTION OF VIDEO USING KEY FRAMES AND MOTION ESTIMATION
    Brandi, Fernanda
    de Queiroz, Ricardo
    Mukherjee, Debargha
    2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 321 - 324