STIFS: Spatio-Temporal Input Frame Selection for Learning-based Video Super-Resolution Models

被引:0
|
作者
Baniya, Arbind Agrahari [1 ]
Lee, Tsz-Kwan [1 ]
Eklund, Peter W. [1 ]
Aryal, Sunil [1 ]
机构
[1] Deakin Univ, Sch IT, Geelong, Vic, Australia
关键词
High Definition Video; Image Analysis; Image Quality; Video Signal Processing; Super-resolution;
D O I
10.5220/0011339900003289
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Deep learning Video Super-Resolution (VSR) methods rely on learning spatio-temporal correlations between a target frame and its neighbouring frames in a given temporal radius to generate a high-resolution output. Among recent VSR models, a sliding window mechanism is popularly adopted by picking a fixed number of consecutive frames as neighbouring frames for a given target frame. This results in a single frame being used multiple times in the input space during the super-resolution process. Moreover, the approach of adopting the fixed consecutive frames directly does not allow deep learning models to learn the full extent of spatio-temporal inter-dependencies between a target frame and its neighbours along a video sequence. To mitigate these issues, this paper proposes a Spatio-Temporal Input Frame Selection (STIFS) algorithm based on image analysis to adaptively select the neighbouring frame(s) based on the spatio-temporal context dynamics with respect to the target frame. STIFS is first-ever dynamic selection mechanism proposed for VSR methods. It aims to enable VSR models to better learn spatio-temporal correlations in a given temporal radius and consequently maximise the quality of the high-definition output. The proposed STIFS algorithm achieved remarkable PSNR improvements in the high-resolution output for VSR models on benchmark datasets.
引用
下载
收藏
页码:48 / 58
页数:11
相关论文
共 50 条
  • [21] Sparse Spatio-Temporal Representation with Adaptive Regularized Dictionaries for Super-Resolution Based Video Coding
    Pan, Zhiming
    Xiong, Hongkai
    2012 DATA COMPRESSION CONFERENCE (DCC), 2012, : 139 - 148
  • [22] Video super-resolution reconstruction based on deep convolutional neural network and spatio-temporal similarity
    Li Linghui
    Du Junping
    Liang Meiyu
    Ren Nan
    Fan Dan
    The Journal of China Universities of Posts and Telecommunications, 2016, 23 (05) : 68 - 81
  • [23] Video super-resolution reconstruction based on deep convolutional neural network and spatio-temporal similarity
    Li Linghui
    Du Junping
    Liang Meiyu
    Ren Nan
    Fan Dan
    The Journal of China Universities of Posts and Telecommunications, 2016, (05) : 68 - 81
  • [24] Patch-based spatio-temporal super-resolution for video with non-rigid motion
    Salvador, Jordi
    Kochale, Axel
    Schweidler, Siegfried
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2013, 28 (05) : 483 - 493
  • [25] Spatio-temporal Super-Resolution Using Depth Map
    Awatsu, Yusaku
    Kawai, Norihiko
    Sato, Tomokazu
    Yokoya, Naokazu
    IMAGE ANALYSIS, PROCEEDINGS, 2009, 5575 : 696 - 705
  • [26] A super-resolution method for spatio-temporal plasma diagnostics
    Teliban, I
    Block, D
    Piel, A
    Naulin, V
    PLASMA PHYSICS AND CONTROLLED FUSION, 2006, 48 (03) : 419 - 431
  • [27] Super-resolution reconstruction using spatio-temporal filtering
    Goldberg, N
    Feuer, A
    Goodwin, GC
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2003, 14 (04) : 508 - 525
  • [28] Deep learning method for super-resolution reconstruction of the spatio-temporal flow field
    Bao, Kairui
    Zhang, Xiaoya
    Peng, Wei
    Yao, Wen
    ADVANCES IN AERODYNAMICS, 2023, 5 (01)
  • [29] Deep learning method for super-resolution reconstruction of the spatio-temporal flow field
    Kairui Bao
    Xiaoya Zhang
    Wei Peng
    Wen Yao
    Advances in Aerodynamics, 5
  • [30] Spatio-Temporal Super-Resolution from Compressed Video Employing Global and Local Motion
    Chen, Yue-Meng
    Bajic, Ivan V.
    2011 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING (PACRIM), 2011, : 907 - 912