STIFS: Spatio-Temporal Input Frame Selection for Learning-based Video Super-Resolution Models

Authors
Baniya, Arbind Agrahari [1]
Lee, Tsz-Kwan [1]
Eklund, Peter W. [1]
Aryal, Sunil [1]
Affiliations
[1] Deakin Univ, Sch IT, Geelong, Vic, Australia
Keywords
High Definition Video; Image Analysis; Image Quality; Video Signal Processing; Super-resolution
DOI
10.5220/0011339900003289
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Deep learning-based Video Super-Resolution (VSR) methods rely on learning spatio-temporal correlations between a target frame and its neighbouring frames within a given temporal radius to generate a high-resolution output. Recent VSR models commonly adopt a sliding window mechanism, which picks a fixed number of consecutive frames as the neighbouring frames for a given target frame. As a result, a single frame is used multiple times in the input space during the super-resolution process. Moreover, directly adopting fixed consecutive frames does not allow deep learning models to learn the full extent of the spatio-temporal inter-dependencies between a target frame and its neighbours along a video sequence. To mitigate these issues, this paper proposes a Spatio-Temporal Input Frame Selection (STIFS) algorithm, based on image analysis, that adaptively selects the neighbouring frame(s) according to the spatio-temporal context dynamics with respect to the target frame. STIFS is the first dynamic selection mechanism proposed for VSR methods. It aims to enable VSR models to better learn spatio-temporal correlations within a given temporal radius and consequently maximise the quality of the high-definition output. The proposed STIFS algorithm achieved remarkable PSNR improvements in the high-resolution output of VSR models on benchmark datasets.
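The contrast the abstract draws between fixed sliding-window input and adaptive input selection can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the function names are hypothetical, frames are taken to be flattened grayscale pixel sequences, windows are clamped at the sequence ends, and the mean-absolute-difference ranking is only a placeholder. The abstract does not state the actual STIFS selection criterion, so this is not the authors' algorithm.

```python
from collections import Counter


def sliding_window_indices(target, num_frames, radius):
    """Fixed consecutive neighbours of `target`.

    Indices are clamped at the sequence ends (an assumed padding choice).
    """
    return [min(max(target + offset, 0), num_frames - 1)
            for offset in range(-radius, radius + 1)]


def window_usage_counts(num_frames, radius):
    """Count, for each frame, how many target windows contain it."""
    counts = Counter()
    for target in range(num_frames):
        counts.update(set(sliding_window_indices(target, num_frames, radius)))
    return counts


def mean_abs_diff(frame_a, frame_b):
    """Mean absolute pixel difference between two flattened frames."""
    return sum(abs(a - b) for a, b in zip(frame_a, frame_b)) / len(frame_a)


def adaptive_indices(frames, target, radius, k):
    """Pick k neighbours inside the temporal radius by a placeholder score.

    Assumption: neighbours closest to the target by mean absolute difference
    are preferred. This stands in for the unspecified STIFS criterion.
    """
    lo = max(target - radius, 0)
    hi = min(target + radius, len(frames) - 1)
    candidates = [i for i in range(lo, hi + 1) if i != target]
    ranked = sorted(candidates,
                    key=lambda i: mean_abs_diff(frames[i], frames[target]))
    return sorted(ranked[:k] + [target])
```

With a fixed window, every interior frame appears in 2 * radius + 1 input windows, which is the repeated use of a single frame that the abstract describes. The adaptive variant still restricts candidates to the same temporal radius but lets the chosen neighbours vary with frame content.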
Pages: 48-58
Number of pages: 11