RVSRT: Real-time Video Super Resolution Transformer

被引:0
|
作者
Ou, Linlin [1 ,2 ]
Chen, Yuanping [2 ]
机构
[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
关键词
Video super resolution; vision transformer; deep learning;
D O I
10.1117/12.2680156
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video super-resolution is the task of converting low-resolution video to high-resolution video. Existing methods with better intuitive effects are mainly based on convolutional neural networks (CNNs), but the architecture is heavy, resulting in a slow inference structure. Aiming at this problem, this paper proposes a real-time video super-resolution Transformer (RVSRT) can quickly complete the super-resolution task while considering the visual fluency of video frame switching. Unlike traditional methods based on CNNs, this paper does not process video frames separately with different network modules in the temporal domain, but batches adjacent frames through a single UNet-style structure end-to-end Transformer network architecture. Moreover, this paper creatively sets up two-stage interpolation sampling before and after the end-to-end network to maximize the performance of the traditional CV algorithm. The experimental results show that compared with SOTA TMNet [1], RVSRT has only 20% of the network size (2.3M vs 12.3M, parameters) while ensuring comparable performance, and the speed is increased by 80% (26.2 fps vs 14.3 fps, frame size is 720*576).
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Real-time Super Resolution Algorithm for Security Cameras
    Gohshi, Seiichi
    2015 12TH INTERNATIONAL JOINT CONFERENCE ON E-BUSINESS AND TELECOMMUNICATIONS (ICETE), VOL 5, 2015, : 92 - 97
  • [22] REAL-TIME IMAGE SUPER RESOLUTION USING AN FPGA
    Bowen, Oliver
    Bouganis, Christos-Savvas
    2008 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE AND LOGIC APPLICATIONS, VOLS 1 AND 2, 2008, : 89 - 94
  • [23] Real-EVE: Real-Time Edge-Assist Video Enhancement for Joint Denoising and Super-Resolution
    Ge, Liming
    Bao, Wei
    Yuan, Dong
    Zhou, Bing Bing
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT I, 2024, 14487 : 320 - 339
  • [24] Cost-Optimized Video Transfer using Real-Time Super Resolution Convolutional Neural Networks
    Lodha, Ishaan
    Kolur, Lakshana
    Krishnan, Keertan
    Dheenadayalan, Kumar
    Sitaram, Dinkar
    Nandi, Siddhartha
    PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022, : 213 - 221
  • [25] Real-Time Non-Rigid Multi-Frame Depth Video Super-Resolution
    Al Ismaeil, Kassem
    Aouada, Djamila
    Solignac, Thomas
    Mirbach, Bruno
    Ottersten, Bjorn
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015,
  • [26] Edge-Assisted Deep Video Denoising and Super-Resolution for Real-Time Surveillance at Night
    Ge, Liming
    Bao, Wei
    Yuan, Dong
    Zhou, Bing B.
    PROCEEDINGS OF THE 2022 THE 28TH ANNUAL INTERNATIONAL CONFERENCE ON MOBILE COMPUTING AND NETWORKING, ACM MOBICOM 2022, 2022, : 783 - 785
  • [27] Mapping of real-time and low-cost-super-resolution algorithms onto a hybrid video encoder
    Callicó, GM
    Llopis, RP
    Núñez, A
    Sethuraman, R
    VLSI CIRCUITS AND SYSTEMS, 2003, 5117 : 42 - 52
  • [28] An FPGA-Based Residual Recurrent Neural Network for Real-Time Video Super-Resolution
    Sun, Kaicong
    Koch, Maurice
    Wang, Zhe
    Jovanovic, Slavisa
    Rabah, Hassan
    Simon, Sven
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 1739 - 1750
  • [29] Real-time video super resolution network using recurrent multi-branch dilated convolutions
    Zeng, Yubin
    Xiao, Zhijiao
    Hung, Kwok-Wai
    Lui, Simon
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 93
  • [30] Real-Time Super-Resolution System of 4K-Video Based on Deep Learning
    Cao, Yanpeng
    Wang, Chengcheng
    Song, Changjun
    Tang, Yongming
    Li, He
    2021 IEEE 32ND INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2021), 2021, : 69 - 76