RVSRT: Real-time Video Super Resolution Transformer

被引:0
|
作者
Ou, Linlin [1 ,2 ]
Chen, Yuanping [2 ]
机构
[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
关键词
Video super resolution; vision transformer; deep learning;
D O I
10.1117/12.2680156
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video super-resolution is the task of converting low-resolution video to high-resolution video. Existing methods with better intuitive effects are mainly based on convolutional neural networks (CNNs), but the architecture is heavy, resulting in a slow inference structure. Aiming at this problem, this paper proposes a real-time video super-resolution Transformer (RVSRT) can quickly complete the super-resolution task while considering the visual fluency of video frame switching. Unlike traditional methods based on CNNs, this paper does not process video frames separately with different network modules in the temporal domain, but batches adjacent frames through a single UNet-style structure end-to-end Transformer network architecture. Moreover, this paper creatively sets up two-stage interpolation sampling before and after the end-to-end network to maximize the performance of the traditional CV algorithm. The experimental results show that compared with SOTA TMNet [1], RVSRT has only 20% of the network size (2.3M vs 12.3M, parameters) while ensuring comparable performance, and the speed is increased by 80% (26.2 fps vs 14.3 fps, frame size is 720*576).
引用
收藏
页数:5
相关论文
共 50 条
  • [1] RSTT: Real-time Spatial Temporal Transformer for Space-Time Video Super-Resolution
    Geng, Zhicheng
    Liang, Luming
    Ding, Tianyu
    Zharkov, Ilya
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17420 - 17430
  • [2] Real-time Super Resolution Equipment for 8K Video
    Gohshi, Seiichi
    2014 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS (SIGMAP), 2014, : 149 - 156
  • [3] Super resolution processing for real-time video quality enhancement system
    Suzuki, Morimasa
    Journal of the Institute of Image Electronics Engineers of Japan, 2015, 44 (04) : 704 - 711
  • [4] Real-time super-resolution over raw video sequences
    Barreto, D
    Callicó, GM
    López, S
    García, L
    Núñez, A
    VLSI CIRCUITS AND SYSTEMS II, PTS 1 AND 2, 2005, 5837 : 628 - 637
  • [5] ASRSR: Adaptive Sending Resolution and Super-resolution for Real-time Video Streaming
    Wu, Ruoyu
    Bao, Wei
    Ge, Liming
    Zhou, Bing Bing
    PROCEEDINGS OF THE 19TH ACM INTERNATIONAL SYMPOSIUM ON QOS AND SECURITY FOR WIRELESS AND MOBILE NETWORKS, Q2SWINET 2023, 2023, : 61 - 68
  • [6] Robust Real-Time Super-Resolution on FPGA and an Application to Video Enhancement
    Angelopoulou, Maria E.
    Bouganis, Christos-Savvas
    Cheung, Peter Y. K.
    Constantinides, George A.
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2009, 2 (04)
  • [7] Real-time UHD video super-resolution and transcoding on heterogeneous hardware
    Dong, Yu
    Song, Li
    Xie, Rong
    Zhang, Wenjun
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2020, 17 (06) : 2029 - 2045
  • [8] Real-time UHD video super-resolution and transcoding on heterogeneous hardware
    Yu Dong
    Li Song
    Rong Xie
    Wenjun Zhang
    Journal of Real-Time Image Processing, 2020, 17 : 2029 - 2045
  • [9] A Novel Real-Time DSP-Based Video Super-Resolution System
    Lopez, Sebastian
    Callico, Gustavo M.
    Tobajas, Felix
    Lopez, Jose F.
    Sarmiento, Roberto
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2009, 55 (04) : 2264 - 2270
  • [10] Real-time Dictionary based Super-Resolution of Surveillance Video Streams and Targets
    Hospedales, Timothy M.
    Gong, Shaogang
    OPTICS AND PHOTONICS FOR COUNTERTERRORISM, CRIME FIGHTING, AND DEFENCE VIII, 2012, 8546