RVSRT: Real-time Video Super Resolution Transformer

被引:0
|
作者
Ou, Linlin [1 ,2 ]
Chen, Yuanping [2 ]
机构
[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
关键词
Video super resolution; vision transformer; deep learning;
D O I
10.1117/12.2680156
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video super-resolution is the task of converting low-resolution video to high-resolution video. Existing methods with better intuitive effects are mainly based on convolutional neural networks (CNNs), but the architecture is heavy, resulting in a slow inference structure. Aiming at this problem, this paper proposes a real-time video super-resolution Transformer (RVSRT) can quickly complete the super-resolution task while considering the visual fluency of video frame switching. Unlike traditional methods based on CNNs, this paper does not process video frames separately with different network modules in the temporal domain, but batches adjacent frames through a single UNet-style structure end-to-end Transformer network architecture. Moreover, this paper creatively sets up two-stage interpolation sampling before and after the end-to-end network to maximize the performance of the traditional CV algorithm. The experimental results show that compared with SOTA TMNet [1], RVSRT has only 20% of the network size (2.3M vs 12.3M, parameters) while ensuring comparable performance, and the speed is increased by 80% (26.2 fps vs 14.3 fps, frame size is 720*576).
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Real-time video super-resolution using lightweight depthwise separable group convolutions with channel shuffling
    Xiao, Zhijiao
    Zhang, Zhikai
    Hung, Kwok-Wai
    Lui, Simon
    Journal of Visual Communication and Image Representation, 2021, 75
  • [42] Real-time video super-resolution using lightweight depthwise separable group convolutions with channel shuffling *
    Xiao, Zhijiao
    Zhang, Zhikai
    Hung, Kwok-Wai
    Lui, Simon
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 75
  • [43] Real-time video mosaicing with a high-resolution microendoscope
    Bedard, Noah
    Quang, Timothy
    Schmeler, Kathleen
    Richards-Kortum, Rebecca
    Tkaczyk, Tomasz S.
    BIOMEDICAL OPTICS EXPRESS, 2012, 3 (10): : 2428 - 2435
  • [44] Real time turbulent video super-resolution using MPEG 4
    Fishbain, Barak
    Yaroslavsky, Leonid P.
    Ideses, Lanir A.
    REAL-TIME IMAGE PROCESSING 2008, 2008, 6811
  • [45] Real time turbulent video perfecting by image stabilization and super-resolution
    Fishbain, Barak
    Yaroslavsky, Leonid P.
    Ideses, Ianir A.
    PROCEEDINGS OF THE SEVENTH IASTED INTERNATIONAL CONFERENCE ON VISUALIZATION, IMAGING, AND IMAGE PROCESSING, 2007, : 213 - +
  • [46] Perceptual Losses for Real-Time Style Transfer and Super-Resolution
    Johnson, Justin
    Alahi, Alexandre
    Li Fei-Fei
    COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 : 694 - 711
  • [47] Real-time Super-resolution Imaging Using a Single Sensor
    Li, Lianlin
    Ruan, Henxin
    Li, Fang
    Cui, Tiejun
    2015 1st URSI Atlantic Radio Science Conference (URSI AT-RASC), 2015,
  • [48] Super-Resolution Simulation for Real-Time Prediction of Urban Micrometeorology
    Onishi, Ryo
    Sugiyama, Daisuke
    Matsuda, Keigo
    SOLA, 2019, 15 : 178 - 182
  • [49] Exploring real-time super-resolution generative adversarial networks
    Hu, Xiaoyan
    Wang, Zechen
    Liu, Xiangjun
    Li, Xinran
    Cheng, Guang
    Gong, Jian
    INTERNATIONAL JOURNAL OF SENSOR NETWORKS, 2021, 36 (02) : 85 - 96
  • [50] Image quality enhancement based on real-time deconvolution and super resolution
    Marin, Yoan
    Douiyek, Abdelali
    Miteran, Johel
    Dubois, Julien
    Heyrman, Barthelemy
    Ginhac, Dominique
    UNCONVENTIONAL OPTICAL IMAGING, 2018, 10677