An FPGA-Based Residual Recurrent Neural Network for Real-Time Video Super-Resolution

被引:15
|
作者
Sun, Kaicong [1 ]
Koch, Maurice [1 ]
Wang, Zhe [1 ]
Jovanovic, Slavisa [2 ]
Rabah, Hassan [2 ]
Simon, Sven [1 ]
机构
[1] Univ Stuttgart, Inst Parallel & Distributed Syst, D-70569 Stuttgart, Germany
[2] Univ Lorraine, Inst Jean Lamour, F-54000 Nancy, France
关键词
Convolution; Field programmable gate arrays; Real-time systems; Streaming media; UHDTV; Image reconstruction; Superresolution; Video super-resolution; residual recurrent neural network; FPGA; real-time; 4K UHD; hardware-efficient;
D O I
10.1109/TCSVT.2021.3080241
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a hardware-efficient residual recurrent neural network for real-time video super-resolution (VSR) based on field programmable gate array (FPGA). Although recent learning-based VSR methods have achieved remarkable performance, the large computational complexity prohibits the deployment of the sophisticated VSR models on FPGA for real-time applications. Limited by the hardware resources, state-of-the-art FPGA-based VSR methods perform single-image super-resolution over the video sequence and suffer from temporal inconsistency. In order to exploit the inter-frame temporal correlation for real-time VSR on low-complexity hardware, we introduce a hardware-efficient recurrent neural network ERVSR. Specially, the proposed ERVSR leverages the input frame and the temporal information entailed in the hidden state to reconstruct the high-resolution counterpart. To reduce the network parameters, the low-resolution input branch and the hidden state branch are convolved individually and a channel modulation coefficient is proposed to explicitly guide the network to allocate the amount of output feature channels to each branch. Additionally, in order to reduce the memory consumption, we perform a dedicated lightweight compression of the hidden state by introducing a statistical normalization scheme followed by a fixed-point quantization. Besides, we adopt group convolution and depthwise separable convolution to further compact the network. We evaluated the proposed ERVSR on multiple public datasets from different aspects. Experimental results demonstrate that ERVSR performs better than the existing state-of-the-art FPGA-based VSR methods in both image quality and data throughput.
引用
收藏
页码:1739 / 1750
页数:12
相关论文
共 50 条
  • [1] An FPGA-based design for real-time super-resolution reconstruction
    Yoan Marin
    Johel Miteran
    Julien Dubois
    Barthélémy Heyrman
    Dominique Ginhac
    [J]. Journal of Real-Time Image Processing, 2020, 17 : 1769 - 1785
  • [2] An FPGA-based design for real-time super-resolution reconstruction
    Marin, Yoan
    Miteran, Johel
    Dubois, Julien
    Heyrman, Barthelemy
    Ginhac, Dominique
    [J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2020, 17 (06) : 1769 - 1785
  • [3] FPGA-based real-time super-resolution on an adaptive image sensor
    Angelopoulou, Maria E.
    Bouganis, Christos-Savvas
    Cheung, Peter Y. K.
    Constantinides, George A.
    [J]. RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2008, 4943 : 125 - 136
  • [4] FPGA-based Real-Time Super-Resolution System for Ultra High Definition Videos
    He, Zhuolun
    Huang, Hanxian
    Jiang, Ming
    Bai, Yuanchao
    Luo, Guojie
    [J]. PROCEEDINGS 26TH IEEE ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2018), 2018, : 181 - 188
  • [5] FPGA Implementation of a Real-Time Super-Resolution System Using a Convolutional Neural Network
    Manabe, Taito
    Shibata, Yuichiro
    Oguri, Kiyoshi
    [J]. 2016 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2016, : 249 - 252
  • [6] Robust Real-Time Super-Resolution on FPGA and an Application to Video Enhancement
    Angelopoulou, Maria E.
    Bouganis, Christos-Savvas
    Cheung, Peter Y. K.
    Constantinides, George A.
    [J]. ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2009, 2 (04)
  • [7] A Real-Time Learning-Based Super-Resolution System on FPGA
    Zha, Daolu
    Jin, Xi
    Shang, Rui
    Yang, Pengfei
    [J]. PARALLEL PROCESSING LETTERS, 2020, 30 (04)
  • [8] Winograd-based Real-time Super-Resolution System on FPGA
    Shi, Bizhao
    Tang, Zhucheng
    Luo, Guojie
    Jiang, Ming
    [J]. 2019 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2019), 2019, : 423 - 426
  • [9] PhD Forum: An FPGA-based design for real-time Super Resolution Reconstruction
    Marin, Yoan
    Miteran, Johel
    Dubois, Julien
    Heyrman, Barthelemy
    Ginhac, Dominique
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON DISTRIBUTED SMART CAMERAS (ICDSC'18), 2018,
  • [10] A Real-Time Convolutional Neural Network for Super-Resolution on FPGA With Applications to 4K UHD 60 fps Video Services
    Kim, Yongwoo
    Choi, Jae-Seok
    Kim, Munchurl
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (08) : 2521 - 2534