Real-time UHD video super-resolution and transcoding on heterogeneous hardware

被引:3
|
作者
Dong, Yu [1 ]
Song, Li [1 ,2 ]
Xie, Rong [1 ]
Zhang, Wenjun [1 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, AI Inst, MoE, Key Lab Artificial Intelligence, Shanghai, Peoples R China
关键词
UHD video; Super-resolution; Real-time; GPU; IMAGE;
D O I
10.1007/s11554-019-00913-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Videos have become the major type of data produced and consumed every day. With screens grow larger, ultra high definition (UHD) videos are becoming more popular since they provide better visual experience. However, video contents with UHD resolution are still scarce. High-performance video super-resolution (SR) techniques that can obtain high resolution (HR) videos from low resolution (LR) sources are recently used in UHD video production. Deep learning (DL)-based SR methods can provide HR videos with appreciable objective and subjective qualities, while their massive computational complexity makes the processing speed far slower than real-time even on GPU servers when producing UHD videos. Moreover, transcoding and other video processing algorithms executed during the enhancement are also time and resource consuming, which performs relatively slow on ordinary CPU and GPU servers. Nowadays, hardware including GPU, field-programmable gate array (FPGA) and application specific integrated circuit (ASIC) are proved to have outstanding capability on image and video processing tasks in different aspects, and there are also dedicated hardware accelerators meant for specific video processing tasks. In this paper, we focus on accelerating a UHD video enhancement workflow on heterogeneous system with multiple hardware accelerators. First, we optimize the most time consuming task, video SR, with CUDNN and CUDA libraries to achieve real-time processing speed for a single UHD output frame on an ordinary GPU. Second, we design a GPU-friendly multi-thread scheduling algorithm for data and computation to better utilize GPU resources and achieve real-time performance on outputting UHD video clips. Third, targeting on production environment, we build a UHD video enhancement application on selected heterogeneous hardware, with an integrated command line tool of our proposed algorithm, and achieve 60 fps real-time end to end processing speed. Experiments show high efficiency, robustness and compatibility of our approach.
引用
收藏
页码:2029 / 2045
页数:17
相关论文
共 50 条
  • [41] Exploring real-time super-resolution generative adversarial networks
    Hu, Xiaoyan
    Wang, Zechen
    Liu, Xiangjun
    Li, Xinran
    Cheng, Guang
    Gong, Jian
    [J]. INTERNATIONAL JOURNAL OF SENSOR NETWORKS, 2021, 36 (02) : 85 - 96
  • [42] Neural foveated super-resolution for real-time VR rendering
    Ye, Jiannan
    Meng, Xiaoxu
    Guo, Daiyun
    Shang, Cheng
    Mao, Haotian
    Yang, Xubo
    [J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2024, 35 (04)
  • [43] Real-time Document Image Super-Resolution by Fast Matting
    Zheng, Yun
    Kang, Xudong
    Li, Shutao
    He, Yuan
    Sun, Jun
    [J]. 2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, : 232 - 236
  • [44] Real-time Super-resolution Sound Source Localization for Robots
    Nakamura, Keisuke
    Nakadai, Kazuhiro
    Ince, Goekhan
    [J]. 2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 694 - 699
  • [45] RTSRGAN: Real-Time Super-Resolution Generative Adversarial Networks
    Hu, Xiaoyan
    Liu, Xiangjun
    Wang, Zechen
    Li, Xinran
    Peng, Wenqiang
    Cheng, Guang
    [J]. 2019 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2019, : 321 - 326
  • [46] Deep learning approaches for real-time image super-resolution
    Pourya Shamsolmoali
    M. Emre Celebi
    Ruili Wang
    [J]. Neural Computing and Applications, 2020, 32 : 14519 - 14520
  • [47] Direct observation and classification of heterogeneous protein aggregation in real-time through super-resolution and aggregational fingerprinting
    Bender, Steen
    Kaestel-Hansen, Jacob
    Zhang, Min
    Hatzakis, Nikos S.
    [J]. BIOPHYSICAL JOURNAL, 2023, 122 (03) : 16A - 16A
  • [48] Practical considerations for real-time super-resolution implementation. techniques over video coding platforms
    Callicó, GM
    López, S
    Llopis, RP
    Sethuraman, R
    Núñez, A
    López, JF
    Marrero, M
    Sarmiento, R
    [J]. VLSI CIRCUITS AND SYSTEMS II, PTS 1 AND 2, 2005, 5837 : 613 - 627
  • [49] Real-time video super-resolution using lightweight depthwise separable group convolutions with channel shuffling *
    Xiao, Zhijiao
    Zhang, Zhikai
    Hung, Kwok-Wai
    Lui, Simon
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 75
  • [50] Real-time video super-resolution using lightweight depthwise separable group convolutions with channel shuffling
    Xiao, Zhijiao
    Zhang, Zhikai
    Hung, Kwok-Wai
    Lui, Simon
    [J]. Journal of Visual Communication and Image Representation, 2021, 75