A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU

被引:0
|
作者
Zha, Wenqian [1 ]
Sun, Qi [1 ]
Bai, Yang [1 ]
Li, Wenbo [1 ]
Zheng, Haisheng [2 ]
Yu, Bei [1 ]
Wong, Martin D. F. [1 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] SmartMore, Hong Kong, Peoples R China
关键词
ACCURATE;
D O I
10.1109/ICCAD51958.2021.9643472
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recent years have witnessed impressive progress in super-resolution (SR) processing. However, its real-time inference requirement sets a challenge not only for the model design but also for the on-chip implementation. In this paper, we implement a full-stack SR acceleration framework on embedded GPU devices. The special dictionary learning algorithm used in SR models was analyzed in detail and accelerated via a novel dictionary selective strategy. Besides, the hardware programming architecture together with the model structure is analyzed to guide the optimal design of computation kernels to minimize the inference latency under the resource constraints. With these novel techniques, the communication and computation bottlenecks in the deep dictionary learning-based SR models are tackled perfectly. The experiments on the edge embedded NVIDIA NX and 2080Ti show that our method outperforms the state-of-the-art NVIDIA TensorRT significantly and can achieve real-time performance.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] A High-Performance Accelerator for Real-Time Super-Resolution on Edge FPGAs
    Liu, Hongduo
    Qian, Yijian
    Liang, Youqiang
    Zhang, Bin
    Liu, Zhaohan
    He, Tao
    Zhao, Wenqian
    Lu, Jiangbo
    Yu, Bei
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2024, 29 (03)
  • [2] High-performance quantization for spectral super-resolution
    Gunturk, C. Sinan
    Li, Weilin
    2019 13TH INTERNATIONAL CONFERENCE ON SAMPLING THEORY AND APPLICATIONS (SAMPTA), 2019,
  • [3] NanoJ: a high-performance open-source super-resolution microscopy toolbox
    Laine, Romain F.
    Tosheva, Kalina L.
    Gustafsson, Nils
    Gray, Robert D. M.
    Almada, Pedro
    Albrecht, David
    Risa, Gabriel T.
    Hurtig, Fredrik
    Lindas, Ann-Christin
    Baum, Buzz
    Mercer, Jason
    Leterrier, Christophe
    Pereira, Pedro M.
    Culley, Sian
    Henriques, Ricardo
    JOURNAL OF PHYSICS D-APPLIED PHYSICS, 2019, 52 (16)
  • [4] GPU ACCELERATED HIGH-QUALITY VIDEO/IMAGE SUPER-RESOLUTION
    Zhao, Zhangzong
    Song, Li
    Xie, Rong
    Yang, Xiaokang
    2016 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2016,
  • [5] Embedded GPU Implementation for High-Performance Ultrasound Imaging
    Rossi, Stefano
    Boni, Enrico
    ELECTRONICS, 2021, 10 (08)
  • [6] High-performance medical image secret sharing using super-resolution for CAD systems
    M. Raviraja Holla
    Alwyn R. Pais
    Applied Intelligence, 2022, 52 : 16852 - 16868
  • [7] High-performance medical image secret sharing using super-resolution for CAD systems
    Holla, M. Raviraja
    Pais, Alwyn R.
    APPLIED INTELLIGENCE, 2022, 52 (14) : 16852 - 16868
  • [8] Implementation of Java Accelerator for High-Performance Embedded Systems
    Kimura, Motoki
    Miki, Morgan Hirosuke
    Onoye, Takao
    Shirakawa, Isao
    IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, 2003, E86-A (12) : 3079 - 3088
  • [9] Variational Bayesian Image Super-Resolution with GPU Acceleration
    Chantas, Giannis
    ARTIFICIAL NEURAL NETWORKS-ICANN 2010, PT I, 2010, 6352 : 480 - 489
  • [10] A GPU-BASED IMPLEMENTATION ON SUPER-RESOLUTION RECONSTRUCTION
    Wang, Kai
    Wang, Lifu
    Lu, Jian
    Sun, Yi
    Zhao, Shuping
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 849 - 852