A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU

Cited by: 0
Authors
Zha, Wenqian [1]
Sun, Qi [1]
Bai, Yang [1]
Li, Wenbo [1]
Zheng, Haisheng [2]
Yu, Bei [1]
Wong, Martin D. F. [1]
Affiliations
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] SmartMore, Hong Kong, Peoples R China
Keywords
ACCURATE;
DOI
10.1109/ICCAD51958.2021.9643472
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Recent years have witnessed impressive progress in super-resolution (SR) processing. However, its real-time inference requirement sets a challenge not only for the model design but also for the on-chip implementation. In this paper, we implement a full-stack SR acceleration framework on embedded GPU devices. The special dictionary learning algorithm used in SR models was analyzed in detail and accelerated via a novel dictionary selective strategy. Besides, the hardware programming architecture together with the model structure is analyzed to guide the optimal design of computation kernels to minimize the inference latency under the resource constraints. With these novel techniques, the communication and computation bottlenecks in the deep dictionary learning-based SR models are tackled perfectly. The experiments on the edge embedded NVIDIA NX and 2080Ti show that our method outperforms the state-of-the-art NVIDIA TensorRT significantly and can achieve real-time performance.
Pages: 9
Related papers
50 in total
  • [31] A Data-Centric Accelerator for High-Performance Hypergraph Processing
    Wang, Qinggang
    Zheng, Long
    Hu, Ao
    Huang, Yu
    Yao, Pengcheng
    Gui, Chuangyi
    Liao, Xiaofei
    Jin, Hai
    Xue, Jingling
    2022 55TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2022, : 1326 - 1341
  • [32] Subjective Assessment of Super-Resolution -High-Resolution Effect of Nonlinear Signal Processing-
    Mori, Chinatsu
    Sugie, Masaki
    Takeshita, Hirohisa
    Gohshi, Seiichi
    2015 10th Asia-Pacific Symposium on Information and Telecommunication Technologies (APSITT), 2015,
  • [33] Super-Resolution Imaging by Arrays of High-Index Spheres Embedded in Transparent Matrices
    Allen, Kenneth W.
    Farahi, Navid
    Li, Yangcheng
    Limberopoulos, Nicholaos I.
    Walker, Dennis E., Jr.
    Urbas, Augustine M.
    Astratov, Vasily N.
    IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE (NAECON 2014), 2014, : 50 - 52
  • [34] GPU-Accelerated Light-field Image Super-resolution
    Trung-Hieu Tran
    Mammadov, Gasim
    Sun, Kaicong
    Simon, Sven
    2018 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND APPLICATIONS (ACOMP), 2018, : 7 - 13
  • [35] Block Matching Super-Resolution Parallel GPU Implementation for Computational Imaging
    Marenzi, E.
    Torti, E.
    Leporati, F.
    Quevedo, E.
    Callico, G. M.
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2017, 63 (04) : 368 - 376
  • [36] Parallel Implementation of Super-Resolution Based Neighbor Embedding Using GPU
    Moustafa, Marwa
    Ebeid, Hala M.
    Helmy, Ashraf
    Nazamy, Taymoor M.
    Tolba, Mohamed F.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 628 - 638
  • [37] Processing a sequence of images for super-resolution problem solving
    A. S. Machikhin
    Optics and Spectroscopy, 2007, 103 : 839 - 842
  • [38] Super-Resolution Raman Spectroscopy by Digital Image Processing
    Tomita, Motohiro
    Hashiguchi, Hiroki
    Yamaguchi, Takuya
    Takei, Munehisa
    Kosemura, Daisuke
    Ogura, Atsushi
    JOURNAL OF SPECTROSCOPY, 2013, 2013
  • [39] Curvelet Transform Based Super-Resolution Image Processing
    Thuong Le-Tien
    Luc Nguyen-Tan
    Giang Le-Thach
    Cao Bui-Thu
    2012 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2012, : 170 - 175
  • [40] FACE IMAGE PROCESSING BY TV FILTER AND SUPER-RESOLUTION
    Goto, Tomio
    Hirano, Satoshi
    Sakurai, Masaru
    2014 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING CONFERENCE, 2014, : 245 - 248