Toward Efficient Co-Design of CNN Quantization and HW Architecture on FPGA Hybrid-Accelerator

Cited by: 0
Authors
Zhang, Yiran [1]
Li, Guiying [1]
Yuan, Bo [1]
Affiliations
[1] Southern Univ Sci & Technol, Guangdong Prov Key Lab Brain Inspired Intelligent, Shenzhen, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
CNN accelerator; FPGA; DSE method;
DOI
10.1109/SEDA62518.2024.10617620
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Field-programmable gate arrays (FPGAs) have emerged as a promising platform for accelerating convolutional neural networks (CNNs). In this paper, we propose a low-latency CNN hybrid-accelerator system and an efficient design space exploration (DSE) method. Specifically, our targeted FPGA platform combines different types of accelerators to gain two advantages: high concurrency and full utilization of hardware resources (i.e., lookup tables (LUTs) and digital signal processors (DSPs)). In addition, we adopt a bandwidth-aware analytical model of system latency that accounts for pipeline stalls and computation cycles simultaneously. Furthermore, to search the huge design space spanning layer-wise CNN quantization and FPGA hybrid-accelerator architecture efficiently, we propose a DSE method named DiMEGA, a differentiable method embedded by a genetic algorithm. The performance of our CNN hybrid-accelerator system is demonstrated on a PYNQ-Z2 FPGA platform. The experimental results show that system latency is reduced by 42% to 48% without sacrificing accuracy, and that the DSE time of DiMEGA is reduced by 23% on ResNet20-CIFAR10 and by 63% on ResNet56-CIFAR10, compared with the state of the art (SOTA).
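The abstract does not give the form of the bandwidth-aware latency model. As a rough illustration only, the sketch below shows one generic way an analytical model can count computation cycles and bandwidth-induced stalls together: a layer stalls whenever moving its data takes longer than computing on it. The Layer fields, the function names, and the parameters macs_per_cycle and bytes_per_cycle are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass
class Layer:
    """Hypothetical per-layer workload description (not from the paper)."""
    macs: int         # multiply-accumulate operations in the layer
    bytes_moved: int  # off-chip traffic for weights and activations, in bytes


def layer_cycles(layer, macs_per_cycle, bytes_per_cycle):
    """Return (compute_cycles, stall_cycles) for one layer.

    Assumption: data transfer overlaps with computation, so the pipeline
    stalls only for the cycles by which the transfer exceeds the compute.
    """
    compute = -(-layer.macs // macs_per_cycle)          # ceiling division
    transfer = -(-layer.bytes_moved // bytes_per_cycle)  # ceiling division
    stall = max(0, transfer - compute)
    return compute, stall


def system_latency_cycles(layers, macs_per_cycle, bytes_per_cycle):
    """Sum compute and stall cycles over layers executed one after another."""
    total = 0
    for layer in layers:
        compute, stall = layer_cycles(layer, macs_per_cycle, bytes_per_cycle)
        total += compute + stall
    return total


if __name__ == "__main__":
    # One compute-bound layer and one bandwidth-bound layer (made-up numbers).
    layers = [Layer(macs=1000, bytes_moved=100), Layer(macs=100, bytes_moved=1000)]
    print(system_latency_cycles(layers, macs_per_cycle=10, bytes_per_cycle=10))
```

In a sketch like this, lowering a layer's bit width would shrink bytes_moved and could raise macs_per_cycle, which is why quantization and accelerator architecture interact in the latency estimate.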
Pages: 678-683
Number of pages: 6
Related papers
50 items in total
  • [21] A VLIW architecture simulator innovative approach for HW-SW co-design
    Barbieri, I
    Bariani, M
    Raggio, M
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 1375 - 1378
  • [22] NAX: Neural Architecture and Memristive Xbar based Accelerator Co-design
    Negi, Shubham
    Chakraborty, Indranil
    Ankit, Aayush
    Roy, Kaushik
    PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 451 - 456
  • [23] A Co-Design Method for Parallel Image Processing Accelerator based on DSP and FPGA
    Wang, Ze
    Weng, Kaijian
    Cheng, Zhao
    Yan, Luxin
    Guan, Jing
    MIPPR 2011: PARALLEL PROCESSING OF IMAGES AND OPTIMIZATION AND MEDICAL IMAGING PROCESSING, 2011, 8005
  • [24] Deep Neural Network Model and FPGA Accelerator Co-Design: Opportunities and Challenges
    Hao, Cong
    Chen, Deming
    2018 14TH IEEE INTERNATIONAL CONFERENCE ON SOLID-STATE AND INTEGRATED CIRCUIT TECHNOLOGY (ICSICT), 2018, : 1413 - 1416
  • [25] Deep CNN Co-design for HEVC CU Partition Prediction on FPGA–SoC
    Soulef Bouaafia
    Randa Khemiri
    Seifeddine Messaoud
    Fatma Ezahra Sayadi
    Neural Processing Letters, 2022, 54 : 3283 - 3301
  • [26] Algorithm and Hardware Co-Design for Efficient PMCMC Acceleration on FPGA
    Wang, Fulong
    Peng, Shiyu
    Gong, Yuanhao
    Li, Minghan
    Liu, Shuanglong
    2024 9TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, ICSIP, 2024, : 330 - 334
  • [27] Hybrid Prototyping Methodology for Rapid System Validation in HW/SW Co-Design
    Wicaksana, Arief
    Charif, Amir
    Andriamisaina, Caaliph
    Ventroux, Nicolas
    2019 CONFERENCE ON DESIGN AND ARCHITECTURES FOR SIGNAL AND IMAGE PROCESSING (DASIP), 2019, : 35 - 40
  • [28] Energy-efficient crypto acceleration with HW/SW co-design for HTTPS
    Xiao, Chunhua
    Zhang, Lei
    Liu, Weichen
    Bergmann, Neil
    Xie, Yuhua
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 96 : 336 - 347
  • [29] Energy Efficient Data Access and Storage through HW/SW Co-design
    Guo, Minyi
    ACM SIGPLAN NOTICES, 2014, 49 (05) : 83 - 83
  • [30] FPGA/DNN Co-Design: An Efficient Design Methodology for IoT Intelligence on the Edge
    Hao, Cong
    Zhang, Xiaofan
    Li, Yuhong
    Huang, Sitao
    Xiong, Jinjun
    Rupnow, Kyle
    Hwu, Wen-mei
    Chen, Deming
    PROCEEDINGS OF THE 2019 56TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2019,