Toward Efficient Co-Design of CNN Quantization and HW Architecture on FPGA Hybrid-Accelerator

Cited by: 0

Authors:

Zhang, Yiran [1]

Li, Guiying [1]

Yuan, Bo [1]

Affiliations:

[1] Southern Univ Sci & Technol, Guangdong Prov Key Lab Brain Inspired Intelligent, Shenzhen, Peoples R China

Source:

2024 INTERNATIONAL SYMPOSIUM OF ELECTRONICS DESIGN AUTOMATION, ISEDA 2024 | 2024

Funding:

National Natural Science Foundation of China;

Keywords:

CNN accelerator; FPGA; DSE method;

DOI:

10.1109/SEDA62518.2024.10617620

Chinese Library Classification (CLC) number:

TP3 [Computing Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Field programmable gate array (FPGA) has emerged as a promising platform for accelerating convolutional neural networks (CNNs). In this paper, we propose a low-latency CNN hybrid-accelerator system and an efficient design space exploration (DSE) method. Specifically, our targeted FPGA platform consists of different types of accelerators for two advantages: high concurrency and full hardware utilization (i.e., lookup tables (LUTs) and digital signal processors (DSPs)). Besides, we adopt a bandwidth-aware analytical model for system latency to consider pipeline stalls and computation cycles simultaneously. Furthermore, for the huge design space encompassing layer-wise CNN quantization and FPGA hybrid-accelerator architecture, we propose a DSE method (named DiMEGA) aimed at enhancing search efficiency, which is a differentiable method embedded by a genetic algorithm. The performance of our CNN hybrid-accelerator system is demonstrated on a PYNQ-Z2 FPGA platform. The experimental results show that the system latency can be reduced by 42% to 48% without sacrificing accuracy, and the DSE time of DiMEGA is reduced by 23% on ResNet20-CIFAR10, and 63% on ResNet56-CIFAR10, compared with SOTA.


Pages: 678-683

Page count: 6

50 items in total

[21] A VLIW architecture simulator innovative approach for HW-SW co-design
Barbieri, I
Bariani, M
Raggio, M
2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 1375 - 1378
[22] NAX: Neural Architecture and Memristive Xbar based Accelerator Co-design
Negi, Shubham
Chakraborty, Indranil
Ankit, Aayush
Roy, Kaushik
PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 451 - 456
[23] A Co-Design Method for Parallel Image Processing Accelerator based on DSP and FPGA
Wang, Ze
Weng, Kaijian
Cheng, Zhao
Yan, Luxin
Guan, Jing
MIPPR 2011: PARALLEL PROCESSING OF IMAGES AND OPTIMIZATION AND MEDICAL IMAGING PROCESSING, 2011, 8005
[24] Deep Neural Network Model and FPGA Accelerator Co-Design: Opportunities and Challenges
Hao, Cong
Chen, Deming
2018 14TH IEEE INTERNATIONAL CONFERENCE ON SOLID-STATE AND INTEGRATED CIRCUIT TECHNOLOGY (ICSICT), 2018, : 1413 - 1416
[25] Deep CNN Co-design for HEVC CU Partition Prediction on FPGA–SoC
Soulef Bouaafia
Randa Khemiri
Seifeddine Messaoud
Fatma Ezahra Sayadi
Neural Processing Letters, 2022, 54 : 3283 - 3301
[26] Algorithm and Hardware Co-Design for Efficient PMCMC Acceleration on FPGA
Wang, Fulong
Peng, Shiyu
Gong, Yuanhao
Li, Minghan
Liu, Shuanglong
2024 9TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, ICSIP, 2024, : 330 - 334
[27] Hybrid Prototyping Methodology for Rapid System Validation in HW/SW Co-Design
Wicaksana, Arief
Charif, Amir
Andriamisaina, Caaliph
Ventroux, Nicolas
2019 CONFERENCE ON DESIGN AND ARCHITECTURES FOR SIGNAL AND IMAGE PROCESSING (DASIP), 2019, : 35 - 40
[28] Energy-efficient crypto acceleration with HW/SW co-design for HTTPS
Xiao, Chunhua
Zhang, Lei
Liu, Weichen
Bergmann, Neil
Xie, Yuhua
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 96 : 336 - 347
[29] Energy Efficient Data Access and Storage through HW/SW Co-design
Guo, Minyi
ACM SIGPLAN NOTICES, 2014, 49 (05) : 83 - 83
[30] FPGA/DNN Co-Design: An Efficient Design Methodology for IoT Intelligence on the Edge
Hao, Cong
Zhang, Xiaofan
Li, Yuhong
Huang, Sitao
Xiong, Jinjun
Rupnow, Kyle
Hwu, Wen-mei
Chen, Deming
PROCEEDINGS OF THE 2019 56TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2019,
