Toward Efficient Co-Design of CNN Quantization and HW Architecture on FPGA Hybrid-Accelerator

Cited by: 0
Authors
Zhang, Yiran [1]
Li, Guiying [1]
Yuan, Bo [1]
Affiliations
[1] Southern Univ Sci & Technol, Guangdong Prov Key Lab Brain Inspired Intelligent, Shenzhen, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
CNN accelerator; FPGA; DSE method;
DOI
10.1109/SEDA62518.2024.10617620
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Field programmable gate arrays (FPGAs) have emerged as a promising platform for accelerating convolutional neural networks (CNNs). In this paper, we propose a low-latency CNN hybrid-accelerator system and an efficient design space exploration (DSE) method. Specifically, our target FPGA platform combines different types of accelerators for two advantages: high concurrency and full utilization of hardware resources (i.e., lookup tables (LUTs) and digital signal processors (DSPs)). In addition, we adopt a bandwidth-aware analytical model of system latency that accounts for pipeline stalls and computation cycles simultaneously. Furthermore, for the huge design space spanning layer-wise CNN quantization and FPGA hybrid-accelerator architecture, we propose a DSE method named DiMEGA, a differentiable method embedded with a genetic algorithm, aimed at improving search efficiency. The performance of our CNN hybrid-accelerator system is demonstrated on a PYNQ-Z2 FPGA platform. The experimental results show that system latency is reduced by 42% to 48% without sacrificing accuracy, and that the DSE time of DiMEGA is reduced by 23% on ResNet20-CIFAR10 and by 63% on ResNet56-CIFAR10, compared with the state of the art (SOTA).
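The abstract does not give the form of the bandwidth-aware latency model. As a rough illustration of the general idea only, the sketch below uses a generic roofline-style estimate in which a layer's latency is the larger of its compute cycles and its data-transfer cycles, with the difference counted as stall cycles. This is not the paper's model: the `ConvLayer` fields, the function names, and the parameters `macs_per_cycle` and `bytes_per_cycle` are all hypothetical.

```python
from dataclasses import dataclass
from math import ceil


@dataclass
class ConvLayer:
    """Hypothetical per-layer description; not taken from the paper."""
    macs: int          # multiply-accumulate operations in the layer
    weight_count: int  # number of weights moved from off-chip memory
    act_count: int     # number of activations moved (inputs plus outputs)
    weight_bits: int   # layer-wise weight bit-width (quantization choice)
    act_bits: int      # layer-wise activation bit-width


def compute_cycles(layer: ConvLayer, macs_per_cycle: int) -> int:
    """Cycles needed if the accelerator were never starved of data."""
    return ceil(layer.macs / macs_per_cycle)


def transfer_cycles(layer: ConvLayer, bytes_per_cycle: int) -> int:
    """Cycles needed to move the layer's data at the given bandwidth."""
    traffic_bits = (layer.weight_count * layer.weight_bits
                    + layer.act_count * layer.act_bits)
    return ceil(traffic_bits / (8 * bytes_per_cycle))


def layer_latency(layer: ConvLayer, macs_per_cycle: int,
                  bytes_per_cycle: int) -> int:
    """Roofline-style estimate: the slower of compute and transfer."""
    return max(compute_cycles(layer, macs_per_cycle),
               transfer_cycles(layer, bytes_per_cycle))


def stall_cycles(layer: ConvLayer, macs_per_cycle: int,
                 bytes_per_cycle: int) -> int:
    """Cycles the compute pipeline waits on memory under this estimate."""
    return (layer_latency(layer, macs_per_cycle, bytes_per_cycle)
            - compute_cycles(layer, macs_per_cycle))


# Same layer at two bit-widths: lower precision reduces memory traffic.
layer_8bit = ConvLayer(macs=1000, weight_count=100, act_count=200,
                       weight_bits=8, act_bits=8)
layer_4bit = ConvLayer(macs=1000, weight_count=100, act_count=200,
                       weight_bits=4, act_bits=4)
```

Under these assumed numbers, the 8-bit layer is bandwidth-bound and the 4-bit layer is compute-bound, which hints at why quantization and hardware parameters might be searched jointly rather than separately.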
Pages: 678-683
Number of pages: 6