Design and Implementation of Convolutional Neural Networks Accelerator Based on Multidie

被引:4
|
作者
Song, Qingzeng [1 ]
Zhang, Jiabing [1 ]
Sun, Liankun [1 ]
Jin, Guanghao [2 ]
机构
[1] Tiangong Univ, Sch Comp Sci & Technol, Tianjin 300387, Peoples R China
[2] Beijing Polytech, Sch Telecommun Engn, Beijing 100176, Peoples R China
来源
IEEE ACCESS | 2022年 / 10卷
基金
中国国家自然科学基金;
关键词
Convolutional neural networks; Quantization (signal); Object detection; Field programmable gate arrays; Mathematical models; Hardware acceleration; Hardware accelerator; multi-die; object detection; YOLOv4-tiny;
D O I
10.1109/ACCESS.2022.3199441
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To achieve real-time object detection tasks with high throughput and low latency, this paper proposes a multi-die hardware accelerator architecture. It implements three accelerators on the VU9P chip, each of which is bound to an independent super logic region (SLR). To reduce off-chip memory access and power consumption, this design uses three on-chip buffers to store the weights and intermediate result data on one hand; on the other hand, it minimizes data access and movement and maximizes data reuse. This design uses an 8-bit quantization strategy for both weights and feature maps to achieve twice the throughput and computational efficiency of a single digital signal processor (DSP). In addition, many operators are designed in the accelerator, and all of them are fully parameterized, so it is easy to extend the network, and the control of the accelerator can be realized by configuring the instruction group. By accelerating the YOLOv4-tiny algorithm, the accelerator architecture can achieve a frame rate of 148.14 frames per second (FPS) and a peak throughput of 2.76 tera operations per second (TOPS) at 200 MHz with an energy efficiency ratio of 93.15 GOPS/W. The code can be found at https://github.com/19801201/Verilog_CNN_Accelerator.
引用
收藏
页码:91497 / 91508
页数:12
相关论文
共 50 条
  • [41] An Efficient Reconfigurable Hardware Accelerator for Convolutional Neural Networks
    Ansari, Anaam
    Gunnam, Kiran
    Ogunfunmi, Tokunbo
    [J]. 2017 FIFTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2017, : 1337 - 1341
  • [43] Snowflake: An Efficient Hardware Accelerator for Convolutional Neural Networks
    Gokhale, Vinayak
    Zaidy, Aliasger
    Chang, Andre Xian Ming
    Culurciello, Eugenio
    [J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2017, : 2082 - 2085
  • [44] Implementation of Data-optimized FPGA-based Accelerator for Convolutional Neural Network
    Cho, Mannhee
    Kim, Youngmin
    [J]. 2020 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2020,
  • [45] FPGA-based Convolutional Neural Network Design and Implementation
    Yan, Ruitao
    Yi, Jianjun
    He, Jie
    Zhao, Yifan
    [J]. 2023 3RD ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS TECHNOLOGY AND COMPUTER SCIENCE, ACCTCS, 2023, : 456 - 460
  • [46] Convolutional Neural Network (CNN) Accelerator Chip Design
    Ma, Xinran
    Zhao, Ruiyong
    Zhou, Jianyang
    [J]. PROCEEDINGS OF 2019 IEEE 13TH INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (IEEE-ASID'2019), 2019, : 211 - 215
  • [47] Design of a Convolutional Neural Network Accelerator Based on On-Chip Data Reordering
    Liu, Yang
    Zhang, Yiheng
    Hao, Xiaoran
    Chen, Lan
    Ni, Mao
    Chen, Ming
    Chen, Rong
    [J]. ELECTRONICS, 2024, 13 (05)
  • [48] Accelerator Design for Vector Quantized Convolutional Neural Network
    Wu, Yi-Heng
    Lee, Heng
    Lin, Yu Sheng
    Chien, Shao-Yi
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2019), 2019, : 46 - 50
  • [49] A Convolutional Neural Network Accelerator Based on NVDLA
    Zhao, Kangjin
    Wang, Jing
    Zang, Di
    [J]. 5TH INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND SYSTEMS, ICACS 2021, 2021, : 43 - 47
  • [50] Design and Implementation of a Robust Convolutional Neural Network-Based Traffic Matrix Estimator for Cloud Networks
    Memon, Rashida Ali
    Qazi, Sameer
    Khan, Bilal Muhammad
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021 (2021):