AFHRE: An Accurate and Fast Hardware Resources Estimation Method for Convolutional Accelerator with Systolic Array Structure on FPGA

被引:0
|
作者
Wang, Yongchang [1 ]
Zhao, Hongzhi [1 ]
Zhao, Jinyao [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp Sci & Technol, Engn Res Ctr Network Management Technol High Speed, Minist Educ, Beijing 100044, Peoples R China
来源
ELECTRONICS | 2025年 / 14卷 / 01期
关键词
convolutional accelerator; systolic array structure; FPGA; hardware resources estimation; Xilinx Vivado HLS; DEEP NEURAL-NETWORKS;
D O I
10.3390/electronics14010168
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
FPGA-based convolutional accelerators have been widely used in image recognition scenarios. Many convolutional accelerators utilize the systolic array structure to enhance parallelism. Developing a method to efficiently estimate the utilized hardware resources of an FPGA for such a structure would be helpful in improving the speed of achieving an optimal systolic array structure with the best performance on a given FPGA device. Currently, most estimations of work have either focused on the evaluation of hardware resources for general structures or have not adequately assessed hardware resources specifically for systolic arrays. To reduce estimation latency, this paper proposes an Accurate and Fast Hardware Resources Estimation method (AFHRE) that addresses these shortcomings by analyzing the structure of systolic arrays and utilizing mathematical formulas to describe their characteristics. Experiments show that the DSP resource occupancy estimated by AFHRE is fully consistent with that by Vivado HLS. The error rates of other three types of hardware resources (BRAM, LUT, and FF) are within 11%. In addition, the speed of resource estimation using this method is 40X to 610X faster than that of Vivado HLS. AFHRE can serve as a preprocessing step for Vivado HLS, achieving some optimal or sub-optimal solutions systolic array parameters much faster than original simulation manners of Vivado HLS.
引用
收藏
页数:21
相关论文
共 5 条
  • [1] A Fast, Accurate, and Comprehensive PPA Estimation of Convolutional Hardware Accelerators
    Juracy, Leonardo Rezende
    Amory, Alexandre de Morais
    Moraes, Fernando Gehm
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2022, 69 (12) : 5171 - 5184
  • [2] FPGA Architecture for Fast Floating Point Matrix Inversion Using Uni-dimensional Systolic Array Based Structure
    Hnilicka, Ondrej
    PROCEEDINGS OF THE 2013 IEEE 16TH INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS & SYSTEMS (DDECS), 2013, : 267 - 270
  • [3] Peak-to-peak search: fast and accurate DOA estimation method for arbitrary non-uniform linear array
    Wei, Li
    Shao, Wei
    Qi, Wangdong
    Chen, Jianhua
    ELECTRONICS LETTERS, 2015, 51 (25) : 2078 - 2079
  • [4] A Fast and Accurate Human Pose Estimation Method Based on Multi-Scale Feature Fusion Grid Structure
    Li, Qiming
    Wan, Daizong
    Yang, Xiaoyan
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (01)
  • [5] Convolutional Neural Network-based Fast and Accurate Irregular Shape Power/Ground Plane Impedance Estimation Method for High-Speed Signaling
    Kim, Keunwoo
    Lho, Daehwan
    Park, Hyunwook
    Son, Keeyoung
    Kim, Seongguk
    Park, Shnyoung
    Sim, Boogyo
    Kim, Subin
    Kim, Joungho
    2020 IEEE MTT-S INTERNATIONAL CONFERENCE ON NUMERICAL ELECTROMAGNETIC AND MULTIPHYSICS MODELING AND OPTIMIZATION (NEMO 2020), 2020,