Design Optimization for High-Performance Computing Using FPGA

被引:0
|
作者
Isik, Murat [1 ]
Inadagbo, Kayode [2 ]
Aktas, Hakan [3 ]
机构
[1] Drexel Univ, Elect & Comp Engn Dept, Philadelphia, PA 19104 USA
[2] A&M Univ, Elect & Comp Engn Dept, Prairie View, TX USA
[3] Omer Halisdemir Univ, Comp Engn Dept, Nigde, Turkiye
关键词
High-performance computing; Tensil AI; Design optimization; FPGA; Open-source inference accelerator;
D O I
10.1007/978-3-031-63616-5_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reconfigurable architectures like Field Programmable Gate Arrays (FPGAs) have been used for accelerating computations in several domains because of their unique combination of flexibility, performance, and power efficiency. However, FPGAs have not been widely used for high-performance computing, primarily because of their programming complexity and difficulties in optimizing performance. We optimize Tensil AI's open-source inference accelerator for maximum performance using ResNet20 trained on CIFAR in this paper in order to gain insight into the use of FPGAs for high-performance computing. In this paper, we show how improving hardware design, using Xilinx Ultra RAM, and using advanced compiler strategies can lead to improved inference performance. We also demonstrate that running the CIFAR test data set shows very little accuracy drop when rounding down from the original 32bit floating point. The heterogeneous computing model in our platform allows us to achieve a frame rate of 293.58 frames per second (FPS) and a %90 accuracy on a ResNet20 trained using CIFAR. The experimental results show that the proposed accelerator achieves a throughput of 21.12 Giga-Operations Per Second (GOP/s) with a 5.21W on-chip power consumption at 100 MHz. The comparison results with off-the-shelf devices and recent state-of-the-art implementations illustrate that the proposed accelerator has obvious advantages in terms of energy efficiency.
引用
收藏
页码:142 / 156
页数:15
相关论文
共 50 条
  • [41] HIGH-PERFORMANCE COMPUTING MEETS HIGH-PERFORMANCE MEDICINE
    Verma, Anurag
    Huffman, Jennifer
    Torkamani, Ali
    Madduri, Ravi
    BIOCOMPUTING 2023, PSB 2023, 2023, : 541 - 545
  • [42] The Design and Performance of Batched BLAS on Modern High-Performance Computing Systems
    Dongarra, Jack
    Hammarling, Sven
    Higham, Nicholas J.
    Relton, Samuel D.
    Valero-Lara, Pedro
    Zounon, Mawussi
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017), 2017, 108 : 495 - 504
  • [43] Design of a High-Performance Titanium Nitride Metastructure-Based Solar Absorber Using Quantum Computing-Assisted Optimization
    Kim, Seongmin
    Wu, Shiwen
    Jian, Ruda
    Xiong, Guoping
    Luo, Tengfei
    ACS APPLIED MATERIALS & INTERFACES, 2023, 15 (34) : 40606 - 40613
  • [44] Accelerating convolutional neural networks on FPGA platforms: a high-performance design methodology using OpenCL
    Soufien Gdaim
    Abdellatif Mtibaa
    Journal of Real-Time Image Processing, 2025, 22 (2)
  • [45] Structural design of high-performance capacitive accelerometers using parametric optimization with uncertainties
    Teves, Andre da Costa
    de Lima, Cicero Ribeiro
    Passaro, Angelo
    Nelli Silva, Emilio Carlos
    ENGINEERING OPTIMIZATION, 2017, 49 (03) : 365 - 380
  • [46] Optimization of high-performance concrete mix ratio design using machine learning
    Chen, Bin
    Wang, Lei
    Feng, Zongbao
    Liu, Yang
    Wu, Xianguo
    Qin, Yawei
    Xia, Lingyu
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 122
  • [47] Shape analysis using high-performance computing techniques
    Clematis, A
    Coda, A
    Falcidieno, B
    Spagnuolo, M
    SHAPE MODELING INTERNATIONAL '99 - INTERNATIONAL CONFERENCE ON SHAPE MODELING AND APPLICATIONS, PROCEEDINGS, 1999, : 248 - 255
  • [48] A Design for Multi-Pricing High-Performance Computing System
    Chen, Lung-Pin
    Kao, Mike
    Wu, I-Chen
    Wei, Ting-Han
    INTELLIGENT SYSTEMS AND APPLICATIONS (ICS 2014), 2015, 274 : 1733 - 1742
  • [49] Comparison of genomes using high-performance parallel computing
    Almeida, NF
    Alves, CER
    Caceres, EN
    Song, SW
    15TH SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 2003, : 142 - 148
  • [50] Building a high-performance computing cluster using FreeBSD
    Davis, B
    AuYeung, M
    Green, G
    Lee, C
    USENIX ASSOCIATION PROCEEDINGS OF BSDCON '03, 2003, : 35 - 46