Design Optimization for High-Performance Computing Using FPGA

被引:0
|
作者
Isik, Murat [1 ]
Inadagbo, Kayode [2 ]
Aktas, Hakan [3 ]
机构
[1] Drexel Univ, Elect & Comp Engn Dept, Philadelphia, PA 19104 USA
[2] A&M Univ, Elect & Comp Engn Dept, Prairie View, TX USA
[3] Omer Halisdemir Univ, Comp Engn Dept, Nigde, Turkiye
关键词
High-performance computing; Tensil AI; Design optimization; FPGA; Open-source inference accelerator;
D O I
10.1007/978-3-031-63616-5_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reconfigurable architectures like Field Programmable Gate Arrays (FPGAs) have been used for accelerating computations in several domains because of their unique combination of flexibility, performance, and power efficiency. However, FPGAs have not been widely used for high-performance computing, primarily because of their programming complexity and difficulties in optimizing performance. We optimize Tensil AI's open-source inference accelerator for maximum performance using ResNet20 trained on CIFAR in this paper in order to gain insight into the use of FPGAs for high-performance computing. In this paper, we show how improving hardware design, using Xilinx Ultra RAM, and using advanced compiler strategies can lead to improved inference performance. We also demonstrate that running the CIFAR test data set shows very little accuracy drop when rounding down from the original 32bit floating point. The heterogeneous computing model in our platform allows us to achieve a frame rate of 293.58 frames per second (FPS) and a %90 accuracy on a ResNet20 trained using CIFAR. The experimental results show that the proposed accelerator achieves a throughput of 21.12 Giga-Operations Per Second (GOP/s) with a 5.21W on-chip power consumption at 100 MHz. The comparison results with off-the-shelf devices and recent state-of-the-art implementations illustrate that the proposed accelerator has obvious advantages in terms of energy efficiency.
引用
收藏
页码:142 / 156
页数:15
相关论文
共 50 条
  • [21] Janus: An FPGA-Based System for High-Performance Scientific Computing
    Belletti, F.
    Guidetti, M.
    Maiorano, A.
    Mantovani, F.
    Schifano, S. F.
    Tripiccione, R.
    Cotallo, M.
    Perez-Gaviro, S.
    Sciretti, D.
    Velasco, J. L.
    Cruz, A.
    Navarro, D.
    Tarancon, A.
    Fernandez, L. A.
    Martin-Mayor, V.
    Munoz-Sudupe, A.
    Yllanes, D.
    Gordillo-Guerrero, A.
    Ruiz-Lorenzo, J. J.
    Marinari, E.
    Parisi, G.
    Rossi, M.
    Zanier, G.
    COMPUTING IN SCIENCE & ENGINEERING, 2009, 11 (01) : 48 - 58
  • [22] Modular High-Performance Computing Using Chiplets
    Vinnakota, Bapi
    Shalf, John M.
    COMPUTING IN SCIENCE & ENGINEERING, 2023, 25 (06) : 39 - 48
  • [23] High-performance computing using a reconfigurable accelerator
    Hartenstein, RW
    Becker, J
    Kress, R
    Reinig, H
    CONCURRENCY-PRACTICE AND EXPERIENCE, 1996, 8 (06): : 429 - 443
  • [24] Using Jini for high-performance network computing
    Mahmoud, QH
    INTERNATIONAL CONFERENCE ON PARALLEL COMPUTING IN ELECTRICAL ENGINEERING - PARELEC 2000, PROCEEDINGS, 2000, : 244 - 247
  • [25] High-performance supersonic missile inlet design using automated optimization
    Zha, GC
    Smith, D
    Schwabacher, M
    Rasheed, K
    Gelsey, A
    Knight, D
    Haas, M
    JOURNAL OF AIRCRAFT, 1997, 34 (06): : 697 - 705
  • [26] Optimization for Mix Design of High-Performance Concrete Using Orthogonal Test
    Li Xiaoyong
    Ma Wendi
    INNOVATIVE COMPUTING AND INFORMATION, PT II, 2011, 232 : 364 - 372
  • [27] High-performance computing in accelerating structure design and analysis
    Li, ZH
    Folwell, N
    Ge, LX
    Guetz, A
    Ivanov, V
    Kowalski, M
    Lee, LQ
    Ng, CK
    Schussman, G
    Stingelin, L
    Uplenchwar, R
    Wolf, M
    Xiao, LL
    Ko, K
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2006, 558 (01): : 168 - 174
  • [28] High-Performance Computing
    Bungartz, Hans-Joachim
    IT-INFORMATION TECHNOLOGY, 2013, 55 (03): : 83 - 85
  • [29] Design and Implementation of High-Performance Space Router Based on FPGA
    Zhou, Dong
    Shen, Xiaohu
    Li, Ke
    Feng, Guoping
    Wang, Luyuan
    2019 IEEE 11TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2019), 2019, : 704 - 708