Templatized Fused Vector Floating-Point Dot Product for High-Level Synthesis

被引:3
|
作者
Filippas, Dionysios [1 ]
Nicopoulos, Chrysostomos [2 ]
Dimitrakopoulos, Giorgos [1 ]
机构
[1] Democritus Univ Thrace, Elect & Comp Engn, Xanthi 67100, Greece
[2] Univ Cyprus, Elect & Comp Engn, CY-1678 Nicosia, Cyprus
关键词
floating point arithmetic; vector dot product; high level synthesis;
D O I
10.3390/jlpea12040056
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Machine-learning accelerators rely on floating-point matrix and vector multiplication kernels. To reduce their cost, customized many-term fused architectures are preferred, which improve the latency, power, and area of the designs. In this work, we design a parameterized fused many-term floating-point dot product architecture that is ready for high-level synthesis. In this way, we can exploit the efficiency offered by a well-structured fused dot-product architecture and the freedom offered by high-level synthesis in tuning the design's pipeline to the selected floating-point format and architectural constraints. When compared with optimized dot-product units implemented directly in RTL, the proposed design offers lower-latency implementations under the same clock frequency with marginal area savings. This result holds for a variety of floating-point formats, including standard and reduced-precision representations.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Floating-point fused multiply-add architectures
    Quinnell, Eric
    Swartzlander, Earl E., Jr.
    Lemonds, Carl
    CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, 2007, : 331 - +
  • [42] Area optimization of combined integer and floating point circuits in high-level synthesis
    Andres, Esther
    Molina, Maria C.
    Botella, Guillermo
    del Barrio, Alberto
    Mendias, Jose M.
    2008 4TH SOUTHERN CONFERENCE ON PROGRAMMABLE LOGIC, PROCEEDINGS, 2008, : 229 - 232
  • [43] A Fused Floating-Point Three-Term Adder
    Sohn, Jongwook
    Swartzlander, Earl E., Jr.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2014, 61 (10) : 2842 - 2850
  • [44] SUPERSCALAR FLOATING-POINT VECTOR COMPUTATION IN SCHEME
    SRINIVAS, S
    DYBVIG, K
    LECTURE NOTES IN COMPUTER SCIENCE, 1992, 634 : 811 - 812
  • [45] A Floating-Point Fused Add-Subtract Unit
    Saleh, Hani
    Swartzlander, Earl E., Jr.
    2008 51ST MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1 AND 2, 2008, : 519 - +
  • [46] A Floating-Point Unit for 4D Vector Inner Product with Reduced Latency
    Kim, Donghyun
    Kim, Lee-Sup
    IEEE TRANSACTIONS ON COMPUTERS, 2009, 58 (07) : 890 - 901
  • [47] Exact Dot Product Accumulate Operators for 8-bit Floating-Point Deep Learning
    Desrentes, Oregane
    de Dinechin, Benoit Dupont
    Le Maire, Julien
    2023 26TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN, DSD 2023, 2023, : 642 - 649
  • [48] Synthesis of Rigorous Floating-Point Predicates
    Thanh Son Nguyen
    Jones, Ben
    Rakamari, Zvonimir
    MODEL CHECKING SOFTWARE, SPIN 2022, 2022, 13255 : 44 - 60
  • [49] Implementation of Vector Floating-point processing unit on FPGAs for high performance computing
    Chen, Shi
    Venkatesan, Ramachandran
    Gillard, Paul
    2008 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-4, 2008, : 840 - 844
  • [50] Fused Multiply-Add for Variable Precision Floating-Point
    Nannarelli, Alberto
    32ND IEEE INTERNATIONAL SYSTEM ON CHIP CONFERENCE (IEEE SOCC 2019), 2019, : 342 - 347