Templatized Fused Vector Floating-Point Dot Product for High-Level Synthesis

Cited by: 3

Authors:

Filippas, Dionysios [1]

Nicopoulos, Chrysostomos [2]

Dimitrakopoulos, Giorgos [1]

Affiliations:

[1] Democritus Univ Thrace, Elect & Comp Engn, Xanthi 67100, Greece

[2] Univ Cyprus, Elect & Comp Engn, CY-1678 Nicosia, Cyprus

Source:

JOURNAL OF LOW POWER ELECTRONICS AND APPLICATIONS | 2022 / Vol. 12 / No. 4

Keywords:

floating point arithmetic; vector dot product; high level synthesis

DOI:

10.3390/jlpea12040056

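Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronic and communication technology]

Discipline classification codes:

0808; 0809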

Abstract:

Machine-learning accelerators rely on floating-point matrix and vector multiplication kernels. To reduce their cost, customized many-term fused architectures are preferred, which improve the latency, power, and area of the designs. In this work, we design a parameterized fused many-term floating-point dot product architecture that is ready for high-level synthesis. In this way, we can exploit the efficiency offered by a well-structured fused dot-product architecture and the freedom offered by high-level synthesis in tuning the design's pipeline to the selected floating-point format and architectural constraints. When compared with optimized dot-product units implemented directly in RTL, the proposed design offers lower-latency implementations under the same clock frequency with marginal area savings. This result holds for a variety of floating-point formats, including standard and reduced-precision representations.
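As a rough illustration of what "fused" means numerically, the following C++ sketch contrasts a chained dot product, which rounds after every multiply and add, with a behavioral model that keeps the products and their sum in a wider type and rounds once at the end. It is not the authors' HLS design: the function names `dot_chained` and `dot_fused` are hypothetical, and `double` merely stands in for the wide internal datapath that a hardware fused unit would use.

```cpp
#include <array>
#include <cstddef>

// Chained model (hypothetical name): every product and every partial sum
// is rounded to float, as when separate multiplier and adder units are cascaded.
template <std::size_t N>
float dot_chained(const std::array<float, N>& a, const std::array<float, N>& b) {
    float acc = 0.0f;
    for (std::size_t i = 0; i < N; ++i) {
        float p = a[i] * b[i];  // product rounded to float
        acc = acc + p;          // partial sum rounded to float
    }
    return acc;
}

// Fused behavioral model (hypothetical name): products and the running sum
// stay in a wider type (double is an assumption standing in for a wide
// internal accumulator); only the final result is rounded to float.
template <std::size_t N>
float dot_fused(const std::array<float, N>& a, const std::array<float, N>& b) {
    double acc = 0.0;
    for (std::size_t i = 0; i < N; ++i) {
        acc += static_cast<double>(a[i]) * static_cast<double>(b[i]);
    }
    return static_cast<float>(acc);  // single rounding at the end
}
```

With `a = {4096, 1, -4096}` and `b = {4096, 1, 4096}`, the exact result is 1. The chained version loses the middle term when it is added to 2^24 in single precision, while the fused model preserves it. The template parameter `N` mirrors the idea of a many-term unit whose width is fixed at compile time.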

Pages: 14