A Fused Continuous Floating-Point MAC on FPGA

被引：0

作者：

Yuan, Min ^{[1
]}

Xing, Qianjian ^{[1
]}

Ma, Zhenguo ^{[1
]}

Yu, Feng ^{[1
]}

Xu, Yingke ^{[1
]}

机构：

[1] Zhejiang Univ, Dept Instrument Engn, Hangzhou 310027, Zhejiang, Peoples R China

来源：

IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES | 2018年 / E101A卷 / 09期

关键词：

floating-point multiply-accumulator; fused algorithm; normalization and alignment; MULTIPLY-ADD; REDUCED LATENCY; UNIT; DESIGN;

D O I：

10.1587/transfun.E101.A.1594

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this letter, we present a novel single-precision floating-point multiply-accumulator (FNA-MAC) to achieve lower hardware resource, reduced computing latency and improved computing accuracy for continuous dot product operations. By further fusing the normalization and alignment in the traditional FMA algorithm, the proposed architecture eliminates the first N - 1 normalization and rounding operations for an N point dot product, and preserves the precision of interim results in a significant bit size that is twice of that in the traditional methods. The normalization and rounding of the final result is processed at the cost of consuming an additional multiply-add operation. The simulation results show that the improvement in computational accuracy is significant. Meanwhile, when comparing to a recently published FMA design, the proposed FNA-MAC can reduce the slice look-up table/ flip-flop resource and computing latency by a fact of 18%, 33.3%, respectively.

引用

页码：1594 / 1598

页数：5

共 50 条

[21] Logarithmic number system and floating-point arithmetics on FPGA
Matousek, R
Tichy, M
Pohl, Z
Kadlec, J
Softley, C
Coleman, N
FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS: RECONFIGURABLE COMPUTING IS GOING MAINSTREAM, 2002, 2438 : 627 - 636
[22] FPGA Optimizations for a Pipelined Floating-Point Exponential Unit
Alachiotis, Nikolaos
Stamatakis, Alexandros
RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2011, 6578 : 316 - 327
[23] High throughput floating-point dividers implemented in FPGA
Malik, Peter
2015 IEEE 18TH INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS & SYSTEMS (DDECS 2015), 2015, : 291 - 294
[24] Improved Architectures for a Floating-Point Fused Dot Product Unit
Sohn, Jongwook
Swartzlander, Earl E., Jr.
2013 21ST IEEE SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH), 2013, : 41 - 48
[25] Fused Multiply-Add for Variable Precision Floating-Point
Nannarelli, Alberto
32ND IEEE INTERNATIONAL SYSTEM ON CHIP CONFERENCE (IEEE SOCC 2019), 2019, : 342 - 347
[26] Floating-point fused multiply-add with reduced latency
Lang, T
Bruguera, JD
ICCD'2002: IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN: VLSI IN COMPUTERS AND PROCESSORS, PROCEEDINGS, 2002, : 145 - 150
[27] Floating-point multiply-add-fused with reduced latency
Lang, T
Bruguera, JD
IEEE TRANSACTIONS ON COMPUTERS, 2004, 53 (08) : 988 - 1003
[28] Bridge Floating-Point Fused Multiply-Add Design
Quinnell, Eric
Swartzlander, Earl E., Jr.
Lemonds, Carl
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2008, 16 (12) : 1726 - 1730
[29] A Decimal Floating-Point Fused-Multiply-Add Unit
Samy, Rodina
Fahmy, Hossam A. H.
Raafat, Ramy
Mohamed, Amira
ElDeeb, Tarek
Farouk, Yasmin
53RD IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, 2010, : 529 - 532
[30] Design of floating-point operation based on FPGA and it's application
Cui, Yunjuan
Chen, Baixiao
Zhang, Shouhong
2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 2716 - +

← 1 2 3 4 5 →