A Fused Continuous Floating-Point MAC on FPGA

Cited by: 0
Authors
Yuan, Min [1]
Xing, Qianjian [1]
Ma, Zhenguo [1]
Yu, Feng [1]
Xu, Yingke [1]
Affiliations
[1] Zhejiang Univ, Dept Instrument Engn, Hangzhou 310027, Zhejiang, Peoples R China
Keywords
floating-point multiply-accumulator; fused algorithm; normalization and alignment; MULTIPLY-ADD; REDUCED LATENCY; UNIT; DESIGN;
DOI
10.1587/transfun.E101.A.1594
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
In this letter, we present a novel single-precision floating-point multiply-accumulator (FNA-MAC) that achieves lower hardware resource usage, reduced computing latency and improved computing accuracy for continuous dot product operations. By further fusing the normalization and alignment steps of the traditional FMA algorithm, the proposed architecture eliminates the first N - 1 normalization and rounding operations for an N-point dot product, and preserves the precision of interim results with a significant bit width that is twice that of traditional methods. The normalization and rounding of the final result are performed at the cost of one additional multiply-add operation. The simulation results show a significant improvement in computational accuracy. Meanwhile, compared with a recently published FMA design, the proposed FNA-MAC reduces the slice look-up table/flip-flop resources and computing latency by 18% and 33.3%, respectively.
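The accuracy argument in the abstract (dropping the first N - 1 rounding steps and keeping interim results at twice the width) can be illustrated with a small software analogy. The sketch below is not the authors' hardware architecture: it assumes binary64 as a stand-in for the wider interim format, and the helper names `to_f32`, `dot_round_each_step` and `dot_round_once` are hypothetical. It only contrasts rounding to single precision after every multiply-add with rounding once at the end.

```python
import struct


def to_f32(x: float) -> float:
    """Round a Python float (binary64) to the nearest binary32 value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def dot_round_each_step(a, b):
    """Baseline analogy: round the product and the running sum to binary32
    after every multiply-add, as a chain of separate rounded operations would."""
    acc = 0.0
    for x, y in zip(a, b):
        acc = to_f32(acc + to_f32(x * y))
    return acc


def dot_round_once(a, b):
    """Deferred-rounding analogy (assumption: binary64 plays the role of the
    wider interim format). The product of two binary32 values needs at most
    48 significand bits, so it is exact in binary64; only the final result
    is rounded to binary32."""
    acc = 0.0
    for x, y in zip(a, b):
        acc += x * y
    return to_f32(acc)


# Example input: one large term followed by many terms that are each
# smaller than half a binary32 ulp of the running sum.
a = [1.0] + [2.0 ** -25] * 64
b = [1.0] * 65

per_step = dot_round_each_step(a, b)  # small terms are rounded away one by one
deferred = dot_round_once(a, b)       # small terms survive until the final rounding
```

For this input the exact dot product is 1 + 2^-19. Per-step rounding discards every small term, while the deferred version retains them, which is the kind of interim precision loss that skipping the intermediate rounding steps avoids.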
Pages: 1594 - 1598
Number of pages: 5
Related papers
50 in total
  • [21] Logarithmic number system and floating-point arithmetics on FPGA
    Matousek, R
    Tichy, M
    Pohl, Z
    Kadlec, J
    Softley, C
    Coleman, N
    FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS: RECONFIGURABLE COMPUTING IS GOING MAINSTREAM, 2002, 2438 : 627 - 636
  • [22] FPGA Optimizations for a Pipelined Floating-Point Exponential Unit
    Alachiotis, Nikolaos
    Stamatakis, Alexandros
    RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2011, 6578 : 316 - 327
  • [23] High throughput floating-point dividers implemented in FPGA
    Malik, Peter
    2015 IEEE 18TH INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS & SYSTEMS (DDECS 2015), 2015, : 291 - 294
  • [24] Improved Architectures for a Floating-Point Fused Dot Product Unit
    Sohn, Jongwook
    Swartzlander, Earl E., Jr.
    2013 21ST IEEE SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH), 2013, : 41 - 48
  • [25] Fused Multiply-Add for Variable Precision Floating-Point
    Nannarelli, Alberto
    32ND IEEE INTERNATIONAL SYSTEM ON CHIP CONFERENCE (IEEE SOCC 2019), 2019, : 342 - 347
  • [26] Floating-point fused multiply-add with reduced latency
    Lang, T
    Bruguera, JD
    ICCD'2002: IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN: VLSI IN COMPUTERS AND PROCESSORS, PROCEEDINGS, 2002, : 145 - 150
  • [27] Floating-point multiply-add-fused with reduced latency
    Lang, T
    Bruguera, JD
    IEEE TRANSACTIONS ON COMPUTERS, 2004, 53 (08) : 988 - 1003
  • [28] Bridge Floating-Point Fused Multiply-Add Design
    Quinnell, Eric
    Swartzlander, Earl E., Jr.
    Lemonds, Carl
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2008, 16 (12) : 1726 - 1730
  • [29] A Decimal Floating-Point Fused-Multiply-Add Unit
    Samy, Rodina
    Fahmy, Hossam A. H.
    Raafat, Ramy
    Mohamed, Amira
    ElDeeb, Tarek
    Farouk, Yasmin
    53RD IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, 2010, : 529 - 532
  • [30] Design of floating-point operation based on FPGA and it's application
    Cui, Yunjuan
    Chen, Baixiao
    Zhang, Shouhong
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 2716 - +