A two-item floating point fused dot-product unit with latency reduced

被引:0
|
作者
Wang, Mingjiang [1 ]
Liu, De [1 ]
Liu, Ming [1 ]
Zhao, Boya [1 ]
机构
[1] Harbin Inst Technol, Shenzhen Grad Sch, Shenzhen 518055, Guangdong, Peoples R China
来源
IEICE ELECTRONICS EXPRESS | 2016年 / 13卷 / 23期
关键词
floating point arithmetic; fused dot-product unit; fused FFT-butterfly unit; ADD-SUBTRACT UNIT; IMPROVED ARCHITECTURES;
D O I
10.1587/elex.13.20160937
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a floating point fused dot-product (FDP) unit with latency reduced. The proposed FDP unit performs the dot-product operation of four floating point numbers: ab +/- cd and is implemented with dual-path algorithm. The proposed FDP is modeled in Verilog-HDL and synthesized using TSMC 65 nm technology library. Synthesis results show that our proposed FDP unit is 24 similar to 30% faster and 36.4% less area than the fastest FDP in previous work. We also use the proposed FDP unit and our previously designed FAS (fused add-subtract) unit to implement a FFT Radix-2 Butterfly (R2BF) unit. The latency of our proposed R2BF unit is improved roughly by 34% and the area is reduced by 41.6%, compared to the fastest 2' s-complement butterfly unit.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] A Floating-Point Fused Dot-Product Unit
    Saleh, Hani H.
    Swartzlander, Earl E., Jr.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, 2008, : 427 - +
  • [2] UNIVERSAL FUSED FLOATING-POINT DOT-PRODUCT UNIT (UFDP)
    Saleh, Hani H.
    Mohammad, Baker S.
    [J]. 2013 8TH INTERNATIONAL DESIGN AND TEST SYMPOSIUM (IDT), 2013,
  • [3] A Fused Hybrid Floating-Point and Fixed-Point Dot-Product for FPGAs
    Lopes, Antonio Roldao
    Constantinides, George A.
    [J]. RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2010, 5992 : 157 - 168
  • [4] Improved Architectures for a Floating-Point Fused Dot Product Unit
    Sohn, Jongwook
    Swartzlander, Earl E., Jr.
    [J]. 2013 21ST IEEE SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH), 2013, : 41 - 48
  • [5] A Fused Floating-Point Four-Term Dot Product Unit
    Sohn, Jongwook
    Swartzlander, Earl E., Jr.
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2016, 63 (03) : 370 - 378
  • [6] A Low Complexity Floating-Point Complex Multiplier with a Three-term Dot-Product Unit
    Yun, Sangho
    Sobelman, Gerald. E.
    Zhou, Xiaofang
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2014, : 549 - 552
  • [7] Optimized Fused Floating-Point Many-Term Dot-Product Hardware for Machine Learning Accelerators
    Kaul, Himanshu
    Anders, Mark
    Mathew, Sanu
    Kim, Seongjong
    Krishnamurthy, Ram
    [J]. 2019 IEEE 26TH SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH), 2019, : 84 - 87
  • [8] A High Speed Floating Point Dot Product Unit
    Gupta, Akash Kumar
    Biswal, Birendra
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON ISSUES AND CHALLENGES IN INTELLIGENT COMPUTING TECHNIQUES (ICICT), 2014, : 314 - 318
  • [9] A Floating-Point Unit for 4D Vector Inner Product with Reduced Latency
    Kim, Donghyun
    Kim, Lee-Sup
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2009, 58 (07) : 890 - 901
  • [10] Floating-point fused multiply-add with reduced latency
    Lang, T
    Bruguera, JD
    [J]. ICCD'2002: IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN: VLSI IN COMPUTERS AND PROCESSORS, PROCEEDINGS, 2002, : 145 - 150