High performance floating-point unit with 116 bit wide divider

被引:21
|
作者
Gerwig, G [1 ]
Wetter, H [1 ]
Schwarz, EM [1 ]
Haess, J [1 ]
机构
[1] IBM Corp, Server Div, Armonk, NY 10504 USA
关键词
D O I
10.1109/ARITH.2003.1207664
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The next generation zSeries floating-point unit is unveiled which is the first IBM mainframe with a fused multiply-add dataflow. It supports both S/390 hexadecimal floating-point architecture and the IEEE 754 binary floating-point architecture which was first implemented in S/390 on the 1998 S/390 G5 floating-point unit. The new floating-point unit supports a total of 6 formats including single, double, and quadword formats implemented in hardware. The floating-point pipeline is 5 cycles with a throughput of 1 multiply-add per cycle. Both hexadecimal and binary floating-point instructions are capable of this performance due to a novel way of handling both formats. Other key developments include new methods for handling denormalized numbers and quad precision divide engine dataflow. This divider uses a radix-4 SRT algorithm and is able to handle quad precision divides in multiple floating-point and fixed-point formats. The number of iterations for fixed-point divisions depend on the effective number of quotient bits. It uses a reduced carry-save form for the partial remainder, with only 1 carry bit for every 4 sum bits, to save area and power.
引用
收藏
页码:87 / 94
页数:8
相关论文
共 50 条
  • [1] Floating-point unit in standard cell design with 116 bit wide dataflow
    IBM Deutschland Entwicklung GmbH, Boeblingen, Germany
    Proc Symp Comput Arith, (266-273):
  • [2] Floating-point unit in standard cell design with 116 bit wide dataflow
    Gerwig, G
    Kroener, M
    14TH IEEE SYMPOSIUM ON COMPUTER ARITHMETIC, PROCEEDINGS, 1999, : 266 - 273
  • [3] A combined interval and floating-point divider
    Stine, JE
    Schulte, MJ
    CONFERENCE RECORD OF THE THIRTY-SECOND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 1998, : 218 - 222
  • [4] VLSI implementation of a floating-point divider
    Patel, JN
    Abid, Z
    Wang, W
    16TH INTERNATIONAL CONFERENCE ON MICROELECTRONICS, PROCEEDINGS, 2004, : 505 - 508
  • [5] Floating-point divider design for FPGAs
    Hemmert, K. Scott
    Underwood, Keith D.
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2007, 15 (01) : 115 - 118
  • [6] 32-bit logarithmic arithmetic unit and its performance compared to floating-point
    The University, Newcastle upon Tyne, United Kingdom
    Proc Symp Comput Arith, (142-151):
  • [7] Hardware Implementation of 24-bit Vedic Multiplier in 32-bit Floating-Point Divider
    Hanuman, C. R. S.
    Kamala, J.
    2018 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS AND SYSTEM ENGINEERING (ICEESE), 2018, : 60 - 64
  • [8] Radix-64 Floating-Point Divider
    Bruguera, Javier D.
    2018 IEEE 25TH SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH), 2018, : 84 - 91
  • [9] Design and Implementation of a High-performance 64-bit Floating-point Reciprocal and Square Root Reciprocal Unit
    Feng, Chaochao
    Li, Shaoqing
    Zhang, Minxuan
    2008 9TH INTERNATIONAL CONFERENCE ON SOLID-STATE AND INTEGRATED-CIRCUIT TECHNOLOGY, VOLS 1-4, 2008, : 1843 - 1846
  • [10] Implementation of Vector Floating-point processing unit on FPGAs for high performance computing
    Chen, Shi
    Venkatesan, Ramachandran
    Gillard, Paul
    2008 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-4, 2008, : 840 - 844