High performance floating-point unit with 116 bit wide divider

Cited by: 21

Authors:

Gerwig, G [1]

Wetter, H [1]

Schwarz, EM [1]

Haess, J [1]

Affiliation:

[1] IBM Corp, Server Div, Armonk, NY 10504 USA

Source:

16TH IEEE SYMPOSIUM ON COMPUTER ARITHMETIC, PROCEEDINGS | 2003

DOI:

10.1109/ARITH.2003.1207664

Chinese Library Classification:

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

The next-generation zSeries floating-point unit is unveiled; it is the first floating-point unit in an IBM mainframe with a fused multiply-add dataflow. It supports both the S/390 hexadecimal floating-point architecture and the IEEE 754 binary floating-point architecture, which was first implemented in S/390 on the 1998 S/390 G5 floating-point unit. The new floating-point unit supports a total of 6 formats, including single, double, and quadword formats, implemented in hardware. The floating-point pipeline is 5 cycles deep with a throughput of 1 multiply-add per cycle. Both hexadecimal and binary floating-point instructions are capable of this performance due to a novel way of handling both formats. Other key developments include new methods for handling denormalized numbers and a quad-precision divide engine dataflow. The divider uses a radix-4 SRT algorithm and is able to handle quad-precision divides in multiple floating-point and fixed-point formats. The number of iterations for fixed-point divisions depends on the effective number of quotient bits. The divider uses a reduced carry-save form for the partial remainder, with only 1 carry bit for every 4 sum bits, to save area and power.
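
The radix-4 digit recurrence named in the abstract can be illustrated with a minimal Python sketch. This is not the paper's hardware design: the function name `srt4_divide`, its parameters, and the exact-arithmetic digit selection are assumptions made for illustration. A hardware SRT divider would instead select each digit from a small table indexed by truncated estimates of the carry-save partial remainder and the divisor. The sketch only shows the recurrence w[j+1] = 4*w[j] - q[j+1]*d with the redundant digit set {-2, ..., 2}, which retires 2 quotient bits per iteration.

```python
from fractions import Fraction


def srt4_divide(x, d, digits):
    """Approximate x/d with a radix-4 digit recurrence (illustrative sketch).

    Assumes integers with 0 <= x < d. Returns (digit_list, approximation,
    final_remainder). Digit selection here is exact; real SRT hardware uses
    a lookup table on truncated remainder and divisor estimates instead.
    """
    if d <= 0 or not 0 <= x < d:
        raise ValueError("this sketch requires 0 <= x < d")
    d = Fraction(d)
    w = Fraction(x, 4)          # w[0] = x/4 guarantees |w[0]| <= (2/3) * d
    q = Fraction(0)
    digit_list = []
    for j in range(1, digits + 1):
        shifted = 4 * w         # shift the partial remainder left by 2 bits
        # Pick the nearest digit, clamped to the redundant set {-2, ..., 2}.
        digit = max(-2, min(2, round(shifted / d)))
        w = shifted - digit * d  # invariant: |w| <= (2/3) * d
        digit_list.append(digit)
        q += Fraction(digit, 4 ** j)
    # x/d == 4*q + 4*w / (d * 4**digits); the second term is the error.
    return digit_list, 4 * q, w
```

For example, `srt4_divide(1, 3, 4)` yields the digits `[0, 1, 1, 1]` and the approximation `Fraction(21, 64)`. Each additional iteration shrinks the error bound by a factor of 4, which is why an operand with fewer effective quotient bits needs fewer iterations.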

Pages: 87-94

Page count: 8

