Radix-64 Floating-Point Division and Square Root: Iterative and Pipelined Units

被引：2

作者：

Bruguera, Javier D. ^{[1
]}

机构：

[1] Arm Ltd, Cambridge CB1 9NJ, England

来源：

IEEE TRANSACTIONS ON COMPUTERS | 2023年 / 72卷 / 10期

关键词：

Table lookup; Mathematical models; Low latency communication; Iterative algorithms; Program processors; Hardware; Timing; Digit-recurrence algorithms; floating-point division and square root; iterative methods;

D O I：

10.1109/TC.2023.3280136

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Digit-recurrence algorithms are widely used in actual microprocessors to compute floating-point division and square root. These iterative algorithms present a good trade-off in terms of performance, area and power. Traditionally, commercial processors have iterative division and square root units where the iteration logic is used over several cycles. The main drawbacks of these iterative units are long latency and low throughput due to the reuse of part of the logic over several cycles, and its hardware complexity with separated logic for division and square root. We present a radix-64 floating-point division and square root algorithm with a common iteration for division and square root and where, to have an affordable implementation, each radix-64 iteration is made of two simpler radix-8 iterations. The radix-64 algorithm allows to get low-latency operations, and the common division and square root radix-64 iteration results in some area reduction. The algorithm is mapped into two different microarchitectures: a low-latency and low area iterative unit, and a low-latency and high-throughput pipelined unit. In both units speculation between consecutive radix-8 iterations is used to reduce the timing.

引用

页码：2990 / 3001

页数：12

共 50 条

[31] Area and performance tradeoffs in floating-point divide and square-root implementations
Soderquist, P
Leeser, M
ACM COMPUTING SURVEYS, 1996, 28 (03) : 518 - 564
[32] A Novel and Efficient square root Computation Quantum Circuit for Floating-point Standard
Gayathri S S
R. Kumar
Majid Haghparast
Samiappan Dhanalakshmi
International Journal of Theoretical Physics, 61
[33] Area and performance tradeoffs in floating-point divide and square-root implementations
ACM Comput Surv, 3 (518):
[34] Decimal floating-point square root using Newton-Raphson iteration
Wang, LK
Schulte, MJ
16TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURE AND PROCESSORS, PROCEEDINGS, 2005, : 309 - 315
[35] A Novel and Efficient square root Computation Quantum Circuit for Floating-point Standard
Gayathri, S. S.
Kumar, R.
Haghparast, Majid
Dhanalakshmi, Samiappan
INTERNATIONAL JOURNAL OF THEORETICAL PHYSICS, 2022, 61 (09)
[36] Design and Implementation of Goldschmidts Algorithm for Floating Point Division and Square root
Bhoyar, Rupali
Palsodkar, Prasanna
Kakde, Sandeep
2015 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2015, : 1588 - 1592
[37] DESIGN AND ANALYSIS OF NONBINARY RADIX FLOATING-POINT REPRESENTATIONS
JOHNSTONE, P
PETRY, FE
COMPUTERS & ELECTRICAL ENGINEERING, 1994, 20 (01) : 39 - 50
[38] FLOATING-POINT ERROR PROPAGATION IN ITERATIVE METHODS
ALT, R
MATHEMATICS AND COMPUTERS IN SIMULATION, 1988, 30 (06) : 505 - 517
[39] On the Design of Iterative Approximate Floating-Point Multipliers
Towhidy, Ahmad
Omidi, Reza
Mohammadi, Karim
IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (06) : 1623 - 1635
[40] Experiments for decimal floating-point division by recurrence
Castellanos, Ivan D.
Stine, James E.
2006 FORTIETH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-5, 2006, : 1716 - +

← 1 2 3 4 5 →