Low Latency and Low Error Floating-Point Sine/Cosine Function Based TCORDIC Algorithm

被引：17

作者：

Zhu, Baozhou ^{[1
]}

Lei, Yuanwu ^{[1
]}

Peng, Yuanxi ^{[1
]}

He, Tingting ^{[1
]}

机构：

[1] Natl Univ Def Technol, Sch Comp, Changsha 410073, Hunan, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS | 2017年 / 64卷 / 04期

基金：

中国国家自然科学基金;

关键词：

CORDIC; floating-point sine/cosine; low latency; Taylor; RADIX-4 CORDIC ALGORITHM; ARCHITECTURE; GENERATION; PROCESSOR; HARDWARE;

D O I：

10.1109/TCSI.2016.2631588

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

CORDIC algorithm is suitable to implement sine/cosine function, but the large number of iterations lead to great delay and overhead. Moreover, due to finite bit-width of operands and number of iterations, the relative error of floating-point sine or cosine is terrible when the input angle is close to 0 or pi/2, respectively. To overcome these short-comings, TCORDIC algorithm, which combines low latency CORDIC and Taylor algorithm, is presented. After analyzing the latency of traditional CORDIC, low latency CORDIC is proposed, which adopts the technique of sign prediction, compressive iterations, and parallel iterations. Besides, the calculating boundary (N), which is used for determining whether Taylor algorithm is selected or not in TCORDIC algorithm, is evaluated to achieve a trade-off between area and delay. Truncated multipliers are used to reduce the area further. Finally, Using TCORDIC algorithm, pipelined and iterative structures are implemented for IEEE-754 double precision floating-point sine/cosine with the input Z epsilon [0, pi/2]. Under typical condition (1V, 25 degrees C), our designs are synthesized with 40 nm standard cell library. For a pipelined structure, the frequency is up to 1.70 GHz and area 194049.64 mu m(2). Frequency decreases to 1.45 GHz for iterative structure, but the area requires only 110590.81 mu m(2). TCORDIC is efficient in controlling relative error, and achieves the accuracy within one ulp (unit in the last place) for floating-point sine/cosine function.

引用

页码：892 / 905

页数：14

共 50 条

[31] A Step-Function Abstract Domain for Granular Floating-Point Error Analysis
Dario, Anthony
Pollard, Samuel D.
PROCEEDINGS OF THE 10TH ACM SIGPLAN INTERNATIONAL WORKSHOP ON NUMERICAL AND SYMBOLIC ABSTRACT DOMAINS, NSAD 2024, 2024, : 26 - 33
[32] A block floating-point treatment to the LMS algorithm: Efficient realization and a roundoff error analysis
Mitra, A
Chakraborty, M
Sakai, H
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2005, 53 (12) : 4536 - 4544
[33] Low power single precision BCD floating-point Vedic multiplier
Ramya, V.
Seshasayanan, R.
MICROPROCESSORS AND MICROSYSTEMS, 2020, 72
[34] Virtual Floating-point Units for Low-power Embedded Processors
Gilani, Syed Zohaib
Kim, Nam Sung
Schulte, Michael
2012 IEEE 23RD INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2012, : 61 - 68
[35] A Transprecision Floating-Point Platform for Ultra-Low Power Computing
Tagliavini, Giuseppe
Mach, Stefan
Rossi, Davide
Marongiu, Andrea
Benini, Luca
PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2018, : 1051 - 1056
[36] Low power techniques on a high speed floating-point adder design
Zhang, Ge
Huang, Kun
Shen, Haihua
Zhang, Feng
2007 IEEE INTERNATIONAL CONFERENCE ON INTEGRATION TECHNOLOGY, PROCEEDINGS, 2007, : 241 - +
[37] Low Power Floating-Point Multiplication and Squaring Units with Shared Circuitry
Moore, Jason
Thornton, Mitchell A.
Matula, David W.
2013 IEEE 56TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2013, : 1395 - 1398
[38] A unified reconfigurable floating-point arithmetic architecture based on CORDIC algorithm
Li, Bingyi
Fang, Linlin
Xie, Yizhuang
Chen, He
Chen, Liang
2017 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (ICFPT), 2017, : 301 - 302
[39] An FPGA-based low-cost VLIW floating-point processor for CNC applications
Dong, Jingchuan
Wang, Taiyong
Li, Bo
Liu, Zhe
Yu, Zhigiang
MICROPROCESSORS AND MICROSYSTEMS, 2017, 50 : 14 - 25
[40] A low-cost floating point vectoring algorithm based on CORDIC
Lee, JA
van der Kolk, KJ
Deprettere, EFA
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2000, E83A (08) : 1654 - 1662

← 1 2 3 4 5 →