Low Latency and Low Error Floating-Point Sine/Cosine Function Based TCORDIC Algorithm

被引:17
|
作者
Zhu, Baozhou [1 ]
Lei, Yuanwu [1 ]
Peng, Yuanxi [1 ]
He, Tingting [1 ]
机构
[1] Natl Univ Def Technol, Sch Comp, Changsha 410073, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
CORDIC; floating-point sine/cosine; low latency; Taylor; RADIX-4 CORDIC ALGORITHM; ARCHITECTURE; GENERATION; PROCESSOR; HARDWARE;
D O I
10.1109/TCSI.2016.2631588
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
CORDIC algorithm is suitable to implement sine/cosine function, but the large number of iterations lead to great delay and overhead. Moreover, due to finite bit-width of operands and number of iterations, the relative error of floating-point sine or cosine is terrible when the input angle is close to 0 or pi/2, respectively. To overcome these short-comings, TCORDIC algorithm, which combines low latency CORDIC and Taylor algorithm, is presented. After analyzing the latency of traditional CORDIC, low latency CORDIC is proposed, which adopts the technique of sign prediction, compressive iterations, and parallel iterations. Besides, the calculating boundary (N), which is used for determining whether Taylor algorithm is selected or not in TCORDIC algorithm, is evaluated to achieve a trade-off between area and delay. Truncated multipliers are used to reduce the area further. Finally, Using TCORDIC algorithm, pipelined and iterative structures are implemented for IEEE-754 double precision floating-point sine/cosine with the input Z epsilon [0, pi/2]. Under typical condition (1V, 25 degrees C), our designs are synthesized with 40 nm standard cell library. For a pipelined structure, the frequency is up to 1.70 GHz and area 194049.64 mu m(2). Frequency decreases to 1.45 GHz for iterative structure, but the area requires only 110590.81 mu m(2). TCORDIC is efficient in controlling relative error, and achieves the accuracy within one ulp (unit in the last place) for floating-point sine/cosine function.
引用
收藏
页码:892 / 905
页数:14
相关论文
共 50 条
  • [31] A Step-Function Abstract Domain for Granular Floating-Point Error Analysis
    Dario, Anthony
    Pollard, Samuel D.
    PROCEEDINGS OF THE 10TH ACM SIGPLAN INTERNATIONAL WORKSHOP ON NUMERICAL AND SYMBOLIC ABSTRACT DOMAINS, NSAD 2024, 2024, : 26 - 33
  • [32] A block floating-point treatment to the LMS algorithm: Efficient realization and a roundoff error analysis
    Mitra, A
    Chakraborty, M
    Sakai, H
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2005, 53 (12) : 4536 - 4544
  • [33] Low power single precision BCD floating-point Vedic multiplier
    Ramya, V.
    Seshasayanan, R.
    MICROPROCESSORS AND MICROSYSTEMS, 2020, 72
  • [34] Virtual Floating-point Units for Low-power Embedded Processors
    Gilani, Syed Zohaib
    Kim, Nam Sung
    Schulte, Michael
    2012 IEEE 23RD INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2012, : 61 - 68
  • [35] A Transprecision Floating-Point Platform for Ultra-Low Power Computing
    Tagliavini, Giuseppe
    Mach, Stefan
    Rossi, Davide
    Marongiu, Andrea
    Benini, Luca
    PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2018, : 1051 - 1056
  • [36] Low power techniques on a high speed floating-point adder design
    Zhang, Ge
    Huang, Kun
    Shen, Haihua
    Zhang, Feng
    2007 IEEE INTERNATIONAL CONFERENCE ON INTEGRATION TECHNOLOGY, PROCEEDINGS, 2007, : 241 - +
  • [37] Low Power Floating-Point Multiplication and Squaring Units with Shared Circuitry
    Moore, Jason
    Thornton, Mitchell A.
    Matula, David W.
    2013 IEEE 56TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2013, : 1395 - 1398
  • [38] A unified reconfigurable floating-point arithmetic architecture based on CORDIC algorithm
    Li, Bingyi
    Fang, Linlin
    Xie, Yizhuang
    Chen, He
    Chen, Liang
    2017 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (ICFPT), 2017, : 301 - 302
  • [39] An FPGA-based low-cost VLIW floating-point processor for CNC applications
    Dong, Jingchuan
    Wang, Taiyong
    Li, Bo
    Liu, Zhe
    Yu, Zhigiang
    MICROPROCESSORS AND MICROSYSTEMS, 2017, 50 : 14 - 25
  • [40] A low-cost floating point vectoring algorithm based on CORDIC
    Lee, JA
    van der Kolk, KJ
    Deprettere, EFA
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2000, E83A (08) : 1654 - 1662