Efficient FPGA Implementation of Digit Parallel Online Arithmetic Operators

被引:0
|
作者
Shi, Kan [1 ]
Boland, David [2 ]
Constantinides, George A. [1 ]
机构
[1] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England
[2] Monash Univ, Dept Elect & Comp Syst Engn, Clayton, Vic 3800, Australia
基金
英国工程与自然科学研究理事会;
关键词
PERFORMANCE;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Online arithmetic has been widely studied for ASIC implementation. Online components were originally designed to perform computations in digit serial with most significant digit (MSD) first, resulting in the ability to chain arithmetic operators together for low latency. More recently, research has shown that digit parallel online operators can fail more gracefully when operating beyond the deterministic clocking region in comparison to operators with conventional arithmetic. Unfortunately, the utilization of online arithmetic operators in the past has required a large area overhead for FPGA implementation. In this paper, we propose novel approaches to implement the key primitives of online arithmetic, adders and multipliers, efficiently on modern Xilinx FPGAs with 6-input LUTs and carry resources. We demonstrate experimentally that in comparison to a direct RTL synthesis, the proposed architectures achieve slice savings of over 67% and 69%, and speed-ups of over 1.2x and 1.5x for adders and multipliers, respectively. As a result, the area overheads of using online adders and multipliers in place of traditional arithmetic primitives is reduced from 8.41x and 8.11x to 1.88x and 1.84x respectively. Finally, because an online multiplier generates MSDs first, we also demonstrate the method to create an online multiplier with a reduced precision output that is smaller than a traditional multiplier producing the same result. We show that this can lead to silicon area savings of up to 56%.
引用
收藏
页码:115 / 122
页数:8
相关论文
共 50 条
  • [1] An Efficient Implementation of Online Arithmetic
    Zhao, Yiren
    Wickerson, John
    Constantinides, George A.
    2016 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2016, : 69 - 76
  • [2] Efficient FPGA-implementation of two's complement digit-serial/parallel multipliers
    Valls, J
    Boemo, E
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-ANALOG AND DIGITAL SIGNAL PROCESSING, 2003, 50 (06): : 317 - 322
  • [3] Fast and efficient FPGA implementation of connected operators
    Ngan, N.
    Dokladalova, E.
    Akil, M.
    Contou-Carrere, F.
    JOURNAL OF SYSTEMS ARCHITECTURE, 2011, 57 (08) : 778 - 789
  • [4] Energy efficient parallel neuromorphic architectures with approximate arithmetic on FPGA
    Wang, Qian
    Li, Youjie
    Shao, Botang
    Dey, Siddhartha
    Li, Peng
    NEUROCOMPUTING, 2017, 221 : 146 - 158
  • [5] FPGA Implementation of Fully Parallel Distributed Arithmetic Based DCT Architecture
    Kalyani, K.
    Omnath, P. P.
    Priyadarshani, K.
    Srinath, S.
    Rajaram, S.
    2015 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2015,
  • [6] Efficient Parallel Implementation of Morphological Operation on GPU and FPGA
    Li, Teng
    Dou, Yong
    Jiang, Jingfei
    Gao, Jing
    2014 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2014, : 430 - 435
  • [7] ELIMINATING THE NORMALIZATION PROBLEM IN DIGIT ONLINE ARITHMETIC
    ZACCONE, RJ
    BARLOW, JL
    IEEE TRANSACTIONS ON COMPUTERS, 1987, 36 (01) : 36 - 46
  • [8] Low Power FIR Filter implementation on FPGA using Parallel Distributed Arithmetic
    Khan, Shaheen
    Jaffery, Zainul Abdin
    2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
  • [9] Efficient ASIC and FPGA Implementation of Binary-Coded Decimal Digit Multipliers
    Gorgin, Saeid
    Jaberipur, Ghassem
    Asl, Reza Hashemi
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2014, 33 (12) : 3883 - 3899
  • [10] Efficient ASIC and FPGA Implementation of Binary-Coded Decimal Digit Multipliers
    Saeid Gorgin
    Ghassem Jaberipur
    Reza Hashemi Asl
    Circuits, Systems, and Signal Processing, 2014, 33 : 3883 - 3899