An FPGA based implementation of the Conjugate Gradient Kernels

Authors
Naher, Jannatun [1]
Sakib, Ahmad Sadman [2]
Adhav, Shrikant S. T. [1]
Gloster, Clay [1]
Doss, Christopher C. [1]
Affiliations
[1] North Carolina A&T State Univ, Elect & Comp Engn Dept, Greensboro, NC 27411 USA
[2] Khulna Univ Engn & Technol, Elect & Elect Engn Dept, Khulna, Bangladesh
Keywords
Conjugate Gradient; FPGAs; IP core
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The Conjugate Gradient (CG) method is a frequently used iterative method for solving Systems of Linear Equations (SLEs). CG offers a fast convergence rate and high accuracy. It is widely used in scientific applications such as meteorology, groundwater flow problems, satellite data analysis, ocean circulation modeling, molecular dynamics simulations, real-time power quality assessment, and neural robot controllers. It can be implemented on CPUs, GPUs, and Field Programmable Gate Arrays (FPGAs). FPGAs have been shown to provide an order-of-magnitude speedup for various computation-intensive applications. However, a Hardware Description Language (HDL) based FPGA implementation of all the arithmetic modules requires considerable development time, and the designer needs to be knowledgeable in both hardware design and HDL programming. Using IP cores can reduce the development time and design complexity. CG has three basic computational kernels, of which Matrix-Vector Multiplication (MVM) is the most computationally intensive. Optimizing the MVM kernel for higher throughput can reduce the computation time required for each iteration of CG. In this research, the three basic kernels of CG are implemented on FPGAs using floating-point IP cores. The results show that an FPGA-based implementation of CG on an Arria 10 1150 GX achieves an order-of-magnitude speedup over a software implementation of CG running on an Intel Xeon(R) CPU E5-2650 v2 at 2.60 GHz.
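To make the kernel decomposition concrete, the following is a minimal software sketch of the textbook CG iteration for a symmetric positive-definite matrix. The abstract names only MVM explicitly; taking the other two kernels to be a dot product and a scaled vector update (axpy) is an assumption based on the standard formulation of CG, and the function names `mvm`, `dot`, `axpy`, and `conjugate_gradient` are hypothetical. It is not the authors' FPGA design.

```python
# Illustrative sketch only: textbook CG in pure Python, split into three
# kernels. The kernel split (MVM, dot product, axpy) is an assumption; the
# abstract only names MVM. A must be symmetric positive-definite.

def mvm(A, x):
    """Matrix-vector multiplication: the most expensive kernel, O(n^2)."""
    return [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) for row in A]

def dot(u, v):
    """Dot product of two vectors, O(n)."""
    return sum(u_i * v_i for u_i, v_i in zip(u, v))

def axpy(alpha, x, y):
    """Scaled vector update: returns alpha * x + y, O(n)."""
    return [alpha * x_i + y_i for x_i, y_i in zip(x, y)]

def conjugate_gradient(A, b, tol=1e-10, max_iter=1000):
    """Solve A x = b, starting from x = 0."""
    x = [0.0] * len(b)
    r = list(b)              # residual r = b - A x, with x = 0
    p = list(r)              # initial search direction
    rs_old = dot(r, r)
    for _ in range(max_iter):
        if rs_old ** 0.5 < tol:          # residual norm small enough
            break
        Ap = mvm(A, p)                   # one MVM per iteration
        alpha = rs_old / dot(p, Ap)      # step length
        x = axpy(alpha, p, x)            # x <- x + alpha * p
        r = axpy(-alpha, Ap, r)          # r <- r - alpha * A p
        rs_new = dot(r, r)
        p = axpy(rs_new / rs_old, p, r)  # p <- r + beta * p
        rs_old = rs_new
    return x

if __name__ == "__main__":
    A = [[4.0, 1.0], [1.0, 3.0]]
    b = [1.0, 2.0]
    print(conjugate_gradient(A, b))
```

Each iteration performs one MVM, two dot products, and three axpy updates, which is consistent with the abstract's point that the MVM kernel dominates the per-iteration cost.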
Pages: 6