An FPGA based implementation of the Conjugate Gradient Kernels

Cited by: 0
Authors
Naher, Jannatun [1]
Sakib, Ahmad Sadman [2]
Adhav, Shrikant S. T. [1]
Gloster, Clay [1]
Doss, Christopher C. [1]
Affiliations
[1] North Carolina A&T State Univ, Elect & Comp Engn Dept, Greensboro, NC 27411 USA
[2] Khulna Univ Engn & Technol, Elect & Elect Engn Dept, Khulna, Bangladesh
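Keywords
Conjugate Gradient; FPGAs; IP core
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract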
The Conjugate Gradient (CG) method is a frequently used iterative method for solving Systems of Linear Equations (SLEs). CG offers a fast convergence rate and high accuracy. It is widely used in scientific applications such as meteorology, groundwater flow problems, the study of satellite data, ocean circulation modeling, molecular dynamics simulations, real-time power quality assessment, and neural robot controllers. It can be implemented on CPUs, GPUs, and Field Programmable Gate Arrays (FPGAs). FPGAs have been shown to provide an order-of-magnitude speedup for various computation-intensive applications. However, a Hardware Description Language (HDL) based FPGA implementation of all the arithmetic modules requires considerable development time, and the designer needs to be knowledgeable in both hardware design and HDL programming. Using IP cores can reduce the development time and design complexity. CG has three basic computational kernels, and among them Matrix-Vector Multiplication (MVM) is the most computationally intensive. Optimizing the MVM kernel for higher throughput can reduce the computation time required for each iteration of CG. In this research, the three basic kernels of CG are implemented on FPGAs using floating-point IP cores. The results show that an FPGA-based implementation of CG on an Arria 10 1150 GX achieves an order-of-magnitude speedup over a software implementation of CG on an Intel Xeon(R) CPU E5-2650 v2 (2.60 GHz).
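To make the kernel structure concrete, here is a minimal software sketch of CG in plain Python. The abstract names only MVM explicitly; the split of the other two kernels into a dot product and a scaled vector update (AXPY) is an assumption based on the textbook form of the algorithm, and the function names `mvm`, `dot`, `axpy`, and `conjugate_gradient` are hypothetical, not taken from the paper. It is a reference illustration only and says nothing about how the FPGA design is organized.

```python
# Illustrative sketch only. Kernel names and the dot/AXPY split are assumptions,
# not the authors' implementation.

def mvm(A, x):
    """Matrix-vector multiplication kernel: returns A*x for a dense matrix A."""
    return [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) for row in A]

def dot(u, v):
    """Dot-product kernel: returns the inner product of u and v."""
    return sum(u_i * v_i for u_i, v_i in zip(u, v))

def axpy(alpha, x, y):
    """Vector-update kernel: returns alpha*x + y element-wise."""
    return [alpha * x_i + y_i for x_i, y_i in zip(x, y)]

def conjugate_gradient(A, b, tol=1e-10, max_iter=1000):
    """Solve A*x = b for a symmetric positive-definite matrix A."""
    x = [0.0] * len(b)      # initial guess x = 0
    r = list(b)             # residual r = b - A*x, which equals b here
    p = list(r)             # initial search direction
    rs_old = dot(r, r)
    for _ in range(max_iter):
        if rs_old ** 0.5 < tol:         # stop once the residual norm is small
            break
        Ap = mvm(A, p)                  # the one MVM per iteration
        alpha = rs_old / dot(p, Ap)     # step length
        x = axpy(alpha, p, x)           # x = x + alpha*p
        r = axpy(-alpha, Ap, r)         # r = r - alpha*A*p
        rs_new = dot(r, r)
        p = axpy(rs_new / rs_old, p, r) # p = r + beta*p
        rs_old = rs_new
    return x

# Small symmetric positive-definite example system.
A = [[4.0, 1.0], [1.0, 3.0]]
b = [1.0, 2.0]
x = conjugate_gradient(A, b)
```

In this form each iteration performs one MVM, two dot products, and three vector updates. For a dense n-by-n matrix the MVM costs on the order of n² multiply-adds while the others cost on the order of n, which is consistent with the abstract's statement that MVM dominates the per-iteration time.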
Pages: 6
Related papers
50 in total
  • [1] FPGA implementation of the conjugate gradient method
    Maslennikow, Oleg
    Lepekha, Volodymyr
    Sergyienko, Anatoli
    [J]. PARALLEL PROCESSING AND APPLIED MATHEMATICS, 2006, 3911 : 526 - 533
  • [2] A high throughput FPGA-based floating point conjugate gradient implementation
    Lopes, Antonio Roldao
    Constantinides, George A.
    [J]. RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2008, 4943 : 75 - 86
  • [3] A High Throughput FPGA-Based Floating Point Conjugate Gradient Implementation for Dense Matrices
    Roldao, Antonio
    Constantinides, George A.
    [J]. ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2010, 3 (01)
  • [4] Preconditioned Conjugate Gradient Acceleration on FPGA-Based Platforms
    Malakonakis, Pavlos
    Isotton, Giovanni
    Miliadis, Panagiotis
    Alverti, Chloe
    Theodoropoulos, Dimitris
    Pnevmatikatos, Dionisios
    Ioannou, Aggelos
    Harteros, Konstantinos
    Georgopoulos, Konstantinos
    Papaefstathiou, Ioannis
    Mavroidis, Iakovos
    [J]. ELECTRONICS, 2022, 11 (19)
  • [5] An FPGA Implementation of Gradient Based Edge Detection Algorithm Design
    Yasri, I.
    Hamid, N. H.
    Yap, V. V.
    [J]. PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON COMPUTER TECHNOLOGY AND DEVELOPMENT, VOL 2, 2009, : 165 - 169
  • [6] An Implementation of the Conjugate Gradient Algorithm on FPGAs
    DuBois, David
    DuBois, Andrew
    Boorman, Thomas
    Connor, Carolyn
    Poole, Steve
    [J]. PROCEEDINGS OF THE SIXTEENTH IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, 2008, : 296 - +
  • [7] VLSI implementation of conjugate gradient based mobile user tracking system
    Baghaie, R
    Karttunen, P
    [J]. 42ND MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, PROCEEDINGS, VOLS 1 AND 2, 1999, : 309 - 312
  • [8] Non-Preconditioned Conjugate Gradient on Cell and FPGA based Hybrid Supercomputer Nodes
    DuBois, David
    DuBois, Andrew
    Boorman, Thomas
    Connor, Carolyn
    [J]. PROCEEDINGS OF THE 2009 17TH IEEE SYMPOSIUM ON FIELD PROGRAMMABLE CUSTOM COMPUTING MACHINES, 2009, : 201 - 208
  • [9] Conjugate gradient based implementation of interior point methods for network flow problems
    Mehrotra, S
    Wang, JS
    [J]. LINEAR AND NONLINEAR CONJUGATE GRADIENT-RELATED METHODS, 1996, : 124 - 142
  • [10] Moving images time gradient implementation using RAM-based FPGA
    Nozal, LL
    Aranguren, G
    Martin, JL
    Ezquerra, J
    [J]. REAL-TIME IMAGING II, 1997, 3028 : 108 - 116