Accelerating Real-Valued FFT on CPU-FPGA Platforms

被引:0
|
作者
Qian, Zhuo [1 ]
Gan, Guoyou [2 ]
机构
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650000, Yunnan, Peoples R China
[2] Kunming Univ Sci & Technol, Fac Mat Sci & Engn, Kunming 650000, Yunnan, Peoples R China
关键词
FPGA; high-level synthesis; open computing language (OpenCL); real-valued fast Fourier transform (FFT); ARCHITECTURES;
D O I
10.1109/TCAD.2024.3377160
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The real-valued fast Fourier transform (RFFT) is an ideal candidate for implementing a high-speed and low-power FFT processor because it only has approximately half the number of arithmetic operations compared with traditional complex-valued FFT (CFFT). Although RFFT can be calculated using CFFT hardware, a dedicated RFFT implementation can result in reduced hardware complexity, power consumption and increased throughput. However, unlike CFFT, RFFT has irregular signal flow graphs which hinders the design of efficient pipelined architectures. In this article, utilizing open computing language (OpenCL), we propose a high-level programming method for the implementation of pipelined architectures of RFFT on FPGAs. By identifying the regular computational pattern in the flow graph of RFFT, the proposed method essentially uses a for loop to implement the RFFT algorithm, and later with the help of high-level synthesis tools, the loop is fully unrolled to automatically build pipelined architectures. Experiments show that for a 4096-point RFFT, the proposed method achieves a 2.49 x speedup and 3.09 x better-energy efficiency over CUFFT on GPU, and a 21.12 x speedup and 16.09 x better-energy efficiency over FFTW on CPU, respectively. Compared to Intel's CFFT design on the same FPGA, the proposed one reduces 12% logic resources and 16% DSP blocks, respectively, while achieving a 1.48 x speedup.
引用
下载
收藏
页码:2532 / 2536
页数:5
相关论文
共 50 条
  • [1] Accelerating Proximal Policy Optimization on CPU-FPGA Heterogeneous Platforms
    Meng, Yuan
    Kuppannagari, Sanmukh
    Prasanna, Viktor
    28TH IEEE INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2020, : 19 - 27
  • [2] GraphACT: Accelerating GCN Training on CPU-FPGA Heterogeneous Platforms
    Zeng, Hanqing
    Prasanna, Viktor
    2020 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA '20), 2020, : 255 - 265
  • [3] Accelerating Graph Analytics on CPU-FPGA Heterogeneous Platform
    Zhou, Shijie
    Prasanna, Viktor K.
    2017 29TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 2017, : 137 - 144
  • [4] A Quantitative Analysis on Microarchitectures of Modern CPU-FPGA Platforms
    Choi, Young-Kyu
    Cong, Jason
    Fang, Zhenman
    Hao, Yuchen
    Reinman, Glenn
    Wei, Peng
    2016 ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2016,
  • [5] Accelerating Equi-Join on a CPU-FPGA Heterogeneous Platform
    Chen, Ren
    Prasanna, Viktor K.
    2016 IEEE 24TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2016, : 212 - 219
  • [6] Canonic Real-Valued FFT Structures
    Parhi, Megha
    Lao, Yingjie
    Parhi, Keshab K.
    CONFERENCE RECORD OF THE 2014 FORTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2014, : 1261 - 1265
  • [7] Accelerating Pattern Matching Queries in Hybrid CPU-FPGA Architectures
    Sidler, David
    Istvan, Zsolt
    Owaida, Muhsen
    Alonso, Gustavo
    SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, : 403 - 415
  • [8] A Hybrid Approach to Cache Management in Heterogeneous CPU-FPGA Platforms
    Feng, Liang
    Sinha, Sharad
    Zhang, Wei
    Liang, Yun
    2017 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2017, : 937 - 944
  • [9] STPAcc: Structural TI-Based Pruning for Accelerating Distance-Related Algorithms on CPU-FPGA Platforms
    Wang, Yuke
    Feng, Boyuan
    Li, Gushu
    Deng, Lei
    Xie, Yuan
    Ding, Yufei
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (05) : 1358 - 1370
  • [10] Accelerating Multi-Agent DDPG on CPU-FPGA Heterogeneous Platform
    Wiggins, Samuel
    Meng, Yuan
    Kannan, Rajgopal
    Prasanna, Viktor
    2023 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE, HPEC, 2023,