Acceleration with long vector architectures: Implementation and evaluation of the FFT kernel on NEC SX-Aurora and RISC-V vector extension

被引:2
|
作者
Vizcaino, Pablo [1 ]
Mantovani, Filippo [1 ]
Ferrer, Roger [1 ]
Labarta, Jesus [1 ,2 ]
机构
[1] Barcelona Supercomp Ctr, Placa Eusebi Guell 1-3, Barcelona 08034, Spain
[2] Univ Politecn Cataluna, Barcelona, Spain
来源
关键词
fast Fourier transform; FFT; NEC SX-Aurora; RISC-V; SIMD; vector acceleration;
D O I
10.1002/cpe.7424
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Novel architectures leveraging long and variable vector lengths like the NEC SX-Aurora or the vector extension of RISCV are appearing as promising solutions on the supercomputing market. These architectures often require re-coding of scientific kernels. For example, traditional implementations of algorithms for computing the fast Fourier transform (FFT) cannot take full advantage of vector architectures. In this article, we present the implementation of FFT algorithms able to leverage these novel architectures. We evaluate these codes on NEC SX-Aurora , comparing them with the optimized NEC libraries; and in a prototype of a RISC-V core with a vector processing unit. We present the benefits and limitations of two approaches of RADIX-2 FFT vector implementations. We show that our approach makes better use of the vector unit of the NEC SX-Aurora , reaching higher or equal performance than the optimized NEC library. More generally, we prove the importance of maximizing the vector length usage of the algorithm, taking advantage of the FFT properties to reduce long-latency vector operations, and reordering the instructions according to the specific hardware features to boost the performance of FFT-like computational kernels.
引用
收藏
页数:14
相关论文
共 33 条
  • [1] Accelerating FFT Using NEC SX-Aurora Vector Engine
    Vizcaino, Pablo
    Mantovani, Filippo
    Labarta, Jesus
    EURO-PAR 2021: PARALLEL PROCESSING WORKSHOPS, 2022, 13098 : 179 - 190
  • [2] Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA
    Komatsu, Kazuhiko
    Momose, Shintaro
    Isobe, Yoko
    Watanabe, Osamu
    Musa, Akihiro
    Yokokawa, Mitsuo
    Aoyama, Toshikazu
    Sato, Masayuki
    Kobayashi, Hiroaki
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE, AND ANALYSIS (SC'18), 2018,
  • [3] A Pluggable Vector Unit for RISC-V Vector Extension
    Maisto, Vincenzo
    Cilardo, Alessandro
    PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 1143 - 1148
  • [4] Evaluating the Performance of OpenMP Offloading on the NEC SX-Aurora TSUBASA Vector Engine
    Cramer T.
    Kosmynin B.
    Moll S.
    Römmer M.
    Focht E.
    Müller M.S.
    Supercomputing Frontiers and Innovations, 2021, 8 (02) : 59 - 74
  • [5] Efficient Support of the Scan Vector Model for RISC-V Vector Extension
    Lai, Hung-Ming
    Lee, Jenq-Kuen
    51ST INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS PROCEEDINGS, ICPP 2022, 2022,
  • [6] Vectorized Nonlinear Functions with the RISC-V Vector Extension
    Bavier, Eric
    Knight, Nicholas
    Saint-Genies, Hugues De lassus
    Love, Eric
    2023 IEEE 30TH SYMPOSIUM ON COMPUTER ARITHMETIC, ARITH 2023, 2023, : 127 - 130
  • [7] Functional Validation of the RISC-V Unlimited Vector Extension
    Fernandes, Ana
    Crespo, Luis
    Neves, Nuno
    Tomas, Pedro
    Roma, Nuno
    Falcao, Gabriel
    IEEE EMBEDDED SYSTEMS LETTERS, 2025, 17 (01) : 2 - 5
  • [8] VGL: a high-performance graph processing framework for the NEC SX-Aurora TSUBASA vector architecture
    Ilya V. Afanasyev
    Vladimir V. Voevodin
    Kazuhiko Komatsu
    Hiroaki Kobayashi
    The Journal of Supercomputing, 2021, 77 : 8694 - 8715
  • [9] Communications Signal Processing Using RISC-V Vector Extension
    Razilov, Viktor
    Matus, Emil
    Fettweis, Gerhard
    2022 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2022, : 690 - 695
  • [10] VGL: a high-performance graph processing framework for the NEC SX-Aurora TSUBASA vector architecture
    Afanasyev, Ilya V.
    Voevodin, Vladimir V.
    Komatsu, Kazuhiko
    Kobayashi, Hiroaki
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (08): : 8694 - 8715