Vector Memory-Access Shuffle Fused Instructions for FFT-Like Algorithms

被引:0
|
作者
LIU Sheng [1 ]
YUAN Bo [1 ]
GUO Yang [1 ]
SUN Haiyan [1 ]
JIANG Zekun [1 ]
机构
[1] College of Computer, National University of Defense Technology
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP333 [存贮器]; TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 081201 ; 0835 ; 1405 ;
摘要
The shuffle operations are the bottleneck when mapping the FFT-like algorithms to the vector single instruction multiple data(SIMD) architectures.We propose six(three pairs) innovative vector memoryaccess shuffle fused instructions, which have been proved mathematically. Combined with the proposed modified binary-exchange method, the innovative instructions can efficiently address the bottleneck problem for decimationin-frequency or decimation-in-time(DIF/DIT) radix-2/4FFT-like algorithms, reach a performance improvement by 17.9%–111.2% and reduce the code size by 5.4%–39.8%.In addition, the proposed instructions fit some hybridradix FFTs and are suitable for the terms of the initial or result data placement for general algorithms. The software and hardware costs of the proposed instructions are moderate.
引用
收藏
页码:1077 / 1088
页数:12
相关论文
共 6 条
  • [1] Vector Memory-Access Shuffle Fused Instructions for FFT-Like Algorithms
    Liu Sheng
    Yuan Bo
    Guo Yang
    Sun Haiyan
    Jiang Zekun
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2023, 32 (05) : 1077 - 1088
  • [2] Mod (2P-1) Shuffle Memory-Access Instructions for FFTs on Vector SIMD DSPs
    Liu, Sheng
    Chen, Haiyan
    Wan, Jianghua
    Wang, Yaohua
    [J]. 2016 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI), 2016, : 426 - 430
  • [3] Algorithms and pipeline architectures for 2-D FFT and FFT-like transforms
    Nibouche, O.
    Boussakta, S.
    Darnell, M.
    Benaissa, M.
    [J]. DIGITAL SIGNAL PROCESSING, 2010, 20 (04) : 1072 - 1086
  • [4] Construction of Ternary Bent Functions by FFT-like Permutation Algorithms
    Stankovic, Radomir S.
    Stankovic, Milena
    Moraga, Claudio
    Astola, Jaakko T.
    [J]. 2020 IEEE 50TH INTERNATIONAL SYMPOSIUM ON MULTIPLE-VALUED LOGIC (ISMVL 2020), 2020, : 88 - 93
  • [5] Construction of Ternary Bent Functions by FFT-Like Permutation Algorithms
    Stankovic, Radomir S.
    Stankovic, Milena
    Moraga, Claudio
    Astola, Jaakko T.
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (08) : 1092 - 1102
  • [6] Decoupled Vector Runahead for Prefetching Nested Memory-Access Chains
    Naithani, Ajeya
    Roelandts, Jaime
    Ainsworth, Sam
    Jones, Timothy M.
    Eeckhout, Lieven
    [J]. IEEE MICRO, 2024, 44 (04) : 20 - 26