共 27 条
- [21] An implementation of parallel 3-D FFT using short vector SIMD instructions on clusters of PCs APPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING, 2006, 3732 : 1159 - 1167
- [22] An Access-Pattern-Aware On-Chip Vector Memory System with Automatic Loading for SIMD Architectures 2018 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2018,
- [23] Mod (2P-1) Shuffle Memory-Access Instructions for FFTs on Vector SIMD DSPs 2016 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI), 2016, : 426 - 430
- [25] On the performance improvement of sub-sampling MPEG-2 motion estimation algorithms with vector/SIMD architectures ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PROCEEDINGS, 2005, 3708 : 595 - 602
- [26] Exploring Source-to-Source Compiler Transformation of OpenMP SIMD Constructs for Intel AVX and Arm SVE Vector Architectures PROCEEDINGS OF THE THIRTEENTH INTERNATIONAL WORKSHOP ON PROGRAMMING MODELS AND APPLICATIONS FOR MULTICORES AND MANYCORES (PMAM '22), 2022, : 11 - 20
- [27] SLAP: A SPLIT LATENCY ADAPTIVE VLIW PIPELINE ARCHITECTURE WHICH ENABLES ON-THE-FLY VARIABLE SIMD VECTOR-LENGTH 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7868 - 7872