50 entries in total
- [41] C-for-Metal: High Performance SIMD Programming on Intel GPUs. CGO '21: PROCEEDINGS OF THE 2021 IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO), 2021: 289-300
- [44] Improving CADNA performance on GPUs. 2018 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2018), 2018: 1016-1025
- [45] Scalar Processing Overhead on SIMD-Only Architectures. 2009 20TH IEEE INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS, 2009: 183-190
- [46] Sharing SIMD execution units with decoupled offloader in asymmetric multicores. Analog Integrated Circuits and Signal Processing, 2022, 112: 263-275
- [47] Warp-Consolidation: A Novel Execution Model for GPUs. INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS 2018), 2018: 53-64
- [48] MIMD Programs Execution Support on SIMD Machines: A Holistic Survey. IEEE ACCESS, 2024, 12: 34354-34377
- [49] Improving SIMD Code Generation in QEMU. 2015 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2015: 1233-1236