共 50 条
- [31] Extracting Memory-Level Parallelism through Reconfigurable Hardware Traces 2013 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2013,
- [32] Simulation and Architecture Improvements of Atomic Operations on GPU Scratchpad Memory 2013 IEEE 31ST INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2013, : 357 - 362
- [35] DYNAMO - A PORTABLE TOOL FOR DYNAMIC LOAD BALANCING ON DISTRIBUTED-MEMORY MULTICOMPUTERS CONCURRENCY-PRACTICE AND EXPERIENCE, 1994, 6 (08): : 613 - 639
- [36] BLPP: Improving the Performance of GPGPUs with Heterogeneous Memory through Bandwidth- and Latency-Aware Page Placement 2018 IEEE 36TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2018, : 358 - 365
- [37] Exploiting mixed-mode parallelism for matrix operations on the HERA architecture through reconfiguration IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 2006, 153 (04): : 249 - 260
- [38] Dynamic application placement under service and memory constraints EXPERIMENTAL AND EFFICIENT ALGORITHMS, PROCEEDINGS, 2005, 3503 : 391 - 402
- [39] Performance evaluation of unified memory and dynamic parallelism for selected parallel CUDA applications The Journal of Supercomputing, 2017, 73 : 5378 - 5401
- [40] Performance evaluation of unified memory and dynamic parallelism for selected parallel CUDA applications JOURNAL OF SUPERCOMPUTING, 2017, 73 (12): : 5378 - 5401