共 46 条
- [23] NARMADA: Near-memory horizontal diffusion accelerator for scalable stencil computations [J]. 2019 29TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2019, : 263 - 269
- [24] Pipelined CPU-GPU Scheduling to Reduce Main Memory Accesses [J]. PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS, MEMSYS 2021, 2021,
- [25] Locality-Aware Stencil Computations using Flash SSDs as Main Memory Extension [J]. 2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 1163 - 1168
- [27] Quantifying Performance Bottlenecks of Stencil Computations Using the Execution-Cache-Memory Model [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS'15), 2015, : 207 - 216
- [28] Promising 2.0: Global Optimizations in Relaxed Memory Concurrency [J]. PROCEEDINGS OF THE 41ST ACM SIGPLAN CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION (PLDI '20), 2020, : 362 - 376
- [29] DNA computations can have global memory [J]. INTERNATIONAL CONFERENCE ON COMPUTER DESIGN - VLSI IN COMPUTERS AND PROCESSORS, PROCEEDINGS, 1996, : 344 - 347
- [30] Instruction combining for coalescing memory accesses using global code motion [J]. Proc. ACM SIGPLAN Workshop Mem. Syst. Perform., MSP, 1600, (2-11):