共 35 条
- [1] Multilevel parallelism optimization of stencil computations on SIMDlized NUMA architectures [J]. JOURNAL OF SUPERCOMPUTING, 2021, 77 (11): : 13584 - 13600
- [2] Automatically Optimizing Stencil Computations on Many-Core NUMA Architectures [J]. LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, LCPC 2016, 2017, 10136 : 137 - 152
- [3] Islands-of-Cores Approach for Harnessing SMP/NUMA Architectures in Heterogeneous Stencil Computations [J]. PARALLEL COMPUTING TECHNOLOGIES (PACT 2017), 2017, 10421 : 351 - 364
- [4] Optimization and Performance Modeling of Stencil Computations on ARM Architectures [J]. Proceedings - 2020 IEEE 22nd International Conference on High Performance Computing and Communications, IEEE 18th International Conference on Smart City and IEEE 6th International Conference on Data Science and Systems, HPCC-SmartCity-DSS 2020, 2020, : 113 - 121
- [5] Tiling Stencil Computations to Maximize Parallelism [J]. 2012 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC), 2012,
- [6] Data Partitioning Strategies for Stencil Computations on NUMA Systems [J]. EURO-PAR 2017: PARALLEL PROCESSING WORKSHOPS, 2018, 10659 : 597 - 609
- [7] Modeling Stencil Computations on Modern HPC Architectures [J]. HIGH PERFORMANCE COMPUTING SYSTEMS: PERFORMANCE MODELING, BENCHMARKING, AND SIMULATION, 2015, 8966 : 149 - 171
- [8] NUMA Aware Iterative Stencil Computations on Many-Core Systems [J]. 2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2012, : 461 - 473
- [10] Unleashing the performance of ccNUMA multiprocessor architectures in heterogeneous stencil computations [J]. JOURNAL OF SUPERCOMPUTING, 2019, 75 (12): : 7765 - 7777