共 50 条
- [1] Efficient Stencil Computation with Temporal Blocking by Halide DSL [J]. 2022 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING, ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM, 2022, : 870 - 877
- [2] An Extension of OpenACC Directives for Out-of-Core Stencil Computation with Temporal Blocking [J]. PROCEEDINGS OF WACCPD 2016: THIRD WORKSHOP ON ACCELERATOR PROGRAMMING USING DIRECTIVES, 2016, : 36 - 45
- [3] Revisiting Temporal Blocking Stencil Optimizations [J]. PROCEEDINGS OF THE 37TH INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ACM ICS 2023, 2023, : 251 - 263
- [4] Combined Spatial and Temporal Blocking for High-Performance Stencil Computation on FPGAs Using OpenCL [J]. PROCEEDINGS OF THE 2018 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'18), 2018, : 153 - 162
- [5] Applying Recursive Temporal Blocking for Stencil Computations to Deeper Memory Hierarchy [J]. 2018 7TH IEEE NON-VOLATILE MEMORY SYSTEMS AND APPLICATIONS SYMPOSIUM (NVMSA 2018), 2018, : 19 - 24
- [6] Accelerating Stencil Computations on a GPU by Combining Using Tensor Cores and Temporal Blocking [J]. 16TH WORKSHOP ON GENERAL PURPOSE PROCESSING USING GPU, GPGPU 2024, 2024, : 1 - 6
- [7] Efficient temporal blocking for stencil computations by multicore-aware wavefront parallelization [J]. 2009 IEEE 33RD INTERNATIONAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 579 - +
- [10] Locality of Computation for Stencil Optimization [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2016, 2016, 10048 : 449 - 456