共 25 条
- [1] Automatic CPU/GPU Generation of Multi-versioned OpenCL Kernels for C++ Scientific Applications International Journal of Parallel Programming, 2017, 45 : 262 - 282
- [2] Automatic Data Layout Generation and Kernel Mapping for CPU plus GPU Architectures PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON COMPILER CONSTRUCTION (CC 2016), 2016, : 240 - 250
- [3] Merge or Separate? Multi-job Scheduling for OpenCL Kernels on CPU/GPU Platforms PROCEEDINGS OF THE GENERAL PURPOSE GPUS (GPGPU-10), 2017, : 22 - 31
- [4] Compilation of MATLAB computations to CPU/GPU via C/OpenCL generation CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (22):
- [5] Hierarchical Partitioning Algorithm for Scientific Computing on Highly Heterogeneous CPU plus GPU Clusters EURO-PAR 2012 PARALLEL PROCESSING, 2012, 7484 : 489 - 501
- [6] Concurrent CPU-GPU Task Programming using Modern C plus 2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 588 - 597
- [8] Porting MATLAB Applications to High-Performance C plus plus Codes: CPU/GPU-Accelerated Spherical Deconvolution of Diffusion MRI Data ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2016, 2016, 10048 : 630 - 643
- [9] Exploring data flow design and vectorization with oneAPI for streaming applications on CPU plus GPU JOURNAL OF SUPERCOMPUTING, 2025, 81 (02):
- [10] Partition Strategies for C Source Programs to Support CPU plus GPU Coordination Computing 2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CLOUD COMPUTING (ISCC), 2014, : 39 - 48