共 50 条
- [1] CUDA-Accelerated SVM for Celestial Object Classification ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XX, 2011, 442 : 119 - 122
- [2] Systematic Fusion of CUDA Kernels for Iterative Sparse Linear System Solvers EURO-PAR 2015: PARALLEL PROCESSING, 2015, 9233 : 675 - 686
- [5] Fast Sparse GPU Kernels for Accelerated Training of Graph Neural Networks 2023 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS, 2023, : 501 - 511
- [9] Modeling and Analyzing Evaluation Cost of CUDA Kernels PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2021, 5 (POPL):
- [10] Efficient NAS Parallel Benchmark Kernels with CUDA 2020 28TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2020), 2020, : 9 - 16