共 50 条
- [21] SparCML: High-Performance Sparse Communication for Machine Learning [J]. PROCEEDINGS OF SC19: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2019,
- [24] DeltaSPARSE: High-Performance Sparse General Matrix-Matrix Multiplication on Multi-GPU Systems [J]. 2023 IEEE 30TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS, HIPC 2023, 2023, : 194 - 202
- [25] Learning Everywhere: Pervasive Machine Learning for Effective High-Performance Computation [J]. 2019 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2019, : 422 - 429
- [26] A high-performance batched matrix multiplication framework for GPUs under unbalanced input distribution [J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (02): : 1741 - 1758
- [27] SparseX: A Library for High-Performance Sparse Matrix-Vector Multiplication on Multicore Platforms [J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2018, 44 (03):
- [28] IMPLEMENTING HIGH-PERFORMANCE COMPLEX MATRIX MULTIPLICATION VIA THE 1M METHOD [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2020, 42 (05): : C221 - C244
- [29] A high-performance batched matrix multiplication framework for GPUs under unbalanced input distribution [J]. The Journal of Supercomputing, 2022, 78 : 1741 - 1758
- [30] A Machine Learning Approach Towards Runtime Optimisation of Matrix Multiplication [J]. 2023 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS, 2023, : 524 - 534