共 50 条
- [1] A high-performance batched matrix multiplication framework for GPUs under unbalanced input distribution [J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (02): : 1741 - 1758
- [3] Anatomy of high-performance matrix multiplication [J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2008, 34 (03):
- [4] High-Performance Homomorphic Matrix Completion on Multiple GPUs [J]. IEEE ACCESS, 2020, 8 : 25395 - 25406
- [5] Fast Batched Matrix Multiplication for Small Sizes using Half-Precision Arithmetic on GPUs [J]. 2019 IEEE 33RD INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2019), 2019, : 111 - 122
- [6] A family of high-performance matrix multiplication algorithms [J]. APPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING, 2006, 3732 : 256 - 265
- [7] Matrix Converter Performance Under Unbalanced Input-Voltage [J]. 2008 40TH NORTH AMERICAN POWER SYMPOSIUM (NAPS 2008), 2008, : 500 - +
- [8] Unleashing the performance of bmSparse for the sparse matrix multiplication in GPUs [J]. PROCEEDINGS OF SCALA 2021: 12TH WORKSHOP ON LATEST ADVANCES IN SCALABLE ALGORITHMS FOR LARGE- SCALE SYSTEMS, 2021, : 19 - 26
- [10] High-Performance Matrix-Vector Multiplication on the GPU [J]. EURO-PAR 2011: PARALLEL PROCESSING WORKSHOPS, PT I, 2012, 7155 : 377 - 386