共 50 条
- [1] Anatomy of high-performance matrix multiplication [J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2008, 34 (03):
- [2] A family of high-performance matrix multiplication algorithms [J]. APPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING, 2006, 3732 : 256 - 265
- [3] High-Performance Matrix-Vector Multiplication on the GPU [J]. EURO-PAR 2011: PARALLEL PROCESSING WORKSHOPS, PT I, 2012, 7155 : 377 - 386
- [4] High-performance systolic arrays for band matrix multiplication [J]. 2005 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), VOLS 1-6, CONFERENCE PROCEEDINGS, 2005, : 1130 - 1133
- [5] A high-performance matrix–matrix multiplication methodology for CPU and GPU architectures [J]. The Journal of Supercomputing, 2016, 72 : 804 - 844
- [6] A High-Performance Accelerator for Floating-Point Matrix Multiplication [J]. 2017 15TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS AND 2017 16TH IEEE INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS (ISPA/IUCC 2017), 2017, : 396 - 402
- [7] Anatomy of High-Performance Many-Threaded Matrix Multiplication [J]. 2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, 2014,
- [8] A high-performance matrix-matrix multiplication methodology for CPU and GPU architectures [J]. JOURNAL OF SUPERCOMPUTING, 2016, 72 (03): : 804 - 844
- [9] Exploiting Online Locality and Reduction Parallelism for Sampled Dense Matrix Multiplication on GPUs [J]. 2021 IEEE 39TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2021), 2021, : 567 - 574
- [10] Fault-tolerant high-performance matrix multiplication:: Theory and practice [J]. INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2001, : 47 - 56