共 50 条
- [31] Nonblocking Data Structures for Distributed-Memory Machines: Stacks as an Example 2021 29TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2021), 2021, : 9 - 17
- [32] Distributed-memory multi-GPU block-sparse tensor contraction for electronic structure 2021 IEEE 35TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2021, : 537 - 546
- [33] Protein database search of hybrid alignment algorithm based on GPU parallel acceleration JOURNAL OF SUPERCOMPUTING, 2017, 73 (10): : 4517 - 4534
- [34] Protein database search of hybrid alignment algorithm based on GPU parallel acceleration The Journal of Supercomputing, 2017, 73 : 4517 - 4534
- [36] Generic matrix multiplication for multi-GPU accelerated distributed-memory platforms over PARSEC PROCEEDINGS OF SCALA 2019: 2019 IEEE/ACM 10TH WORKSHOP ON LATEST ADVANCES IN SCALABLE ALGORITHMS FOR LARGE-SCALE SYSTEMS (SCALA), 2019, : 33 - 41
- [38] Generating Efficient Data Movement Code for Heterogeneous Architectures with Distributed-Memory 2013 22ND INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT), 2013, : 375 - 386
- [40] PROCESSOR TAGGED DESCRIPTORS - A DATA STRUCTURE FOR COMPILING FOR DISTRIBUTED-MEMORY MULTICOMPUTERS PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 1994, 50 : 123 - 132