共 50 条
- [1] WORKLOAD-AWARE AUTOMATIC PARALLELIZATION FOR MULTI-GPU DNN TRAINING 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1453 - 1457
- [2] Efficient Multi-GPU Shared Memory via Automatic Optimization of Fine-Grained Transfers 2021 ACM/IEEE 48TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2021), 2021, : 139 - 152
- [3] Parallelization of benchmarks for scalable shared-memory multiprocessors 1998 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 1998, : 401 - 408
- [5] The Optimization of Model Parallelization Strategies for Multi-GPU Training 2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
- [7] Topology-Aware GPU Selection on Multi-GPU Nodes 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 712 - 720
- [8] Global Shared Memory Design for Multi-GPU Graphics Cards on Personal Supercomputer INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 1236 - 1241
- [10] Multi-GPU and Multi-CPU Parallelization for Interactive Physics Simulations EURO-PAR 2010 - PARALLEL PROCESSING, PART II, 2010, 6272 : 235 - 246