共 50 条
- [21] Analysis of large deviations behavior of multi-GPU memory access in deep learning The Journal of Supercomputing, 2018, 74 : 2199 - 2212
- [22] CROSSBOW: Scaling Deep Learning with Small Batch Sizes on Multi-GPU Servers PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (11): : 1399 - 1413
- [23] Adaptive Communication for Distributed Deep Learning on Commodity GPU Cluster 2018 18TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2018, : 283 - 290
- [25] Efficient Large-scale Deep Learning Framework for Heterogeneous Multi-GPU Cluster 2019 IEEE 4TH INTERNATIONAL WORKSHOPS ON FOUNDATIONS AND APPLICATIONS OF SELF* SYSTEMS (FAS*W 2019), 2019, : 176 - 181
- [26] Parallel Computing Model and Performance Prediction based on Multi-GPU Environments 2011 INTERNATIONAL CONFERENCE ON FUTURE COMPUTERS IN EDUCATION (ICFCE 2011), VOL I, 2011, : 309 - 312
- [27] Performance Evaluation of a Multi-GPU Enabled Finite Element Method for Computational Electromagnetics EURO-PAR 2011: PARALLEL PROCESSING WORKSHOPS, PT II, 2012, 7156 : 355 - 364
- [28] Multi-GPU Server Deign Parameters Selection based on Empirical Observation of HPL Behavior 2021 36TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC), 2021,
- [29] Poseidon: An Efficient Communication Architecture for Distributed Deep Learning on GPU Clusters 2017 USENIX ANNUAL TECHNICAL CONFERENCE (USENIX ATC '17), 2017, : 181 - 193
- [30] High-Performance Adaptive MPI Derived Datatype Communication for Modern Multi-GPU Systems 2019 IEEE 26TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC), 2019, : 267 - 276