50 entries in total
- [1] DistSim: A performance model of large-scale hybrid distributed DNN training [J]. PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS 2023, CF 2023, 2023 : 112 - 122
- [3] Interactive visual analytics of parallel training strategies for DNN models [J]. COMPUTERS & GRAPHICS-UK, 2023, 115 : 392 - 403
- [4] mCAP: Memory-Centric Partitioning for Large-Scale Pipeline-Parallel DNN Training [J]. EURO-PAR 2022: PARALLEL PROCESSING, 2022, 13440 : 155 - 170
- [6] AI Accelerator Embedded Computational Storage for Large-Scale DNN Models [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2022): INTELLIGENT TECHNOLOGY IN THE POST-PANDEMIC ERA, 2022 : 483 - 486
- [7] Performance prediction of large-scale parallel discrete event models of physical systems [J]. Proceedings of the 2005 Winter Simulation Conference, Vols 1-4, 2005 : 356 - 364
- [8] Formal Metrics for Large-Scale Parallel Performance [J]. HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2015, 2015, 9137 : 488 - 496
- [10] Prophet: Fine-grained Load Balancing for Parallel Training of Large-scale MoE Models [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, CLUSTER, 2023 : 82 - 94