共 50 条
- [1] Memory-Efficient Pipeline-Parallel DNN Training [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [2] CAPTURE: Memory-Centric Partitioning for Distributed DNN Training with Hybrid Parallelism [J]. 2023 IEEE 30TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS, HIPC 2023, 2023, : 76 - 86
- [3] CAPSlog: Scalable Memory-Centric Partitioning for Pipeline Parallelism [J]. 2024 32ND EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, PDP 2024, 2024, : 17 - 25
- [5] Muulti-dimensional Parallel Training of Winograd Layer on Memory-Centric Architecture [J]. 2018 51ST ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2018, : 682 - 695
- [7] A PARALLEL PARTITIONING METHOD FOR LARGE-SCALE CIRCUIT SIMULATION [J]. UNIVERSITY PROGRAMS IN COMPUTER-AIDED ENGINEERING, DESIGN, AND MANUFACTURING, 1989, : 134 - 141
- [8] DistSim: A performance model of large-scale hybrid distributed DNN training [J]. PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS 2023, CF 2023, 2023, : 112 - 122