共 50 条
- [45] Training large-scale language models with limited GPU memory: a survey FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2025, : 309 - 331
- [46] PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [47] Training Large-Scale News Recommenders with Pretrained Language Models in the Loop PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4215 - 4225
- [49] MixPipe: Efficient Bidirectional Pipeline Parallelism for Training Large-Scale Models 2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,