50 entries in total
- [1] Parallel Context Windows for Large Language Models [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023: 6383 - 6402
- [2] Towards the holistic design of alloys with large language models [J]. NATURE REVIEWS MATERIALS, 2024
- [3] Fast Parallel Training of Neural Language Models [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017: 4193 - 4199
- [4] Augmenting interpretable models with large language models during training [J]. Nature Communications, 14
- [7] Accelerating the Training of Large Language Models using Efficient Activation Rematerialization and Optimal Hybrid Parallelism [J]. PROCEEDINGS OF THE 2024 USENIX ANNUAL TECHNICAL CONFERENCE, ATC 2024, 2024: 545 - 561
- [8] GalaxyGPT: A Hybrid Framework for Large Language Model Safety [J]. IEEE ACCESS, 2024, 12 : 94436 - 94451
- [9] Training Hybrid Language Models by Marginalizing over Segmentations [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019: 1477 - 1482