共 50 条
- [1] GLaM: Efficient Scaling of Language Models with Mixture-of-Experts INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [2] MoESys: A Distributed and Efficient Mixture-of-Experts Training and Inference System for Internet Services IEEE Transactions on Services Computing, 2024, 17 (05): : 1 - 15
- [3] Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 3577 - 3599
- [4] Efficient Routing in Sparse Mixture-of-Experts Shamsolmoali, Pourya (pshams55@gmail.com), 1600, Institute of Electrical and Electronics Engineers Inc.
- [8] Breaking the gridlock in Mixture-of-Experts: Consistent and Efficient Algorithms INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [10] New estimation and feature selection methods in mixture-of-experts models CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2010, 38 (04): : 519 - 539