共 50 条
- [3] MoDE: A Mixture-of-Experts Model with Mutual Distillation among the Experts [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 14, 2024, : 16067 - 16075
- [4] Spatial Mixture-of-Experts [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [5] A Mixture-of-Experts Model for Antonym-Synonym Discrimination [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 558 - 564
- [6] SPEECHMOE2: MIXTURE-OF-EXPERTS MODEL WITH IMPROVED ROUTING [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7217 - 7221
- [7] Parsimonious mixture-of-experts based on mean mixture of multivariate normal distributions [J]. STAT, 2022, 11 (01):
- [8] Bayesian shrinkage in mixture-of-experts models: identifying robust determinants of class membership [J]. Advances in Data Analysis and Classification, 2019, 13 : 1019 - 1051
- [10] Asymptotic properties of mixture-of-experts models [J]. NEUROCOMPUTING, 2011, 74 (09) : 1444 - 1449