共 18 条
- [1] FEDUS W, ZOPH B, SHAZEER N., Switch transformers: Scaling to trillion parameter models with simple and efficient sparsity
- [2] CHENG H T, KOC L, HARMSEN J, Et al., Wide & deep learning for recommender systems, Proceedings of the 1st Workshop on Deep Learning for Recommender Systems, pp. 7-10, (2016)
- [3] WANG RX, FU B, FU G, Et al., Deep & cross network for ad click predictions [C], Proceedings of the ADKDD17, pp. 1-7, (2017)
- [4] LI M, ANDERSEN D G, PARK J W, Et al., Scaling distributed machine learning with the parameter server, Proceedings of the 11th USENIX Symposium on Operating Systems Design and Implementation, pp. 583-598, (2014)
- [5] JIANG J, YU L L, JIANG J W, Et al., Angel: A new large-scale machine learning system, National Science Review, 5, 2, pp. 216-236, (2018)
- [6] LI M, ANDERSEN D G, SMOLA A, Et al., Communication efficient distributed machine learning with the parameter server, Proceedings of the 27 th International Conference on Neural Information Processing Systems, 1, pp. 19-27, (2014)
- [7] ABADI M, BARHAM P, CHEN J M, Et al., TensorFlow: A system for large-scale machine learning, Proceedings of the 12th USENIX conference on Operating Systems Design and Implementation, pp. 265-283, (2016)
- [8] PASZKE A, GROSS S, MASS A F, Et al., PyTorch: An imperative style, high-performance deep learning library, Proceedings of the 33rd Conference on Neural Information Processing Systems, pp. 8026-8037, (2019)
- [9] LI S, ZHAO Y L, VARMA R, Et al., PyTorch distributed: Experiences on accelerating data parallel training, Proceedings of the VLDB Endowment, 13, 12, pp. 3005-3018, (2020)
- [10] PS-Lite, A light and efficient implementation of the parameter server framework