71 entries in total
- [1] Bottou L., Curtis F.E., Nocedal J., Optimization methods for large-scale machine learning, (2016)
- [2] Parikh N., Boyd S., Proximal algorithms, Foundations and Trends in Optimization, 1, 3, pp. 127-239, (2014)
- [3] Wright S.J., Coordinate descent algorithms, (2015)
- [4] Boyd S., Parikh N., Chu E., Peleato B., Eckstein J., Distributed optimization and statistical learning via the alternating direction method of multipliers, Foundations and Trends in Machine Learning, 3, 1, pp. 1-122, (2011)
- [5] Xing E.P., Ho Q., Xie P.T., Dai W., Strategies and principles of distributed machine learning on big data, Engineering, 2, 2, pp. 179-195, (2016)
- [6] Loulergue F., Gava F., Billiet D., Bulk synchronous parallel ML: Modular implementation and performance prediction, Proc. of the Int'l Conf. on Computational Science, pp. 1046-1054, (2005)
- [7] Nesterov Y., Introductory Lectures on Convex Optimization: A Basic Course, (2004)
- [8] Meng X.R., Bradley J., Yavuz B., Sparks E., Venkataraman S., Liu D., Freeman J., Tsai D.B., Amde M., Owen S., Xin D., Xin R., Franklin M.J., Zadeh R., Zaharia M., Talwalkar A., MLlib: Machine learning in Apache Spark, The Journal of Machine Learning Research, 17, 1, pp. 1235-1241, (2016)
- [9] Zinkevich M.A., Weimer M., Smola A., Li L.H., Parallelized stochastic gradient descent, Advances in Neural Information Processing Systems, pp. 2595-2603, (2010)
- [10] Leen T.K., Orr G.B., Optimal stochastic search and adaptive momentum, Advances in Neural Information Processing Systems, pp. 477-484, (1994)