80 entries in total
- [1] Bekkerman R, Bilenko M, Langford J, Scaling Up Machine Learning: Parallel and Distributed Approaches, (2011)
- [2] Gonzalez J E, Low Y, Gu H, et al., PowerGraph: Distributed graph-parallel computation on natural graphs, Proceedings of the 10th USENIX Symposium on Operating Systems Design and Implementation (OSDI 12), pp. 17-30, (2012)
- [3] Li C, Xue Y, Wang J, et al., Edge-oriented computing paradigms: A survey on architecture design and system management, ACM Computing Surveys, 51, 2, pp. 1-34, (2018)
- [4] Chen T, Moreau T, Jiang Z, et al., TVM: An automated end-to-end optimizing compiler for deep learning, Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18), pp. 578-594, (2018)
- [5] Liu D, Chen T, Liu S, et al., PuDianNao: A polyvalent machine learning accelerator, ACM SIGARCH Computer Architecture News, 43, 1, pp. 369-381, (2015)
- [6] Jouppi N P, Young C, Patil N, et al., In-datacenter performance analysis of a tensor processing unit, Proceedings of the 44th Annual International Symposium on Computer Architecture, pp. 1-12, (2017)
- [7] Zhu Hu-Ming, Li Pei, Jiao Li-Cheng, et al., The overview of the parallelization in deep neural network, Chinese Journal of Computers, 41, 8, pp. 171-191, (2018)
- [8] Chu C, Kim S K, Lin Y A, et al., Map-reduce for machine learning on multicore, Advances in Neural Information Processing Systems, 19, (2007)
- [9] Choudhary A N, Honbo D, Kumar P, et al., Accelerating data mining workloads: Current approaches and future challenges in system architecture design, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 1, 1, pp. 41-54, (2011)
- [10] Huang Shan, Wang Bo-Tao, Wang Guo-Ren, et al., The survey of the optimization technique for MapReduce, Journal of Frontiers of Computer Science & Technology, 7, 10, pp. 885-905, (2013)