共 39 条
- [1] Dietze R, Runger G., The search-based scheduling algorithm HP* for parallel tasks on heterogeneous platforms, Concurrency and Computation: Practice and Experience, 32, 21, (2020)
- [2] OpenCL
- [3] Pennycook S J, Hammond S D, Wright S A, Et al., An investigation of the performance portability of OpenCL[J], Journal of Parallel and Distributed Computing, 73, 11, pp. 1439-1450, (2013)
- [4] Grewe D, Wang Zheng, O'Boyle M F P., Portable mapping of data parallel programs to OpenCL for heterogeneous systems, Proc of the 11th IEEE/ACM Int Symp on Code Generation and Optimization (CGO), (2013)
- [5] Balasalle J, Lopez M A, Rutherford M J., Optimizing Memory Access Patterns for Cellular Automata on GPUs[M], GPU Computing Gems Jade Edition, pp. 67-75, (2012)
- [6] Shen Yuan, Yan Hanbing, Xia Chunhe, Et al., A novel method for malware clone detection based on deep learning[J/OL], Journal of Beijing University of Aeronautics and Astronautics, (2021)
- [7] Cummins C, Petoumenos P, Murray A, Et al., Compiler fuzzing through deep learning[C], Proc of the 27th ACM SIGSOFT Int Symp on Software Testing and Analysis, pp. 95-105, (2018)
- [8] Ruoqin Lin, Qiong Luo, Software vulnerability detection algorithm based on deformable convolutional neural network[J], Computer Integrated Manufacturing Systems, 38, 3, (2021)
- [9] Cummins C, Petoumenos P, Wang Zheng, Et al., End-to-end deep learning of optimization heuristics[C], Proc of the 26th Int Conf on Parallel Architectures and Compilation Techniques (PACT), pp. 219-232, (2017)
- [10] Tianqi Chen, Moreau T, Jiang Ziheng, Et al., TVM: An automated end-to-end optimizing compiler for deep learning[C], Proc of the 13th USENIX Symp on Operating Systems Design and Implementation (OSDI’18), pp. 578-594, (2018)