Optimizing GPU Deep Learning Operators with Polyhedral Scheduling Constraint Injection

Cited by: 4
Authors
Bastoul, Cedric [1]
Zhang, Zhen [1]
Razanajato, Harenome [1]
Lossing, Nelson [1]
Susungi, Adilla [1]
de Juan, Javier [1]
Filhol, Etienne [1]
Jarry, Baptiste [1]
Consolaro, Gianpietro [1]
Zhang, Renwei [2]
Affiliations
[1] Huawei Technol France, Paris, France
[2] Huawei Technol Co Ltd, Beijing, Peoples R China
Keywords
Polyhedral model; scheduling; vectorization
DOI
10.1109/CGO53902.2022.9741260
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Automatic parallel code generation from high-level abstractions, such as those manipulated by artificial intelligence and deep learning (AI/DL) frameworks, relies heavily on compiler techniques for automatic parallelization and optimization. Many recent advances rely on the polyhedral framework for this task because of its ability to model and apply a wide range of loop transformations. However, modeling the complexity of the target architecture, and building efficient cost models to decide on the best transformation, is in general out of reach for a framework based on linear/affine constraints. In this work, we propose to decouple the polyhedral framework into linear and non-linear components. We introduce the constraint tree abstraction, which may be generated by a non-linear optimizer and injected into the polyhedral optimization process to build better solutions. We show how to benefit from such a mechanism to generate efficient GPU code for AI/DL operators. Our constraint injection makes it possible to drive the polyhedral scheduler towards efficient solutions for load/store vectorization that rely on both memory coalescing and vector types. We implemented our scheduler supporting constraint injection, together with our constraint construction system, within a production AI/DL framework. Experiments on well-known neural networks show the efficiency of this approach with respect to state-of-the-art polyhedral scheduling for GPU.
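Illustrative note (not part of the published abstract): the abstract names memory coalescing and vector types as the two ingredients of efficient load/store vectorization. The toy Python model below is a rough sketch of why the mapping of threads to memory matters. It is not the paper's scheduler, constraint tree, or cost model. The 128-byte segment size, the warp size of 32, the 4-byte element size, and the helper names `warp_access` and `segments_touched` are all assumptions made for this illustration.

```python
# Toy model (assumption-laden sketch): count how many aligned memory
# segments one warp-wide load instruction touches under different
# thread-to-element mappings.

SEGMENT_BYTES = 128  # assumed memory transaction granularity
WARP_SIZE = 32       # assumed number of threads issuing the load together
ELEM_BYTES = 4       # float32 elements


def warp_access(thread_stride_elems, vector_width=1):
    """Byte addresses of all elements read by one warp-wide load.

    Thread t reads `vector_width` consecutive elements starting at
    element index t * thread_stride_elems.
    """
    return [
        (t * thread_stride_elems + v) * ELEM_BYTES
        for t in range(WARP_SIZE)
        for v in range(vector_width)
    ]


def segments_touched(byte_addresses):
    """Number of distinct aligned segments covered by the addresses."""
    return len({addr // SEGMENT_BYTES for addr in byte_addresses})


# Threads walk down a column of a row-major matrix with 1024 columns:
# every thread lands in a different segment.
strided = warp_access(thread_stride_elems=1024)

# Adjacent threads read adjacent elements: the warp shares one segment.
coalesced = warp_access(thread_stride_elems=1)

# Each thread reads 4 adjacent elements (a float4-like vector load):
# still contiguous across the warp, but 4x the data per instruction.
vectorized = warp_access(thread_stride_elems=4, vector_width=4)

for name, addrs in [("strided", strided),
                    ("coalesced", coalesced),
                    ("vectorized", vectorized)]:
    print(name, "elements:", len(addrs), "segments:", segments_touched(addrs))
```

In this model the strided mapping touches 32 segments to fetch 32 elements, the coalesced mapping touches 1, and the vectorized mapping fetches 128 elements from 4 fully used segments with a single instruction per thread. A scheduler that keeps the thread dimension on the contiguous array dimension, and leaves a short contiguous inner run per thread, enables the second and third patterns.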
Pages: 313-324
Number of pages: 12