Towards High-Performance Code Generation for Multi-GPU Clusters Based on a Domain-Specific Language for Algorithmic Skeletons

被引:0
|
作者
Wrede, Fabian [1 ]
Kuchen, Herbert [1 ]
机构
[1] Univ Munster, European Res Ctr Informat Syst ERCIS, Dept Informat Syst, Leonardo Campus 3, D-48149 Munster, Germany
关键词
Algorithmic skeletons; Parallel programming; High-performance computing; Model-driven development; Domain-specific language;
D O I
10.1007/s10766-020-00659-x
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In earlier work, we defined a domain-specific language (DSL) with the aim to provide an easy-to-use approach for programming multi-core and multi-GPU clusters. The DSL incorporates the idea of utilizing algorithmic skeletons, which are well-known patterns for parallel programming, such as map and reduce. Based on the chosen skeleton, a user-defined function can be applied to a data structure in parallel with the main advantage that the user does not have to worry about implementation details. So far, we had only implemented a generator for multi-core clusters and in this paper we present and evaluate two prototypes of generators for multi-GPU clusters, which are based on OpenACC and CUDA. We have evaluated the approach with four benchmark applications. The results show that the generation approach leads to execution times, which are on par with an alternative library implementation.
引用
收藏
页码:713 / 728
页数:16
相关论文
共 50 条
  • [31] LIFT: A Functional Data-Parallel IR for High-Performance GPU Code Generation
    Steuwer, Michel
    Remmelg, Toomas
    Dubach, Christophe
    [J]. CGO'17: PROCEEDINGS OF THE 2017 INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2017, : 74 - 85
  • [32] Towards a performance-portable description of geometric multigrid algorithms using a domain-specific language
    Membarth, Richard
    Reiche, Oliver
    Schmitt, Christian
    Hannig, Frank
    Teich, Juergen
    Stuermer, Markus
    Koestler, Harald
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2014, 74 (12) : 3191 - 3201
  • [33] The High Performance Computing for 3D Dynamic Holographic Simulation Based on Multi-GPU Cluster
    Zhang Yingxi
    Lin Tingyu
    Guo Liqin
    [J]. THEORY, METHODOLOGY, TOOLS AND APPLICATIONS FOR MODELING AND SIMULATION OF COMPLEX SYSTEMS, PT I, 2016, 643 : 431 - 441
  • [34] Automatic compiler/interpreter generation from programs for Domain-Specific Languages: Code bloat problem and performance improvement
    Kovacevic, Zeljko
    Ravber, Miha
    Liu, Shih-Hsi
    Crepinsek, Matej
    [J]. JOURNAL OF COMPUTER LANGUAGES, 2022, 70
  • [35] Automatic compiler/interpreter generation from programs for Domain-Specific Languages: Code bloat problem and performance improvement
    Kovačević, Željko
    Ravber, Miha
    Liu, Shih-Hsi
    Črepinšek, Matej
    [J]. Journal of Computer Languages, 2022, 70
  • [36] Introduction to the JPDC Special Issue on Domain-Specific Languages and High-Level Frameworks for High-Performance Computing
    Krishnamoorthy, Sriram
    Ramanujam, J.
    Sadayappan, P.
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2014, 74 (12) : 3175 - 3175
  • [37] Custom High-Performance Vector Code Generation for Data-Specific Sparse Computations
    Horro, Marcos
    Pouchet, Louis-Noel
    Rodriguez, Gabriel
    Tourino, Juan
    [J]. PROCEEDINGS OF THE 2022 31ST INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PACT 2022, 2022, : 160 - 171
  • [38] GFlow: Towards GPU-based High-Performance Table Matching in OpenFlow Switches
    Qiu, Kun
    Chen, Zhe
    Chen, Yang
    Zhao, Jin
    Wang, Xin
    [J]. 2015 INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN), 2015, : 283 - 288
  • [39] A GPU-based Algorithm-specific Optimization for High-performance Background Subtraction
    Zhang, Chulian
    Tabkhi, Hamed
    Schirner, Gunar
    [J]. 2014 43RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2014, : 182 - 191
  • [40] High-Level Synthesis Revised: Generation of FPGA Accelerators from a Domain-Specific Language using the Polyhedron Model
    Schmid, Moritz
    Hannig, Frank
    Tanase, Alexandru
    Teich, Juergen
    [J]. PARALLEL COMPUTING: ACCELERATING COMPUTATIONAL SCIENCE AND ENGINEERING (CSE), 2014, 25 : 497 - 506