MPtostream: an OpenMP compiler for CPU-GPU heterogeneous parallel systems

被引:0
|
作者
XueJun Yang
Tao Tang
GuiBin Wang
Jia Jia
XinHai Xu
机构
[1] National University of Defense Technology,National Laboratory for Parallel and Distributed Processing
来源
关键词
GPGPU; stream; OpenMP; compiler;
D O I
暂无
中图分类号
学科分类号
摘要
In light of GPUs’ powerful floating-point operation capacity, heterogeneous parallel systems incorporating general purpose CPUs and GPUs have become a highlight in the research field of high performance computing(HPC). However, due to the complexity of programming on GPUs, porting a large number of existing scientific computing applications to the heterogeneous parallel systems remains a big challenge. The OpenMP programming interface is widely adopted on multi-core CPUs in the field of scientific computing. To effectively inherit existing OpenMP applications and reduce the transplant cost, we extend OpenMP with a group of compiler directives, which explicitly divide tasks among the CPU and the GPU, and map time-consuming computing fragments to run on the GPU, thus dramatically simplifying the transplantation. We have designed and implemented MPtoStream, a compiler of the extended OpenMP for AMD’s stream processing GPUs. Our experimental results show that programming with the extended directives deviates from programming with OpenMP by less than 11% modification and achieves significant speedup ranging from 3.1 to 17.3 on a heterogeneous system, incorporating an Intel Xeon E5405 CPU and an AMD FireStream 9250 GPU, over the execution on the Xeon CPU alone.
引用
收藏
页码:1961 / 1971
页数:10
相关论文
共 50 条
  • [1] MPtostream:an OpenMP compiler for CPU-GPU heterogeneous parallel systems
    YANG XueJun
    [J]. Science China(Information Sciences), 2012, 55 (09) : 1961 - 1971
  • [2] MPtostream: an OpenMP compiler for CPU-GPU heterogeneous parallel systems
    Yang XueJun
    Tang Tao
    Wang GuiBin
    Jia Jia
    Xu XinHai
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2012, 55 (09) : 1961 - 1971
  • [3] Performance Optimization for CPU-GPU Heterogeneous Parallel System
    Wang, Yanhua
    Qiao, Jianzhong
    Lin, Shukuan
    Zhao, Tinglei
    [J]. 2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1259 - 1266
  • [4] Heterogeneous parallel_for Template for CPU-GPU Chips
    Navarro, Angeles
    Corbera, Francisco
    Rodriguez, Andres
    Vilches, Antonio
    Asenjo, Rafael
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 47 (02) : 213 - 233
  • [5] Transparent CPU-GPU Collaboration for Data-Parallel Kernels on Heterogeneous Systems
    Lee, Janghaeng
    Samadi, Mehrzad
    Park, Yongjun
    Mahlke, Scott
    [J]. 2013 22ND INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT), 2013, : 245 - 255
  • [6] Performance Analysis of AES on CPU-GPU Heterogeneous Systems
    Sanz, Victoria
    Pousa, Adrian
    Naiouf, Marcelo
    De Giusti, Armando
    [J]. CLOUD COMPUTING, BIG DATA & EMERGING TOPICS, JCC-BD&ET 2022, 2022, 1634 : 31 - 42
  • [7] Parallel Smoothers in Multigrid Method for Heterogeneous CPU-GPU Environment
    Iyer, Neha
    Ganesan, Sashikumaar
    [J]. PARALLEL COMPUTING: TECHNOLOGY TRENDS, 2020, 36 : 114 - 123
  • [8] Efficient Pattern Matching on CPU-GPU Heterogeneous Systems
    Sanz, Victoria
    Pousa, Adrian
    Naiouf, Marcelo
    De Giusti, Armando
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING (ICA3PP 2019), PT I, 2020, 11944 : 391 - 403
  • [9] Efficient Matrix Factorization on Heterogeneous CPU-GPU Systems
    Yu, Yuanhang
    Wen, Dong
    Zhang, Ying
    Wang, Xiaoyang
    Zhang, Wenjie
    Lin, Xuemin
    [J]. 2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 1871 - 1876
  • [10] Heterogeneous programming using OpenMP and CUDA/HIP for hybrid CPU-GPU scientific applications
    Tallada, Marc Gonzalez
    Morancho, Enric
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2023, 37 (05): : 626 - 646