Exploration of OpenCL Heterogeneous Programming for Porting Solidification Modeling to CPU-GPU Platforms

Cited by: 4
Authors
Halbiniak, Kamil [1]
Szustak, Lukasz [1]
Olas, Tomasz [1]
Wyrzykowski, Roman [1]
Gepner, Pawel [2]
Affiliations
[1] Czestochowa Tech Univ, Fac Mech Engn & Comp Sci, Czestochowa, Poland
[2] Warsaw Univ Technol, Fac Prod Engn, Warsaw, Poland
Source
Keywords
heterogeneous computing; portability; CPU-GPU platforms; OpenCL; numerical modeling of solidification; performance-energy trade-off; HYBRID; SIMULATIONS; MPDATA; PARALLELIZATION; DECOMPOSITION; OPTIMIZATION;
DOI
10.1002/cpe.6011
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
This article provides a comprehensive study of OpenCL heterogeneous programming for porting applications to CPU-GPU computing platforms, using a real-life solidification modeling application. The aim is to achieve a flexible workload distribution between available CPU-GPU resources and optimize application performance. Considering the solidification application as a use case, we explore the steps required for (i) adaptation of an application to CPU-GPU platforms, and (ii) mapping the application workload onto the OpenCL programming model. The adaptation is based on a reformulation of steps developed previously for CPU-MIC architectures. The mapping process allows us to utilize OpenCL for harnessing CPU and GPU cores using data parallelism, as well as for the management of available compute devices with task parallelism. The performance and energy efficiency of the resulting OpenCL code are experimentally studied for two platforms with powerful GPUs of various generations (with Kepler and Volta architectures). The experiments confirm the performance advantage of using computing resources of both GPUs and CPUs. The achieved benefit depends on the relationship between the computing power of CPUs and GPUs. Moreover, this gain comes with a higher average power draw, which increases the energy consumed during application execution.
Pages: 23
Related papers
50 in total
  • [1] CoopCL: Cooperative Execution of OpenCL Programs on Heterogeneous CPU-GPU Platforms
    Moren, Konrad
    Goehringer, Diana
    [J]. 2020 28TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2020), 2020, : 224 - 231
  • [2] Analyzing OpenCL 2.0 Workloads Using a Heterogeneous CPU-GPU Simulator
    Wang, Li
    Tsai, Ren-Wei
    Wang, Shao-Chung
    Chen, Kun-Chih
    Wang, Po-Han
    Cheng, Hsiang-Yun
    Lee, Yi-Chung
    Shu, Sheng-Jie
    Yang, Chun-Chieh
    Hsu, Min-Yih
    Kan, Li-Chen
    Lee, Chao-Lin
    Yu, Tzu-Chieh
    Peng, Rih-Ding
    Yang, Chia-Lin
    Hwang, Yuan-Shin
    Lee, Jenq-Kuen
    Tsao, Shiao-Li
    Ouhyoung, Ming
    [J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2017, : 127 - 128
  • [3] OpenCL as a Unified Programming Model for Heterogeneous CPU/GPU Clusters
    Kim, Jungwon
    Seo, Sangmin
    Lee, Jun
    Nah, Jeongho
    Jo, Gangwon
    Lee, Jaejin
    [J]. ACM SIGPLAN NOTICES, 2012, 47 (08) : 299 - 300
  • [4] Automatic Mapping for OpenCL-Programs on CPU/GPU Heterogeneous Platforms
    Moren, Konrad
    Goehringer, Diana
    [J]. COMPUTATIONAL SCIENCE - ICCS 2018, PT II, 2018, 10861 : 301 - 314
  • [5] Sparse matrix partitioning for optimizing SpMV on CPU-GPU heterogeneous platforms
    Benatia, Akrem
    Ji, Weixing
    Wang, Yizhuo
    Shi, Feng
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2020, 34 (01) : 66 - 80
  • [6] Porting a Neuro-Imaging Application to a CPU-GPU cluster
    Nakhjavani, Reza Sina
    Sharify, Sahel
    Hashemi, Ali B.
    Lu, Alan W.
    Amza, Cristiana
    Strother, Stephen
    [J]. 2014 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2014, : 137 - 145
  • [7] Exploration on Task Scheduling Strategy for CPU-GPU Heterogeneous Computing System
    Fang, Juan
    Zhang, Jiaxing
    Lu, Shuaibing
    Zhao, Hui
    [J]. 2020 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2020), 2020, : 306 - 311
  • [8] ACCELERATING LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION ON HETEROGENEOUS CPU-GPU PLATFORMS
    Kim, Jungsuk
    Lane, Ian
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [9] Analysis of Energy Efficiency of a Parallel AES Algorithm for CPU-GPU Heterogeneous Platforms
    Fei, Xiongwei
    Li, Kenli
    Yang, Wangdong
    Li, Keqin
    [J]. 2019 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2019, : 499 - 508
  • [10] A load balancing method in accelerating Kriging algorithm on CPU-GPU heterogeneous platforms
    Jiang, Chunlei
    Zhang, Shuqing
    [J]. Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2015, 37 (05) : 35 - 39