HPSM: A Programming Framework for Multi-CPU and Multi-GPU Systems

被引:2
|
作者
Lima, Joao V. F. [1 ]
Di Domenico, Daniel [1 ]
机构
[1] Univ Fed Santa Maria, Santa Maria, RS, Brazil
关键词
ARCHITECTURES;
D O I
10.1109/SBAC-PADW.2017.14
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a high-level C++ framework to explore multi-CPU and multi-GPU systems called HPSM. HPSM enables parallel loops and reductions implemented over three parallel backends: Serial, OpenMP (with GCC and libKOMP runtime), and StarPU. We evaluated HPSM development effort with AXPY program, and performance with three parallel benchmarks: N-Body, Hotspot, and CFD solver. The CPU-GPU combination attained better performance than only GPUs for all cases on a CPU-GPU system. Still, our findings provide evidence that NUMA affinity at framework level may produce different results.
引用
收藏
页码:31 / 36
页数:6
相关论文
共 50 条
  • [1] HPSM: a programming framework to exploit multi-CPU and multi-GPU systems simultaneously
    Ferreira Lima, Joao Vicente
    Di Domenico, Daniel
    [J]. INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2019, 10 (03) : 201 - 211
  • [2] Multi-CPU/Multi-GPU Based Framework for Multimedia Processing
    Mahmoudi, Sidi Ahmed
    Manneback, Pierre
    [J]. COMPUTER SCIENCE AND ITS APPLICATIONS, CIIA 2015, 2015, 456 : 54 - 65
  • [3] Financial applications on multi-CPU and multi-GPU architectures
    Emilio Castillo
    Cristóbal Camarero
    Ana Borrego
    Jose Luis Bosque
    [J]. The Journal of Supercomputing, 2015, 71 : 729 - 739
  • [4] Financial applications on multi-CPU and multi-GPU architectures
    Castillo, Emilio
    Camarero, Cristobal
    Borrego, Ana
    Luis Bosque, Jose
    [J]. JOURNAL OF SUPERCOMPUTING, 2015, 71 (02): : 729 - 739
  • [5] Financial applications on multi-CPU and multi-GPU architectures
    Department of Computer Science and Electronics, Universidad de Cantabria, Santander, Spain
    不详
    [J]. J Supercomput, 2 (729-739):
  • [6] Multi-GPU and Multi-CPU Parallelization for Interactive Physics Simulations
    Hermann, Everton
    Raffin, Bruno
    Faure, Francois
    Gautier, Thierry
    Allard, Jeremie
    [J]. EURO-PAR 2010 - PARALLEL PROCESSING, PART II, 2010, 6272 : 235 - 246
  • [7] An Open Benchmark Implementation for Multi-CPU Multi-GPU Pedestrian Detection in Automotive Systems
    Maria Trompouki, Matina
    Kosmidis, Leonidas
    Navarro, Nacho
    [J]. 2017 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2017, : 305 - 312
  • [8] Strategies for maximizing utilization on multi-CPU and multi-GPU heterogeneous architectures
    Angeles Navarro
    Antonio Vilches
    Francisco Corbera
    Rafael Asenjo
    [J]. The Journal of Supercomputing, 2014, 70 : 756 - 771
  • [9] Multi-GPU and multi-CPU accelerated FDTD scheme for vibroacoustic applications
    Frances, J.
    Otero, B.
    Bleda, S.
    Gallego, S.
    Neipp, C.
    Marquez, A.
    Belendez, A.
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2015, 191 : 43 - 51
  • [10] Strategies for maximizing utilization on multi-CPU and multi-GPU heterogeneous architectures
    Navarro, Angeles
    Vilches, Antonio
    Corbera, Francisco
    Asenjo, Rafael
    [J]. JOURNAL OF SUPERCOMPUTING, 2014, 70 (02): : 756 - 771