Data Partitioning on Multicore and Multi-GPU Platforms Using Functional Performance Models

被引:52
|
作者
Zhong, Ziming [1 ]
Rychkov, Vladimir [1 ]
Lastovetsky, Alexey [1 ]
机构
[1] Univ Coll Dublin, Sch Comp Sci & Informat, Dublin 4, Ireland
基金
爱尔兰科学基金会;
关键词
HPC; heterogeneous computing; GPU-accelerated multicore system; performance modeling; data partitioning; HETEROGENEOUS MULTICORE; EQUATIONS; SYSTEMS;
D O I
10.1109/TC.2014.2375202
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Heterogeneous multiprocessor systems, which are composed of a mix of processing elements, such as commodity multicore processors, graphics processing units (GPUs), and others, have been widely used in scientific computing community. Software applications incorporate the code designed and optimized for different types of processing elements in order to exploit the computing power of such heterogeneous computing systems. In this paper, we consider the problem of optimal distribution of the workload of data-parallel scientific applications between processing elements of such heterogeneous computing systems. We present a solution that uses functional performance models (FPMs) of processing elements and FPM-based data partitioning algorithms. Efficiency of this approach is demonstrated by experiments with parallel matrix multiplication and numerical simulation of lid-driven cavity flow on hybrid servers and clusters.
引用
收藏
页码:2506 / 2518
页数:13
相关论文
共 50 条
  • [41] Performance Analysis of Parallel FFT on Large Multi-GPU Systems
    Ayala, Alan
    Tomov, Stan
    Stoyanov, Miroslav
    Haidar, Azzam
    Dongarra, Jack
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 372 - 381
  • [42] High performance MRI simulations of motion on multi-GPU systems
    Christos G Xanthis
    Ioannis E Venetis
    Anthony H Aletras
    Journal of Cardiovascular Magnetic Resonance, 16
  • [43] Multi-GPU performance of incompressible flow computation by lattice Boltzmann method on GPU cluster
    Xian, Wang
    Takayuki, Aoki
    PARALLEL COMPUTING, 2011, 37 (09) : 521 - 535
  • [44] Inference of dynamic spatial GRN models with multi-GPU evolutionary computation
    Mousavi, Reza
    Konuru, Sri Harsha
    Lobo, Daniel
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)
  • [45] Continuous and Discrete Models of Melanoma Progression Simulated in Multi-GPU Environment
    Dzwinel, Witold
    Klusek, Adrian
    Wcislo, Rafal
    Panuszewska, Marta
    Topa, Pawel
    PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2017), PT I, 2018, 10777 : 505 - 518
  • [46] Multi-GPU simulations of Vlasov's equation using Vlasiator
    Sandroos, A.
    Honkonen, I.
    von Alfthan, S.
    Palmroth, M.
    PARALLEL COMPUTING, 2013, 39 (08) : 306 - 318
  • [47] Workload Partitioning Algorithm Based on Performance Curve of GPU in Heterogeneous Platforms
    Yang, Hongyu
    Chen, Hui
    Li, Chengming
    Jiang, Qingshan
    Cai, Xueyuan
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON ADVANCED CONTROL, AUTOMATION AND ARTIFICIAL INTELLIGENCE (ACAAI 2018), 2018, 155 : 191 - 195
  • [48] Auto-Tuning TRSM with an Asynchronous Task Assignment Model on Multicore, Multi-GPU and Coprocessor systems
    Pinto, Clicia
    Barreto, Marcos
    Boratto, Murilo
    2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [49] Accelerating neural network architecture search using multi-GPU high-performance computing
    Lupion, Marcos
    Cruz, N. C.
    Sanjuan, Juan F.
    Paechter, B.
    Ortigosa, Pilar M.
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (07): : 7609 - 7625
  • [50] Accelerated CFD computations on multi-GPU using OpenMP and OpenACC
    Harshad Bhusare
    Nandan Sarkar
    Debajyoti Kumar
    Somnath Roy
    Sādhanā, 49