Efficient Parallelization of a Two-List Algorithm for the Subset-Sum Problem on a Hybrid CPU/GPU Cluster

被引:2
|
作者
Kang, Letian [1 ]
Wan, Lanjun [1 ]
Li, Kenli [1 ]
机构
[1] Hunan Univ, Coll Informat Sci & Engn, Changsha 410082, Hunan, Peoples R China
关键词
MPI-CUDA implementation; hybrid CPU/GPU cluster; two-list algorithm; subset-sum problem; knapsack problem; KNAPSACK-PROBLEM; MEMORY CONFLICTS;
D O I
10.1109/PAAP.2014.44
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, hybrid CPU/GPU cluster has been widely used to deal with compute-intensive problems, such as the subset-sum problem. The two-list algorithm is a well known approach to solve the problem. However, a hybrid MPI-CUDA dual-level parallelization of the algorithm on the cluster is not straightforward. The key challenge is how to allocate the most suitable workload to each node to achieve good load balancing between nodes and minimize the communication overhead. Therefore, this paper proposes an effective workload distribution scheme which aims to reasonably assign workload to each node. According to this scheme, an efficient MPI-CUDA parallel implementation of a two-list algorithm is presented. A series of experiments are conducted to compare the performance of the hybrid MPI-CUDA implementation with that of the best sequential CPU implementation, the single-node CPU-only implementation, the single-node GPU-only implementation, and the hybrid MPI-OpenMP implementation with same cluster configuration. The results show that the proposed hybrid MPI-CUDA implementation not only offers significant performance benefits but also has excellent scalability.
引用
收藏
页码:93 / 98
页数:6
相关论文
共 36 条
  • [21] Learning Driven Parallelization for Large-Scale Video Workload in Hybrid CPU-GPU Cluster
    Zhang, Haitao
    Tang, Bingchang
    Geng, Xin
    Ma, Huadong
    PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018,
  • [22] A Hybrid Parallel Algorithm for Computer Simulation of Electrocardiogram Based on a CPU-GPU Cluster
    Shen, Wenfeng
    Sun, Lianqiang
    Wei, Daming
    Xu, Weimin
    Wang, Hui
    Zhu, Xin
    2013 IEEE/ACIS 12TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2013, : 167 - 171
  • [23] A Efficient Algorithm for Molecular Dynamics Simulation on Hybrid CPU-GPU Computing Platforms
    Li, Dapu
    Ai, Wei
    Ye, Yu
    Liang, Jie
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1357 - 1363
  • [24] Blocked United Algorithm for the All-Pairs Shortest Paths Problem on Hybrid CPU-GPU Systems
    Matsumoto, Kazuya
    Nakasato, Naohito
    Sedukhin, Stanislav G.
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (12) : 2759 - 2768
  • [25] An efficient algorithm for the subset sum problem based on finite-time convergent recurrent neural network
    Gu, Shenshen
    Cui, Rui
    NEUROCOMPUTING, 2015, 149 : 13 - 21
  • [26] Hybrid GPU-CPU Efficient Implementation of a Parallel Numerical Algorithm for Solving the Cauchy Problem for a Nonlinear Differential Riccati Equation of Fractional Variable Order
    Tverdyi, Dmitrii
    Parovik, Roman
    MATHEMATICS, 2023, 11 (15)
  • [27] An efficient mixed-precision, hybrid CPU-GPU implementation of a nonlinearly implicit one-dimensional particle-in-cell algorithm
    Chen, G.
    Chacon, L.
    Barnes, D. C.
    JOURNAL OF COMPUTATIONAL PHYSICS, 2012, 231 (16) : 5374 - 5388
  • [28] Combining Two Local Searches with Crossover: An Efficient Hybrid Algorithm for the Traveling Salesman Problem
    Liu, Weichen
    Weise, Thomas
    Wu, Yuezhong
    Qi, Qi
    PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'17), 2017, : 298 - 305
  • [29] Efficient Two-Level Hybrid Algorithm for the Refinery Production Scheduling Problem Involving Operational Transitions
    Zhang, Lu
    Jiang, Yongheng
    Gao, Xiaoyong
    Huang, Dexian
    Wang, Ling
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2016, 55 (28) : 7768 - 7781
  • [30] A parallel machine scheduling problem with two-agent and tool change activities: an efficient hybrid metaheuristic algorithm
    Yazdani, Maziar
    Khalili, Seyed Mohammad
    Jolai, Fariborz
    INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2016, 29 (10) : 1075 - 1088