Efficient Parallelization of a Two-List Algorithm for the Subset-Sum Problem on a Hybrid CPU/GPU Cluster

被引:2
|
作者
Kang, Letian [1 ]
Wan, Lanjun [1 ]
Li, Kenli [1 ]
机构
[1] Hunan Univ, Coll Informat Sci & Engn, Changsha 410082, Hunan, Peoples R China
关键词
MPI-CUDA implementation; hybrid CPU/GPU cluster; two-list algorithm; subset-sum problem; knapsack problem; KNAPSACK-PROBLEM; MEMORY CONFLICTS;
D O I
10.1109/PAAP.2014.44
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, hybrid CPU/GPU cluster has been widely used to deal with compute-intensive problems, such as the subset-sum problem. The two-list algorithm is a well known approach to solve the problem. However, a hybrid MPI-CUDA dual-level parallelization of the algorithm on the cluster is not straightforward. The key challenge is how to allocate the most suitable workload to each node to achieve good load balancing between nodes and minimize the communication overhead. Therefore, this paper proposes an effective workload distribution scheme which aims to reasonably assign workload to each node. According to this scheme, an efficient MPI-CUDA parallel implementation of a two-list algorithm is presented. A series of experiments are conducted to compare the performance of the hybrid MPI-CUDA implementation with that of the best sequential CPU implementation, the single-node CPU-only implementation, the single-node GPU-only implementation, and the hybrid MPI-OpenMP implementation with same cluster configuration. The results show that the proposed hybrid MPI-CUDA implementation not only offers significant performance benefits but also has excellent scalability.
引用
收藏
页码:93 / 98
页数:6
相关论文
共 36 条
  • [1] An optimal and scalable parallelization of the two-list algorithm for the subset-sum problem
    Sanches, C. A. A.
    Soma, N. Y.
    Yanasse, H. H.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 176 (02) : 870 - 879
  • [2] A novel cooperative accelerated parallel two-list algorithm for solving the subset-sum problem on a hybrid CPU-GPU cluster
    Wan, Lanjun
    Li, Kenli
    Li, Keqin
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2016, 97 : 112 - 123
  • [3] GPU implementation of a parallel two-list algorithm for the subset-sum problem
    Wan, Lanjun
    Li, Kenli
    Liu, Jing
    Li, Keqin
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (01): : 119 - 145
  • [4] Efficient CPU-GPU cooperative computing for solving the subset-sum problem
    Wan, Lanjun
    Li, Kenli
    Liu, Jing
    Li, Keqin
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (02): : 492 - 516
  • [5] An efficient solution to the subset-sum problem on GPU
    Curtis, V. V.
    Sanches, C. A. A.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (01): : 95 - 113
  • [6] A low-space algorithm for the subset-sum problem on GPU
    Curtis, V. V.
    Sanches, C. A. A.
    COMPUTERS & OPERATIONS RESEARCH, 2017, 83 : 120 - 124
  • [7] A FAST APPROXIMATION ALGORITHM FOR THE SUBSET-SUM PROBLEM
    GENS, G
    LEVNER, E
    INFOR, 1994, 32 (03) : 143 - 148
  • [8] An improved balanced algorithm for the subset-sum problem
    Curtis, V. V.
    Sanches, C. A. A.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2019, 275 (02) : 460 - 466
  • [9] New algorithm for dense subset-sum problem
    Chaimovich, M
    ASTERISQUE, 1999, (258) : 363 - 373
  • [10] An efficient approximation scheme for the Subset-Sum Problem
    Kellerer, H
    Pferschy, U
    Speranza, MG
    ALGORITHMS AND COMPUTATION, PROCEEDINGS, 1997, 1350 : 394 - 403