A novel cooperative accelerated parallel two-list algorithm for solving the subset-sum problem on a hybrid CPU-GPU cluster

被引:6
|
作者
Wan, Lanjun [1 ,2 ]
Li, Kenli [1 ,2 ]
Li, Keqin [1 ,2 ,3 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China
[2] Natl Supercomp Ctr Changsha, Changsha 410082, Hunan, Peoples R China
[3] SUNY Coll New Paltz, Dept Comp Sci, New Paltz, NY 12561 USA
基金
中国国家自然科学基金; 对外科技合作项目(国际科技项目);
关键词
Heterogeneous cooperative computing; Hybrid CPU-GPU cluster; Hybrid programming model; Subset-sum problem; Two-list algorithm; Workload distribution; KNAPSACK-PROBLEM;
D O I
10.1016/j.jpdc.2016.07.003
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many parallel algorithms have recently been developed to accelerate solving the subset-sum problem on a heterogeneous CPU-GPU system. However, within each compute node, only one CPU core is used to control one GPU and all the remaining CPU cores are in idle state, which leads to a large number of CPU cores being wasted. In this paper, based on a cost-optimal parallel two-list algorithm, we propose a novel heterogeneous cooperative computing approach to solve the subset-sum problem on a hybrid CPU-GPU cluster, which can make full use of all available computational resources of a cluster. The unbalanced workload distribution and the huge communication overhead are two main obstacles for the heterogeneous cooperative computing. In order to assign the most suitable workload to each compute node and reasonably partition it between CPU and GPU within each node, and minimize the inter-node and intra-node communication costs, we design a communication-avoiding workload distribution scheme suitable for the parallel two-list algorithm. According to this scheme, we provide an efficient heterogeneous cooperative implementation of the algorithm. A series of experiments are conducted on a hybrid CPU-GPU cluster, where each node has two 6-core CPUs and one GPU. The results show that the heterogeneous cooperative computing significantly outperforms the CPU-only or GPU-only computing. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:112 / 123
页数:12
相关论文
共 6 条
  • [1] Efficient Parallelization of a Two-List Algorithm for the Subset-Sum Problem on a Hybrid CPU/GPU Cluster
    Kang, Letian
    Wan, Lanjun
    Li, Kenli
    2014 SIXTH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP), 2014, : 93 - 98
  • [2] GPU implementation of a parallel two-list algorithm for the subset-sum problem
    Wan, Lanjun
    Li, Kenli
    Liu, Jing
    Li, Keqin
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (01): : 119 - 145
  • [3] Efficient CPU-GPU cooperative computing for solving the subset-sum problem
    Wan, Lanjun
    Li, Kenli
    Liu, Jing
    Li, Keqin
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (02): : 492 - 516
  • [4] An optimal and scalable parallelization of the two-list algorithm for the subset-sum problem
    Sanches, C. A. A.
    Soma, N. Y.
    Yanasse, H. H.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 176 (02) : 870 - 879
  • [5] A Hybrid Parallel Algorithm for Computer Simulation of Electrocardiogram Based on a CPU-GPU Cluster
    Shen, Wenfeng
    Sun, Lianqiang
    Wei, Daming
    Xu, Weimin
    Wang, Hui
    Zhu, Xin
    2013 IEEE/ACIS 12TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2013, : 167 - 171
  • [6] Hybrid GPU-CPU Efficient Implementation of a Parallel Numerical Algorithm for Solving the Cauchy Problem for a Nonlinear Differential Riccati Equation of Fractional Variable Order
    Tverdyi, Dmitrii
    Parovik, Roman
    MATHEMATICS, 2023, 11 (15)