GPU-accelerated Hungarian algorithms for the Linear Assignment Problem

被引:40
|
作者
Date, Ketan [1 ]
Nagi, Rakesh [1 ]
机构
[1] Univ Illinois, Dept Ind & Enterprise Syst Engn, 117 Transportat Bldg, Urbana, IL 61801 USA
基金
美国国家科学基金会;
关键词
Linear assignment problem; Parallel algorithm; Graphics processing unit; CUDA;
D O I
10.1016/j.parco.2016.05.012
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we describe parallel versions of two different variants (classical and alternating tree) of the Hungarian algorithm for solving the Linear Assignment Problem (LAP). We have chosen Compute Unified Device Architecture (CUDA) enabled NVIDIA Graphics Processing Units (GPU) as the parallel programming architecture because of its ability to perform intense computations on arrays and matrices. The main contribution of this paper is an efficient parallelization of the augmenting path search phase of the Hungarian algorithm. Computational experiments on problems with up to 25 million variables reveal that the GPU-accelerated versions are extremely efficient in solving large problems, as compared to their CPU counterparts. Tremendous parallel speedups are achieved for problems with up to 400 million variables, which are solved within 13 seconds on average. We also tested multi-GPU versions of the two variants on up to 16 GPUs, which show decent scaling behavior for problems with up to 1.6 billion variables and dense cost matrix structure. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:52 / 72
页数:21
相关论文
共 50 条
  • [1] GPU-accelerated parallel algorithms for linear rankSVM
    Jing Jin
    Xianggao Cai
    Guoming Lai
    Xiaola Lin
    [J]. The Journal of Supercomputing, 2015, 71 : 4141 - 4171
  • [2] GPU-accelerated parallel algorithms for linear rankSVM
    Jin, Jing
    Cai, Xianggao
    Lai, Guoming
    Lin, Xiaola
    [J]. JOURNAL OF SUPERCOMPUTING, 2015, 71 (11): : 4141 - 4171
  • [3] GPU-accelerated preconditioned iterative linear solvers
    Ruipeng Li
    Yousef Saad
    [J]. The Journal of Supercomputing, 2013, 63 : 443 - 466
  • [4] GPU-accelerated preconditioned iterative linear solvers
    Li, Ruipeng
    Saad, Yousef
    [J]. JOURNAL OF SUPERCOMPUTING, 2013, 63 (02): : 443 - 466
  • [5] GPU-Accelerated Fixpoint Algorithms for Faster Compiler Analyses
    Blass, Thorsten
    Philippsen, Michael
    [J]. PROCEEDINGS OF THE 28TH INTERNATIONAL CONFERENCE ON COMPILER CONSTRUCTION (CC '19), 2019, : 122 - 134
  • [6] Efficient OLAP algorithms on GPU-accelerated Hadoop clusters
    Hongzhi Wang
    Zheng Wang
    Ning Li
    Xinxin Kong
    [J]. Distributed and Parallel Databases, 2019, 37 : 507 - 542
  • [7] Gpu-accelerated relaxed graph pattern matching algorithms
    Benachour, Amira
    Yahiaoui, Said
    Bouhenni, Sarra
    Kheddouci, Hamamache
    Nouali-Taboudjemat, Nadia
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (15): : 21811 - 21836
  • [8] Efficient OLAP algorithms on GPU-accelerated Hadoop clusters
    Wang, Hongzhi
    Wang, Zheng
    Li, Ning
    Kong, Xinxin
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2019, 37 (04) : 507 - 542
  • [9] GPU-Accelerated Microdosimetry
    Decunha, J.
    Mohan, R.
    [J]. MEDICAL PHYSICS, 2022, 49 (06) : E467 - E468
  • [10] GPU-Accelerated Scalable Solver for Banded Linear Systems
    Liu, Hang
    Seo, Jung-Hee
    Mital, Rajat
    Huang, H. Howie
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013,