Fast and memory-efficient minimum spanning tree on the

Cited by: 9
Authors
Rostrup, Scott [1]
Srivastava, Shweta [1]
Singhal, Kishore [1]
Affiliations
[1] Synopsys Inc, 700 East Middlefield Rd, Mountain View, CA USA
Keywords
minimum spanning tree; MST; graphics processing unit; GPU; graph algorithms
DOI
10.1504/IJCSE.2013.052115
Chinese Library Classification
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
The GPU is an efficient accelerator for regular data-parallel workloads, but GPU acceleration is more difficult for graph algorithms and other applications with irregular memory access patterns and large memory footprints. The minimum spanning tree (MST) problem arises in a variety of applications, and its solution exemplifies the difficulties of mapping irregular algorithms to the GPU. In this paper, we present a memory-efficient parallel algorithm for finding the minimum spanning tree of very large graphs by introducing a data-parallel implementation of Kruskal's algorithm. We test scalability and performance on random and real-world graphs with up to 25 million vertices and 240 million edges on an Nvidia Tesla T10 GPU with 4 GB of memory. Our method can process graphs 4X larger and up to 10X faster than was possible with the recently published implementation of Boruvka's MST algorithm for the GPU. We also demonstrate the performance advantage of the proposed method against the multi-core filter-Kruskal MST algorithm on a dual quad-core CPU server with Nehalem X5550 processors.
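For readers unfamiliar with the baseline the abstract builds on, here is a minimal sequential sketch of Kruskal's algorithm with a union-find structure. It is not the authors' data-parallel GPU implementation, which this record does not describe in detail; the function name `kruskal_mst` and the `(weight, u, v)` edge layout are assumptions made for illustration.

```python
def kruskal_mst(num_vertices, edges):
    """Sequential Kruskal sketch (illustrative; not the paper's GPU method).

    edges: iterable of (weight, u, v) tuples with 0-based vertex ids.
    Returns (list of chosen (u, v, weight) edges, total weight).
    """
    parent = list(range(num_vertices))  # union-find forest, one root per vertex

    def find(v):
        # Walk to the root, halving the path along the way.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    chosen = []
    total = 0
    # Process edges in nondecreasing weight order.
    for weight, u, v in sorted(edges):
        root_u, root_v = find(u), find(v)
        if root_u != root_v:          # edge joins two components: keep it
            parent[root_u] = root_v   # merge the components
            chosen.append((u, v, weight))
            total += weight
    return chosen, total


example_edges = [(1, 0, 1), (2, 1, 2), (3, 0, 2), (4, 2, 3)]
mst_edges, mst_weight = kruskal_mst(4, example_edges)
```

The sort and the component checks are the two phases a parallel variant has to restructure; the sequential loop above only shows the logic being parallelized.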
Pages: 21-33
Page count: 13