A robust, efficient, and balanced parallel algorithm for finding connected components

被引:0
|
作者
Asokan, M. [1 ]
机构
[1] Syncsort Inc, Pearl River, NY 10965 USA
关键词
Graph Mining; Connected Components; Hadoop; MapReduce; MAPREDUCE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finding connected components in an undirected graph has many practical applications. For example in a graph representing a social network, a connected component represents a group of related individuals with common interest. Also, finding connected components forms the basis for other clustering algorithms. In this paper, we will present a parallel algorithm which uses the well known sequential algorithm as the basis for finding connected components in an undirected graph. The algorithm can be adopted to run on a single computer with multiple cores or MapReduce. It is robust in the sense that it honors memory limits. This is important in today's containerized environments. It balances the workload even in the presence of data skew. For the best known algorithm running in MapReduce, the number of iterations is the square of the logarithmic function of the number of vertices in the graph. For our algorithm, we will prove that the upper bound on the number of iterations is a logarithmic function of the maximum size of a connected component. In each iteration, the amount of data read from or written to a file system is bounded by four times the number of edges in the graph.
引用
下载
收藏
页码:2113 / 2118
页数:6
相关论文
共 50 条
  • [21] Efficient scan mask techniques for connected components labeling algorithm
    Phaisarn Sutheebanjard
    Wichian Premchaiswadi
    EURASIP Journal on Image and Video Processing, 2011
  • [22] An Efficient Strongly Connected Components Algorithm in the Fault Tolerant Model
    Baswana, Surender
    Choudhary, Keerti
    Roditty, Liam
    ALGORITHMICA, 2019, 81 (03) : 967 - 985
  • [23] Efficient algorithm to compute mutually connected components in interdependent networks
    Hwang, S.
    Choi, S.
    Lee, Deokjae
    Kahng, B.
    PHYSICAL REVIEW E, 2015, 91 (02)
  • [24] An Efficient Strongly Connected Components Algorithm in the Fault Tolerant Model
    Department of Computer Science and Engineering, IIT, Kanpur, India
    不详
    Algorithmica, 3 (967-985):
  • [25] Efficient scan mask techniques for connected components labeling algorithm
    Sutheebanjard, Phaisarn
    Premchaiswadi, Wichian
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2011,
  • [26] An Efficient Strongly Connected Components Algorithm in the Fault Tolerant Model
    Surender Baswana
    Keerti Choudhary
    Liam Roditty
    Algorithmica, 2019, 81 : 967 - 985
  • [27] A coarse-grained parallel algorithm for spanning tree and connected components
    Cáceres, EN
    Dehne, F
    Mongelli, H
    Song, SW
    Szwarcfiter, JL
    EURO-PAR 2004 PARALLEL PROCESSING, PROCEEDINGS, 2004, 3149 : 828 - 831
  • [28] Parallel algorithm for finding the minimum edges to build a strongly connected directed graph
    Tada, Akio
    Mukai, Eiichi
    Migita, Masahiro
    Itokawa, Tsuyoshi
    APPLIED MATHEMATICS FOR SCIENCE AND ENGINEERING, 2007, : 100 - +
  • [29] PARALLEL ALGORITHM FOR FINDING MINIMUM EDGES REQUIRED TO MAKE A DAG STRONGLY CONNECTED
    Itokawa, Tsuyoshi
    Tada, Akio
    Migita, Masahiro
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2009, 5 (03): : 581 - 588
  • [30] LACC: A Linear-Algebraic Algorithm for Finding Connected Components in Distributed Memory
    Azad, Ariful
    Buluc, Aydin
    2019 IEEE 33RD INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2019), 2019, : 2 - 12