Finding and Counting Tree-Like Subgraphs Using MapReduce

被引:2
|
作者
Zhao, Zhao [1 ,2 ]
Chen, Langshi [3 ]
Avram, Mihai [3 ]
Li, Meng [3 ]
Wang, Guanying [4 ]
Butt, Ali [1 ,2 ]
Khan, Maleq [5 ]
Marathe, Madhav [1 ,2 ]
Qiu, Judy [6 ]
Vullikanti, Anil [1 ,2 ]
机构
[1] Virginia Tech, Biocomplex Inst, Network Dynam & Simulat Sci Lab, Blacksburg, VA 24061 USA
[2] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA
[3] Indiana Univ, Dept Comp Sci, Bloomington, IN 47405 USA
[4] Google Inc, Mountain View, CA 94043 USA
[5] Texas A&M Univ Kingsville, Dept Elect Engn & Comp Sci, Kingsville, TX 78363 USA
[6] Indiana Univ, Intelligent Syst Engn Dept, Bloomington, IN 47405 USA
基金
美国国家科学基金会;
关键词
Subgraph isomorphism; color coding; approximation algorithm; MapReduce; Hadoop; harp;
D O I
10.1109/TMSCS.2017.2768426
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Several variants of the subgraph isomorphism problem, e.g., finding, counting, and estimating frequencies of subgraphs in networks arise in a number of real world applications, such as web analysis, disease diffusion prediction, and social network analysis. These problems are computationally challenging in having to scale to very large networks with millions of vertices. In this paper, we present SAHAD, a MapReduce algorithm for detecting and counting trees of bounded size using the elegant color coding technique developed by N. Alon et al. SAHAD is a randomized algorithm, and we show rigorous bounds on the approximation quality and the performance of it. SAHAD scales to very large networks comprising of 10(7) - 10(8) vertices and 10(8) - 10(9) edges and tree-like (acyclic) templates with up to 12 vertices. Further, we extend our results by implementing SAHAD in the Harp framework, which is more of a high performance computing environment. The new implementation gives 100x improvement in performance over the standard Hadoop implementation and achieves better performance than state-of-the-art MPI solutions on larger graphs.
引用
收藏
页码:217 / 230
页数:14
相关论文
共 50 条
  • [21] Contextuality in Tree-Like Graphs
    Alexei A. Strakhov
    Vladimir I. Man’ko
    Journal of Russian Laser Research, 2014, 35 : 609 - 616
  • [22] TREE-LIKE HIERARCHY OF SYSTEMS
    LIN, Y
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 1988, 14 (01) : 33 - 44
  • [23] SubGraph2Vec: Highly-Vectorized Tree-like Subgraph Counting
    Chen, Langshi
    Li, Jiayu
    Sahinaip, Cenk
    Marathe, Madhav
    Vullikanti, Anil
    Nikolaev, Andrey
    Smirnov, Egor
    Israfilov, Ruslan
    Qiu, Judy
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 483 - 492
  • [24] ON UNIVERSAL TREE-LIKE CONTINUA
    ROGERS, JW
    NOTICES OF THE AMERICAN MATHEMATICAL SOCIETY, 1969, 16 (01): : 205 - &
  • [25] Corners in Tree-Like Tableaux
    Hitczenko, Pawel
    Lohss, Amanda
    ELECTRONIC JOURNAL OF COMBINATORICS, 2016, 23 (04):
  • [26] HOMOGENEOUS TREE-LIKE CONTINUA
    LEWIS, W
    PROCEEDINGS OF THE AMERICAN MATHEMATICAL SOCIETY, 1981, 82 (03) : 470 - 472
  • [27] Continuous tree-like scales
    Cummings, James
    CENTRAL EUROPEAN JOURNAL OF MATHEMATICS, 2010, 8 (02): : 314 - 318
  • [28] Digital Form-finding of Tree-like Structures Based on Wet Threads Experiment
    Lavasani, Mona Alibabaye
    Matini, Mohammad Reza
    Khaghani, Saeed
    BAGH-E NAZAR, 2024, 21 (134): : 39 - 46
  • [29] Novel machine learning approach for shape-finding design of tree-like structures
    Du, Wenfeng
    Zhao, Yannan
    Wang, Yingqi
    Wang, Hui
    Yang, Mijia
    COMPUTERS & STRUCTURES, 2022, 261
  • [30] Tight stationarity and tree-like scales
    Chen, William
    ANNALS OF PURE AND APPLIED LOGIC, 2015, 166 (10) : 1019 - 1036