A distributed approach for graph mining in massive networks

被引:56
|
作者
Talukder, N. [1 ]
Zaki, M. J. [1 ]
机构
[1] Rensselaer Polytech Inst, Troy, NY 12180 USA
基金
美国国家科学基金会;
关键词
Parallel graph mining; Distributed graph mining; Single large graph; Frequent subgraph mining; High performance computing; ALGORITHM;
D O I
10.1007/s10618-016-0466-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel distributed algorithm for mining frequent subgraphs from a single, very large, labeled network. Our approach is the first distributed method to mine a massive input graph that is too large to fit in the memory of any individual compute node. The input graph thus has to be partitioned among the nodes, which can lead to potential false negatives. Furthermore, for scalable performance it is crucial to minimize the communication among the compute nodes. Our algorithm, DistGraph, ensures that there are no false negatives, and uses a set of optimizations and efficient collective communication operations to minimize information exchange. To our knowledge DistGraph is the first approach demonstrated to scale to graphs with over a billion vertices and edges. Scalability results on up to 2048 IBM Blue Gene/Q compute nodes, with 16 cores each, show very good speedup.
引用
收藏
页码:1024 / 1052
页数:29
相关论文
共 50 条
  • [21] An approach to Mining Information from Telephone Graph Using Graph Mining Techniques
    Rao, Bapuji
    Mishra, S. N.
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT), 2015, : 424 - 429
  • [22] Factor Graph Approach to Distributed Facility Location in Large-Scale Networks
    Ngo, Hung Q.
    Lee, Sungyoung
    Lee, Young-Koo
    2009 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, VOLS 1- 4, 2009, : 943 - 947
  • [23] A Graph-based Approach for Distributed Parameter Coordination in Wireless Communication Networks
    Guerreiro, Igor M.
    Hui, Dennis
    Guey, Jiann-Ching
    Cavalcante, Charles C.
    2012 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2012, : 152 - 156
  • [24] An autonomy-oriented computing approach to community mining in distributed and dynamic networks
    Yang, Bo
    Liu, Jiming
    Liu, Dayou
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2010, 20 (02) : 123 - 157
  • [25] Distributed Graph Routing for WirelessHART Networks
    Modekurthy, Venkata P.
    Saifullah, Abusayeed
    Madria, Sanjay
    ICDCN'18: PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING, 2018,
  • [26] Distributed graph layout for sensor networks
    Gotsman, C
    Koren, Y
    GRAPH DRAWING, 2004, 3383 : 273 - 284
  • [27] An autonomy-oriented computing approach to community mining in distributed and dynamic networks
    Bo Yang
    Jiming Liu
    Dayou Liu
    Autonomous Agents and Multi-Agent Systems, 2010, 20 : 123 - 157
  • [28] Distributed Training of Graph Convolutional Networks
    Scardapane, Simone
    Spinelli, Indro
    Di Lorenzo, Paolo
    IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2021, 7 : 87 - 100
  • [29] PMS: an Effective Approximation Approach for Distributed Large-scale Graph Data Processing and Mining
    Cao, Yingjie
    Zhang, Yangyang
    Li, Jianxin
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1999 - 2002
  • [30] ViWoSG:A distributed scene graph of ultra-massive distributed virtual environments
    WANG GuoPing
    Department of Computer Science
    Science China(Information Sciences), 2009, (03) : 457 - 469