A distributed approach for graph mining in massive networks

被引:56
|
作者
Talukder, N. [1 ]
Zaki, M. J. [1 ]
机构
[1] Rensselaer Polytech Inst, Troy, NY 12180 USA
基金
美国国家科学基金会;
关键词
Parallel graph mining; Distributed graph mining; Single large graph; Frequent subgraph mining; High performance computing; ALGORITHM;
D O I
10.1007/s10618-016-0466-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel distributed algorithm for mining frequent subgraphs from a single, very large, labeled network. Our approach is the first distributed method to mine a massive input graph that is too large to fit in the memory of any individual compute node. The input graph thus has to be partitioned among the nodes, which can lead to potential false negatives. Furthermore, for scalable performance it is crucial to minimize the communication among the compute nodes. Our algorithm, DistGraph, ensures that there are no false negatives, and uses a set of optimizations and efficient collective communication operations to minimize information exchange. To our knowledge DistGraph is the first approach demonstrated to scale to graphs with over a billion vertices and edges. Scalability results on up to 2048 IBM Blue Gene/Q compute nodes, with 16 cores each, show very good speedup.
引用
收藏
页码:1024 / 1052
页数:29
相关论文
共 50 条
  • [1] A distributed approach for graph mining in massive networks
    N. Talukder
    M. J. Zaki
    Data Mining and Knowledge Discovery, 2016, 30 : 1024 - 1052
  • [2] Graph Neural Network for Distributed Beamforming and Power Control in Massive URLLC Networks
    Gu, Yifan
    She, Changyang
    Bi, Suzhi
    Quan, Zhi
    Vucetic, Branka
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (08) : 9099 - 9112
  • [3] Graph Mining with Graph Neural Networks
    Jin, Wei
    WSDM '21: PROCEEDINGS OF THE 14TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2021, : 1119 - 1120
  • [4] Distributed Attack Modeling Approach Based on Process Mining and Graph Segmentation
    Chen, Yuzhong
    Liu, Zhenyu
    Liu, Yulin
    Dong, Chen
    ENTROPY, 2020, 22 (09)
  • [5] Catching Synchronized Behaviors in Large Networks: A Graph Mining Approach
    Jiang, Meng
    Cui, Peng
    Beutel, Alex
    Faloutsos, Christos
    Yang, Shiqiang
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2016, 10 (04)
  • [6] A Graph-Theoretic Approach to Distributed Control over Networks
    Swigart, John
    Lall, Sanjay
    PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009), 2009, : 5409 - 5414
  • [7] Arabesque: A System for Distributed Graph Mining
    Teixeira, Carlos H. C.
    Fonseca, Alexandre J.
    Serafini, Marco
    Siganos, Georgos
    Zaki, Mohammed J.
    Aboulnaga, Ashraf
    SOSP'15: PROCEEDINGS OF THE TWENTY-FIFTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES, 2015, : 425 - 440
  • [8] Trend motif: A graph mining approach for analysis of dynamic complex networks
    Jin, Ruoming
    McCallen, Scott
    Almaas, Eivind
    ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 541 - +
  • [9] A Survey of Distributed Graph Algorithms on Massive Graphs
    Meng, Lingkai
    Shao, Yu
    Yuan, Long
    Lai, Longbin
    Cheng, Peng
    Li, Xue
    Yu, Wenyuan
    Zhang, Wenjie
    Lin, Xuemin
    Zhou, Jingren
    ACM COMPUTING SURVEYS, 2025, 57 (02)
  • [10] NOMA-Aided Distributed Massive MIMO-A Graph-Theoretic Approach
    Kudathanthirige, Dhanushka
    Gunasinghe, Dulaj
    Aruma Baduge, Gayan Amarasuriya
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2024, 10 (04) : 1398 - 1413