Incremental Frequent Subgraph Mining on Large Evolving Graphs

Cited by: 40
Authors
Abdelhamid, Ehab [1]
Canim, Mustafa [2]
Sadoghi, Mohammad [3]
Bhattacharjee, Bishwaranjan [2]
Chang, Yuan-Chi [2]
Kalnis, Panos [1]
Affiliations
[1] King Abdullah Univ Sci & Technol, Comp Elect & Math Sci & Engn Div, Thuwal 23955, Saudi Arabia
[2] IBM Thomas J Watson Res Ctr, 1101 Kitchawan Rd, Yorktown Hts, NY 10598 USA
[3] Univ Calif Davis, Comp Sci Dept, 2063 Kemper Hall, Davis, CA 95616 USA
Keywords
Graph algorithms; data mining; indexing; ALGORITHM; PATTERNS
DOI
10.1109/TKDE.2017.2743075
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Frequent subgraph mining is a core graph operation used in many domains, such as graph data management and knowledge exploration, bioinformatics, and security. Most existing techniques target static graphs. However, modern applications, such as social networks, utilize large evolving graphs. Mining these graphs using existing techniques is infeasible due to the high computational cost. In this paper, we propose IncGM+, a fast incremental approach for continuous frequent subgraph mining on a single large evolving graph. We adapt the notion of "fringe" to the graph context, that is, the set of subgraphs on the border between frequent and infrequent subgraphs. IncGM+ maintains fringe subgraphs and exploits them to prune the search space. To boost efficiency, we propose an index structure that maintains selected embeddings with minimal memory overhead. These embeddings are used to avoid redundant, expensive subgraph isomorphism operations. Moreover, the proposed system supports batch updates. Using large real-world graphs, we experimentally verify that IncGM+ outperforms existing methods by up to three orders of magnitude, scales to much larger graphs, and consumes less memory.
Pages: 2710-2723
Page count: 14
Related papers
50 in total
  • [31] Parallel Incremental Frequent Itemset Mining for Large Data
    Yu-Geng Song
    Hui-Min Cui
    Xiao-Bing Feng
    Journal of Computer Science and Technology, 2017, 32: 368-385
  • [32] Incremental Partitioning of Large Time-Evolving Graphs
    Abdolrashidi, Amirreza
    Ramaswamy, Lakshmish
    2015 IEEE CONFERENCE ON COLLABORATION AND INTERNET COMPUTING (CIC), 2015: 19-27
  • [33] An Efficient Distributed Subgraph Mining Algorithm in Extreme Large Graphs
    Wu, Bin
    Bai, YunLong
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT I, 2010, 6319: 107-115
  • [34] ScaleMine: Scalable Parallel Frequent Subgraph Mining in a Single Large Graph
    Abdelhamid, Ehab
    Abdelaziz, Ibrahim
    Kalnis, Panos
    Khayyat, Zuhair
    Jamour, Fuad
    SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2016: 716-726
  • [35] ScaleMine: Scalable Parallel Frequent Subgraph Mining in a Single Large Graph
    Abdelhamid, Ehab
    Abdelaziz, Ibrahim
    Kalnis, Panos
    Khayyat, Zuhair
    Jamour, Fuad
    SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2016: 727
  • [36] A Parallel Algorithm for Frequent Subgraph Mining
    Bay Vo
    Dang Nguyen
    Thanh-Long Nguyen
    ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, 2015, 358: 163-173
  • [37] Frequent Subgraph Mining Based on Pregel
    Zhao, Xiang
    Chen, Yifan
    Xiao, Chuan
    Ishikawa, Yoshiharu
    Tang, Jiuyang
    COMPUTER JOURNAL, 2016, 59 (08): 1113-1128
  • [38] The Gaston Tool for Frequent Subgraph Mining
    Nijssen, Siegfried
    Kok, Joost N.
    ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2005, 127 (01): 77-87
  • [39] A survey of frequent subgraph mining algorithms
    Jiang, Chuntao
    Coenen, Frans
    Zito, Michele
    KNOWLEDGE ENGINEERING REVIEW, 2013, 28 (01): 75-105
  • [40] Differentially Private Frequent Subgraph Mining
    Xu, Shengzhi
    Su, Sen
    Xiong, Li
    Cheng, Xiang
    Xiao, Ke
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016: 229-240