Incremental Frequent Subgraph Mining on Large Evolving Graphs

被引:40
|
作者
Abdelhamid, Ehab [1 ]
Canim, Mustafa [2 ]
Sadoghi, Mohammad [3 ]
Bhattacharjee, Bishwaranjan [2 ]
Chang, Yuan-Chi [2 ]
Kalnis, Panos [1 ]
机构
[1] King Abdullah Univ Sci & Technol, Comp Elect & Math Sci & Engn Div, Thuwal 23955, Saudi Arabia
[2] IBM Thomas J Watson Res Ctr, 1101 Kitchawan Rd, Yorktown Hts, NY 10598 USA
[3] Univ Calif Davis, Comp Sci Dept, 2063 Kemper Hall, Davis, CA 95616 USA
关键词
Graph algorithms; data mining; indexing; ALGORITHM; PATTERNS;
D O I
10.1109/TKDE.2017.2743075
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Frequent subgraph mining is a core graph operation used in many domains, such as graph data management and knowledge exploration, bioinformatics, and security. Most existing techniques target static graphs. However, modern applications, such as social networks, utilize large evolving graphs. Mining these graphs using existing techniques is infeasible, due to the high computational cost. In this paper, we propose IncGM+, a fast incremental approach for continuous frequent subgraph mining on a single large evolving graph. We adapt the notion of "fringe" to the graph context, that is the set of subgraphs on the border between frequent and infrequent subgraphs. IncGM+ maintains fringe subgraphs and exploits them to prune the search space. To boost the efficiency, we propose an efficient index structure to maintain selected embeddings with minimal memory overhead. These embeddings are utilized to avoid redundant expensive subgraph isomorphism operations. Moreover, the proposed system supports batch updates. Using large real-world graphs, we experimentally verify that IncGM+ outperforms existing methods by up to three orders of magnitude, scales to much larger graphs and consumes less memory.
引用
收藏
页码:2710 / 2723
页数:14
相关论文
共 50 条
  • [41] Efficient frequent subgraph mining algorithm
    Li, Xian-Tong
    Li, Jian-Zhong
    Gao, Hong
    Ruan Jian Xue Bao/Journal of Software, 2007, 18 (10): : 2469 - 2480
  • [42] A qualitative survey on frequent subgraph mining
    Guvenoglu, Busra
    Bostanoglu, Belgin Ergenc
    OPEN COMPUTER SCIENCE, 2018, 8 (01) : 194 - 209
  • [43] Frequent Subgraph Mining Algorithms - A Survey
    Ramraj, T.
    Prabhakar, R.
    GRAPH ALGORITHMS, HIGH PERFORMANCE IMPLEMENTATIONS AND ITS APPLICATIONS (ICGHIA 2014), 2015, 47 : 197 - 204
  • [44] A Fast Frequent Subgraph Mining Algorithm
    Wu, Jia
    Chen, Ling
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE FOR YOUNG COMPUTER SCIENTISTS, VOLS 1-5, 2008, : 82 - 87
  • [45] MARGIN: Maximal frequent subgraph mining
    Thomas, Lini T.
    Valluri, Satyanarayana R.
    Karlapalem, Kamalakar
    ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2006, : 1097 - +
  • [46] MARGIN: Maximal Frequent Subgraph Mining
    Thomas, Lini T.
    Valluri, Satyanarayana R.
    Karlapalem, Kamalakar
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2010, 4 (03)
  • [47] Evolving subgraph matching on temporal graphs
    Li, Faming
    Zou, Zhaonian
    Li, Jianzhong
    Yang, Xiaochun
    Wang, Bin
    KNOWLEDGE-BASED SYSTEMS, 2022, 258
  • [48] A Parallel Approach for Frequent Subgraph Mining in a Single Large Graph Using Spark
    Qiao, Fengcai
    Zhang, Xin
    Li, Pei
    Ding, Zhaoyun
    Jia, Shanshan
    Wang, Hui
    APPLIED SCIENCES-BASEL, 2018, 8 (02):
  • [49] Subgraph Mining on Directed and Weighted Graphs
    Guennemann, Stephan
    Seidl, Thomas
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT II, PROCEEDINGS, 2010, 6119 : 133 - 146
  • [50] Enhancing Recommender Systems with Semantic User Profiling through Frequent Subgraph Mining on Knowledge Graphs
    Jung, Haemin
    Park, Heesung
    Lee, Kwangyon
    APPLIED SCIENCES-BASEL, 2023, 13 (18):