Incremental Frequent Subgraph Mining on Large Evolving Graphs

被引:40
|
作者
Abdelhamid, Ehab [1 ]
Canim, Mustafa [2 ]
Sadoghi, Mohammad [3 ]
Bhattacharjee, Bishwaranjan [2 ]
Chang, Yuan-Chi [2 ]
Kalnis, Panos [1 ]
机构
[1] King Abdullah Univ Sci & Technol, Comp Elect & Math Sci & Engn Div, Thuwal 23955, Saudi Arabia
[2] IBM Thomas J Watson Res Ctr, 1101 Kitchawan Rd, Yorktown Hts, NY 10598 USA
[3] Univ Calif Davis, Comp Sci Dept, 2063 Kemper Hall, Davis, CA 95616 USA
关键词
Graph algorithms; data mining; indexing; ALGORITHM; PATTERNS;
D O I
10.1109/TKDE.2017.2743075
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Frequent subgraph mining is a core graph operation used in many domains, such as graph data management and knowledge exploration, bioinformatics, and security. Most existing techniques target static graphs. However, modern applications, such as social networks, utilize large evolving graphs. Mining these graphs using existing techniques is infeasible, due to the high computational cost. In this paper, we propose IncGM+, a fast incremental approach for continuous frequent subgraph mining on a single large evolving graph. We adapt the notion of "fringe" to the graph context, that is the set of subgraphs on the border between frequent and infrequent subgraphs. IncGM+ maintains fringe subgraphs and exploits them to prune the search space. To boost the efficiency, we propose an efficient index structure to maintain selected embeddings with minimal memory overhead. These embeddings are utilized to avoid redundant expensive subgraph isomorphism operations. Moreover, the proposed system supports batch updates. Using large real-world graphs, we experimentally verify that IncGM+ outperforms existing methods by up to three orders of magnitude, scales to much larger graphs and consumes less memory.
引用
收藏
页码:2710 / 2723
页数:14
相关论文
共 50 条
  • [1] Incremental Frequent Subgraph Mining on Large Evolving Graphs (Extended abstract)
    Abdelhamid, Ehab
    Canim, Mustafa
    Sadoghi, Mohammad
    Bhattacharjee, Bishwaranjan
    Chang, Yuan-Chi
    Kalnis, Panos
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1767 - 1768
  • [2] Efficient frequent subgraph mining on large streaming graphs
    Ray, Abhik
    Holder, Lawrence B.
    Bifet, Albert
    INTELLIGENT DATA ANALYSIS, 2019, 23 (01) : 103 - 132
  • [3] Dynamic frequent subgraph mining algorithms over evolving graphs: a survey
    Bostanoğlu, Belgin Ergenç
    Abuzayed, Nourhan
    PeerJ Computer Science, 2024, 10
  • [4] Dynamic frequent subgraph mining algorithms over evolving graphs: a survey
    Bostanoglu, Belgin Ergenc
    Abuzayed, Nourhan
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [5] Towards Frequent Subgraph Mining on Single Large Uncertain Graphs
    Chen, Yifan
    Zhao, Xiang
    Lin, Xuemin
    Wang, Yang
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 41 - 50
  • [6] Frequent subgraph mining in outerplanar graphs
    Tamás Horváth
    Jan Ramon
    Stefan Wrobel
    Data Mining and Knowledge Discovery, 2010, 21 : 472 - 508
  • [7] TIPTAP: Approximate Mining of Frequent k-Subgraph Patterns in Evolving Graphs
    Nasir, Muhammad Anis Uddin
    Aslay, Cigdem
    Morales, Gianmarco De Francisci
    Riondato, Matteo
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2021, 15 (03)
  • [8] Frequent subgraph mining in outerplanar graphs
    Horvath, Tamas
    Ramon, Jan
    Wrobel, Stefan
    DATA MINING AND KNOWLEDGE DISCOVERY, 2010, 21 (03) : 472 - 508
  • [9] Periodic frequent subgraph mining in dynamic graphs
    Cai, Jiayu
    Chen, Zhaoming
    Chen, Guoting
    Gan, Wensheng
    Broustet, Amael
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025,
  • [10] Frequent Subgraph Mining Algorithms for Single Large Graphs- A Brief Survey
    Dhiman, Aarzoo
    Jain, S. K.
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND AUTOMATION (ICACCA 2016), 2016, : 179 - 184