Incremental Frequent Subgraph Mining on Large Evolving Graphs

被引:40
|
作者
Abdelhamid, Ehab [1 ]
Canim, Mustafa [2 ]
Sadoghi, Mohammad [3 ]
Bhattacharjee, Bishwaranjan [2 ]
Chang, Yuan-Chi [2 ]
Kalnis, Panos [1 ]
机构
[1] King Abdullah Univ Sci & Technol, Comp Elect & Math Sci & Engn Div, Thuwal 23955, Saudi Arabia
[2] IBM Thomas J Watson Res Ctr, 1101 Kitchawan Rd, Yorktown Hts, NY 10598 USA
[3] Univ Calif Davis, Comp Sci Dept, 2063 Kemper Hall, Davis, CA 95616 USA
关键词
Graph algorithms; data mining; indexing; ALGORITHM; PATTERNS;
D O I
10.1109/TKDE.2017.2743075
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Frequent subgraph mining is a core graph operation used in many domains, such as graph data management and knowledge exploration, bioinformatics, and security. Most existing techniques target static graphs. However, modern applications, such as social networks, utilize large evolving graphs. Mining these graphs using existing techniques is infeasible, due to the high computational cost. In this paper, we propose IncGM+, a fast incremental approach for continuous frequent subgraph mining on a single large evolving graph. We adapt the notion of "fringe" to the graph context, that is the set of subgraphs on the border between frequent and infrequent subgraphs. IncGM+ maintains fringe subgraphs and exploits them to prune the search space. To boost the efficiency, we propose an efficient index structure to maintain selected embeddings with minimal memory overhead. These embeddings are utilized to avoid redundant expensive subgraph isomorphism operations. Moreover, the proposed system supports batch updates. Using large real-world graphs, we experimentally verify that IncGM+ outperforms existing methods by up to three orders of magnitude, scales to much larger graphs and consumes less memory.
引用
收藏
页码:2710 / 2723
页数:14
相关论文
共 50 条
  • [21] Densest Periodic Subgraph Mining on Large Temporal Graphs
    Qin, Hongchao
    Li, Rong-Hua
    Yuan, Ye
    Dai, Yongheng
    Wang, Guoren
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (11) : 11259 - 11273
  • [22] GRAMI: Frequent Subgraph and Pattern Mining in a Single Large Graph
    Elseidy, Mohammed
    Abdelhamid, Ehab
    Skiadopoulos, Spiros
    Kalnis, Panos
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 7 (07): : 517 - 528
  • [23] POSGRAMI: Possibilistic Frequent Subgraph Mining in a Single Large Graph
    Moussaoui, Mohamed
    Zaghdoud, Montaceur
    Akaichi, Jalel
    INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS, IPMU 2016, PT I, 2016, 610 : 549 - 561
  • [24] A Method for Closed Frequent Subgraph Mining in a Single Large Graph
    Nguyen, Lam B. Q.
    Nguyen, Loan T. T.
    Zelinka, Ivan
    Snasel, Vaclav
    Hung Son Nguyen
    Bay Vo
    IEEE ACCESS, 2021, 9 : 165719 - 165733
  • [25] Generalization for frequent subgraph mining
    Inokuchi, Akihiro
    Washio, Takashi
    Motoda, Hiroshi
    Transactions of the Japanese Society for Artificial Intelligence, 2004, 19 (05) : 368 - 378
  • [26] Frequent Subgraph Mining on BigData
    Sreedevi, K. M.
    Hareesh, M. J.
    Kunjachan, Honeytta
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 555 - 560
  • [27] Frequent mining of subgraph structures
    Guo, Ping
    Wang, Xin-Ru
    Kang, Yan-Rong
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2006, 18 (04) : 513 - 521
  • [28] FCSG-Miner: Frequent closed subgraph mining in multi-graphs
    Chen, Xinyang
    Cai, Jiayu
    Chen, Guoting
    Gan, Wensheng
    Broustet, Amael
    INFORMATION SCIENCES, 2024, 665
  • [29] Efficient frequent connected subgraph mining in graphs of bounded tree-width
    Horvath, Tamas
    Ramon, Jan
    THEORETICAL COMPUTER SCIENCE, 2010, 411 (31-33) : 2784 - 2797
  • [30] Parallel Incremental Frequent Itemset Mining for Large Data
    Song, Yu-Geng
    Cui, Hui-Min
    Feng, Xiao-Bing
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2017, 32 (02) : 368 - 385