Isomorphic Graph Embedding for Progressive Maximal Frequent Subgraph Mining

Authors
Thanh Toan Nguyen [1]
Thanh Tam Nguyen [2]
Thanh Hung Nguyen [3]
Yin, Hongzhi [4]
Thanh Thi Nguyen [5]
Jo, Jun [2]
Quoc Viet Hung Nguyen [2]
Affiliations
[1] HUTECH Univ, Fac Informat Technol, 475A Dien Bien Phu,Ward 25, Ho Chi Minh City 70000, Vietnam
[2] Griffith Univ, 1 Parklands Dr, Gold Coast, Qld 4215, Australia
[3] Hanoi Univ Sci & Technol, 1 Dai Co Viet, Hanoi 113000, Vietnam
[4] Univ Queensland, St Lucia, Qld 4072, Australia
[5] Monash Univ, Wellington Rd, Clayton, Vic 3800, Australia
Keywords
Maximal frequent subgraph mining; graph representation learning; isomorphism testing
DOI
10.1145/3630635
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Maximal frequent subgraph mining (MFSM) is the task of mining only maximal frequent subgraphs, i.e., frequent subgraphs that are not part of any other frequent subgraph. Although many intelligent systems require MFSM, it is more challenging than frequent subgraph mining (FSM): maximal frequent subgraphs lie in the middle of the graph lattice, and FSM algorithms must explore an exponential search space and invoke an NP-hard frequency-counting subroutine. Unlike prior research, which focused primarily on optimal solutions, we introduce pmMine, a progressive graph neural framework for MFSM in a single large graph that attains an approximate solution. The framework combines isomorphic graph embedding, non-parametric partitioning, and an efficient top-down pattern search strategy. The critical insight behind pmMine is to define the concepts of rooted subgraph and isomorphic graph embedding, so that the costly isomorphism subroutine can be performed efficiently through similarity estimation in the embedding space. In addition, pmMine returns the patterns it identifies progressively, while mining is still in progress. We validate the efficiency and effectiveness of our technique through extensive experiments on a variety of datasets spanning various domains.
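The maximality condition in the abstract can be illustrated with a small sketch. This is not pmMine and not the authors' method: it assumes the frequent patterns have already been mined, represents each pattern as a set of labeled edges, and uses a plain proper-subset check as a simplified stand-in for subgraph isomorphism. The names `Pattern` and `maximal_patterns` are hypothetical.

```python
from typing import FrozenSet, List, Tuple

# Assumption: a pattern is a set of labeled edges; real MFSM compares
# patterns up to subgraph isomorphism, which this sketch does not do.
Edge = Tuple[str, str]
Pattern = FrozenSet[Edge]


def maximal_patterns(frequent: List[Pattern]) -> List[Pattern]:
    """Keep patterns that are not a proper subset of another frequent pattern."""
    return [p for p in frequent if not any(p < q for q in frequent)]


# Hypothetical, already-mined frequent patterns.
frequent = [
    frozenset({("A", "B")}),
    frozenset({("A", "B"), ("B", "C")}),
    frozenset({("C", "D")}),
]
result = maximal_patterns(frequent)
```

Here `result` drops the single-edge pattern `("A", "B")` because it is contained in the two-edge pattern, and keeps the other two. The pairwise check is quadratic in the number of patterns, which is acceptable for illustration only.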
Pages: 26
Related papers
50 in total
  • [1] MARGIN: Maximal Frequent Subgraph Mining
    Thomas, Lini T.
    Valluri, Satyanarayana R.
    Karlapalem, Kamalakar
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2010, 4 (03)
  • [2] MARGIN: Maximal frequent subgraph mining
    Thomas, Lini T.
    Valluri, Satyanarayana R.
    Karlapalem, Kamalakar
    [J]. ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2006, : 1097 - +
  • [3] A new algorithm for mining maximal frequent subgraph
    Wang, Zhisong
    Chai, Ran
    [J]. Journal of Computational Information Systems, 2010, 6 (02) : 469 - 476
  • [4] Large Graph Sampling Algorithm for Frequent Subgraph Mining
    Zheng, Tianyu
    Li Wang
    [J]. IEEE ACCESS, 2021, 9 : 88970 - 88980
  • [5] Weighted Frequent Subgraph Mining in Weighted Graph Databases
    Shinoda, Masaki
    Ozaki, Tomonobu
    Ohkawa, Takenao
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 58 - +
  • [6] Frequent Subgraph Mining in Graph Databases Based on MapReduce
    Wang, Kai
    Xie, Xia
    Jin, Hai
    Yuan, Pingpeng
    Lu, Feng
    Ke, Xijiang
    [J]. ADVANCES IN SERVICES COMPUTING, 2016, 10065 : 464 - 476
  • [7] Optimizing Frequent Subgraph Mining for Single Large Graph
    Dhiman, Aarzoo
    Jain, S. K.
    [J]. TWELFTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2016 / TWELFTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2016 / TWELFTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2016, 2016, 89 : 378 - 385
  • [8] Top-Down algorithm for mining maximal frequent subgraph
    Guo Jingfeng
    Chai Ran
    Li Jia
    [J]. ADVANCED RESEARCH ON INDUSTRY, INFORMATION SYSTEMS AND MATERIAL ENGINEERING, PTS 1-7, 2011, 204-210 : 1472 - 1476
  • [9] Frequent Generalized Subgraph Mining via Graph Edit Distances
    Palme, Richard
    Welke, Pascal
    [J]. MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT II, 2023, 1753 : 477 - 483
  • [10] Online Structural Graph Clustering Using Frequent Subgraph Mining
    Seeland, Madeleine
    Girschick, Tobias
    Buchwald, Fabian
    Kramer, Stefan
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III, 2010, 6323 : 213 - 228