Flexible and Feasible Support Measures for Mining Frequent Patterns in Large Labeled Graphs

被引:12
|
作者
Meng, Jinghan [1 ]
Tu, Yi-Cheng [1 ,2 ]
机构
[1] Univ S Florida, Dept Comp Sci & Engn, Tampa, FL 33620 USA
[2] Univ S Florida, IDSC, Tampa, FL 33620 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Data mining; graph mining; support measures; hypergraph; SUBGRAPH;
D O I
10.1145/3035918.3035936
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, the popularity of graph databases has grown rapidly. This paper focuses on single-graph as an effective model to represent information and its related graph mining techniques. In frequent pattern mining in a single-graph setting, there are two main problems: support measure and search scheme. In this paper, we propose a novel framework for constructing support measures that brings together existing minimum-image-based and overlap-graph-based support measures. Our framework is built on the concept of occurrence / instance hypergraphs. Based on that, we present two new support measures: minimum instance (MI) measure and minimum vertex cover (MVC) measure, that combine the advantages of existing measures. In particular, we show that the existing minimum-image-based support measure is an upper bound of the MI measure, which is also linear-time computable and results in counts that are close to number of instances of a pattern. Although the MVC measure is NP-hard, it can be approximated to a constant factor in polynomial time. We also provide polynomial-time relaxations for both measures and bounding theorems for all presented support measures in the hypergraph setting. We further show that the hypergraph-based framework can unify all support measures studied in this paper. This framework is also flexible in that more variants of support measures can be defined and profiled in it.
引用
收藏
页码:391 / 402
页数:12
相关论文
共 50 条
  • [1] Mining Frequent Neighborhood Patterns in a Large Labeled Graph
    Han, Jialong
    Wen, Ji-Rong
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 259 - 268
  • [2] Generalizing Design of Support Measures for Counting Frequent Patterns in Graphs
    Meng, Jinghan
    Pitaksirianan, Napath
    Tu, Yicheng
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 533 - 542
  • [3] Mining frequent labeled and partially labeled graph patterns
    Vanetik, N
    Gudes, E
    20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 91 - 102
  • [4] Mining Frequent Patterns on Knowledge Graphs
    Mouatadid, Lalla
    WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 1647 - 1647
  • [5] Mining Frequent Patterns in Evolving Graphs
    Aslay, Cigdem
    Nasir, Muhammad Anis Uddin
    Morales, Gianmarco De Francisci
    Gionis, Aristides
    CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 923 - 932
  • [6] Counting frequent patterns in large labeled graphs: a hypergraph-based approach
    Meng, Jinghan
    Pitaksirianan, Napath
    Tu, Yi-Cheng
    DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (04) : 980 - 1021
  • [7] Counting frequent patterns in large labeled graphs: a hypergraph-based approach
    Jinghan Meng
    Napath Pitaksirianan
    Yi-Cheng Tu
    Data Mining and Knowledge Discovery, 2020, 34 : 980 - 1021
  • [8] Efficient Mining of Frequent Patterns on Uncertain Graphs
    Chen, Yifan
    Zhao, Xiang
    Lin, Xuemin
    Wang, Yang
    Guo, Deke
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (02) : 287 - 300
  • [9] A general framework for mining frequent subgraphs from labeled graphs
    Inokuchi, A
    Washio, T
    Motoda, H
    FUNDAMENTA INFORMATICAE, 2005, 66 (1-2) : 53 - 82
  • [10] Complete Mining of Frequent Patterns from Graphs: Mining Graph Data
    Akihiro Inokuchi
    Takashi Washio
    Hiroshi Motoda
    Machine Learning, 2003, 50 : 321 - 354