Flexible and Feasible Support Measures for Mining Frequent Patterns in Large Labeled Graphs

被引:12
|
作者
Meng, Jinghan [1 ]
Tu, Yi-Cheng [1 ,2 ]
机构
[1] Univ S Florida, Dept Comp Sci & Engn, Tampa, FL 33620 USA
[2] Univ S Florida, IDSC, Tampa, FL 33620 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Data mining; graph mining; support measures; hypergraph; SUBGRAPH;
D O I
10.1145/3035918.3035936
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, the popularity of graph databases has grown rapidly. This paper focuses on single-graph as an effective model to represent information and its related graph mining techniques. In frequent pattern mining in a single-graph setting, there are two main problems: support measure and search scheme. In this paper, we propose a novel framework for constructing support measures that brings together existing minimum-image-based and overlap-graph-based support measures. Our framework is built on the concept of occurrence / instance hypergraphs. Based on that, we present two new support measures: minimum instance (MI) measure and minimum vertex cover (MVC) measure, that combine the advantages of existing measures. In particular, we show that the existing minimum-image-based support measure is an upper bound of the MI measure, which is also linear-time computable and results in counts that are close to number of instances of a pattern. Although the MVC measure is NP-hard, it can be approximated to a constant factor in polynomial time. We also provide polynomial-time relaxations for both measures and bounding theorems for all presented support measures in the hypergraph setting. We further show that the hypergraph-based framework can unify all support measures studied in this paper. This framework is also flexible in that more variants of support measures can be defined and profiled in it.
引用
收藏
页码:391 / 402
页数:12
相关论文
共 50 条
  • [21] Mining Probabilistically Frequent Sequential Patterns in Large Uncertain Databases
    Zhao, Zhou
    Yan, Da
    Ng, Wilfred
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (05) : 1171 - 1184
  • [22] Mining frequent tree-like patterns in large datasets
    Chen, TS
    Hsu, SC
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2005, 3453 : 561 - 567
  • [23] Mining frequent tree-like patterns in large datasets
    Chen, Tzung-Shi
    Hsu, Shih-Chun
    DATA & KNOWLEDGE ENGINEERING, 2007, 62 (01) : 65 - 83
  • [24] TIPTAP: Approximate Mining of Frequent k-Subgraph Patterns in Evolving Graphs
    Nasir, Muhammad Anis Uddin
    Aslay, Cigdem
    Morales, Gianmarco De Francisci
    Riondato, Matteo
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2021, 15 (03)
  • [25] Quick Mining of Isomorphic Exact Large Patterns from Large Graphs
    Almasri, Islam
    Gao, Xin
    Fedoroff, Nina
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 517 - 524
  • [26] Incremental mining of frequent patterns without candidate generation or support constraint
    Cheung, W
    Zaïane, OR
    SEVENTH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2003, : 111 - 116
  • [27] A PARALLEL METHOD FOR MINING FREQUENT PATTERNS WITH MULTIPLE MINIMUM SUPPORT THRESHOLDS
    Huynh, Bao
    Trinh, Cuong
    Dang, Vu
    Vo, Bay
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2019, 15 (02): : 479 - 488
  • [28] Mining top−k frequent patterns without minimum support threshold
    Abdus Salam
    M. Sikandar Hayat Khayal
    Knowledge and Information Systems, 2012, 30 : 57 - 86
  • [29] An efficient mining of weighted frequent patterns with length decreasing support constraints
    Yun, Unil
    KNOWLEDGE-BASED SYSTEMS, 2008, 21 (08) : 741 - 752
  • [30] Frequent Subgraph Mining Algorithms for Single Large Graphs- A Brief Survey
    Dhiman, Aarzoo
    Jain, S. K.
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND AUTOMATION (ICACCA 2016), 2016, : 179 - 184