Flexible and Feasible Support Measures for Mining Frequent Patterns in Large Labeled Graphs

Cited by: 12
Authors
Meng, Jinghan [1]
Tu, Yi-Cheng [1, 2]
Affiliations
[1] Univ S Florida, Dept Comp Sci & Engn, Tampa, FL 33620 USA
[2] Univ S Florida, IDSC, Tampa, FL 33620 USA
Funding
National Science Foundation (USA); National Institutes of Health (USA);
Keywords
Data mining; graph mining; support measures; hypergraph; SUBGRAPH;
DOI
10.1145/3035918.3035936
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
In recent years, the popularity of graph databases has grown rapidly. This paper focuses on the single graph as an effective model for representing information, and on the graph mining techniques related to it. Frequent pattern mining in a single-graph setting poses two main problems: defining a support measure and designing a search scheme. In this paper, we propose a novel framework for constructing support measures that brings together the existing minimum-image-based and overlap-graph-based support measures. Our framework is built on the concept of occurrence/instance hypergraphs. Based on that, we present two new support measures, the minimum instance (MI) measure and the minimum vertex cover (MVC) measure, which combine the advantages of existing measures. In particular, we show that the existing minimum-image-based support measure is an upper bound of the MI measure. The MI measure is also computable in linear time and yields counts that are close to the number of instances of a pattern. Although computing the MVC measure is NP-hard, it can be approximated to within a constant factor in polynomial time. We also provide polynomial-time relaxations for both measures and bounding theorems for all presented support measures in the hypergraph setting. We further show that the hypergraph-based framework can unify all support measures studied in this paper. The framework is also flexible in that more variants of support measures can be defined and profiled in it.
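To make the contrast between support measures concrete, the following is a minimal sketch on a toy example, not the authors' implementation. It assumes that each occurrence is given as a mapping from pattern nodes to data-graph vertices, that the occurrence hypergraph has one hyperedge per occurrence (the set of data vertices it uses), and that the MVC measure is the size of a minimum vertex cover of that hypergraph; the abstract does not spell out these definitions. All function names are hypothetical. The MI measure is left out because the abstract does not define it.

```python
from itertools import combinations


def mni_support(occurrences):
    """Minimum-image-based support: for each pattern node, count the
    distinct data vertices it maps to, then take the smallest count."""
    if not occurrences:
        return 0
    pattern_nodes = occurrences[0].keys()
    return min(len({occ[p] for occ in occurrences}) for p in pattern_nodes)


def occurrence_hyperedges(occurrences):
    """Assumed construction: one hyperedge per occurrence, holding the
    data vertices that the occurrence uses."""
    return [frozenset(occ.values()) for occ in occurrences]


def mvc_support(hyperedges):
    """Exact minimum vertex cover size by brute force.
    Exponential time, so suitable for toy inputs only."""
    vertices = sorted(set().union(*hyperedges))
    for k in range(len(vertices) + 1):
        for candidate in combinations(vertices, k):
            chosen = set(candidate)
            if all(chosen & edge for edge in hyperedges):
                return k
    return len(vertices)


def mvc_greedy_cover_size(hyperedges):
    """Greedy cover: whenever a hyperedge is not yet covered, add all of
    its vertices. For hyperedges of size k the result is at most k times
    the optimum (a standard argument, not taken from the paper)."""
    cover = set()
    for edge in hyperedges:
        if not (cover & edge):
            cover |= edge
    return len(cover)


# Toy pattern: a single edge between a node labeled A and a node labeled B.
# Toy data graph edges: a1-b1, a1-b2, a2-b3, a3-b3 (two small stars).
occurrences = [
    {"A": "a1", "B": "b1"},
    {"A": "a1", "B": "b2"},
    {"A": "a2", "B": "b3"},
    {"A": "a3", "B": "b3"},
]
edges = occurrence_hyperedges(occurrences)

print("occurrences:", len(occurrences))
print("MNI support:", mni_support(occurrences))
print("MVC support:", mvc_support(edges))
print("greedy cover size:", mvc_greedy_cover_size(edges))
```

In this toy case the four occurrences cluster around two hub vertices (a1 and b3), so a vertex cover needs only two vertices, while the minimum-image count is three. The images of any single pattern node always form a vertex cover of the occurrence hypergraph, so under the assumptions above the MVC value can never exceed the minimum-image value.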
Pages: 391-402
Number of pages: 12