MANIACS: Approximate Mining of Frequent Subgraph Patterns through Sampling

被引:5
|
作者
Preti, Giulia [1 ]
Morales, Gianmarco De Francisci [1 ]
Riondato, Matteo [2 ]
机构
[1] CENTAI, Corso Inghilterra 3, I-10138 Turin, Italy
[2] Amherst Coll, Dept Comp Sci, Box 2232, Amherst, MA 01002 USA
基金
美国国家科学基金会;
关键词
Minimum Node Image; pattern mining; VC-dimension; GRAPHLETS;
D O I
10.1145/3587254
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present MANIACS, a sampling-based randomized algorithm for computing high-quality approximations of the collection of the subgraph patterns that are frequent in a single, large, vertex-labeled graph, according to the Minimum Node Image-based (MNI) frequency measure. The output of MANIACS comes with strong probabilistic guarantees, obtained by using the empirical Vapnik-Chervonenkis (VC) dimension, a key concept from statistical learning theory, together with strong probabilistic tail bounds on the difference between the frequency of a pattern in the sample and its exact frequency. MANIACS leverages properties of the MNI-frequency to aggressively prune the pattern search space, and thus to reduce the time spent in exploring subspaces that contain no frequent patterns. In turn, this pruning leads to better bounds to the maximum frequency estimation error, which leads to increased pruning, resulting in a beneficial feedback effect. The results of our experimental evaluation of MANIACS on real graphs show that it returns high-quality collections of frequent patterns in large graphs up to two orders of magnitude faster than the exact algorithm.
引用
收藏
页数:29
相关论文
共 50 条
  • [1] MANIACS: Approximate Mining of Frequent Subgraph Patterns through Sampling
    Preti, Giulia
    Morales, Gianmarco De Francisci
    Riondato, Matteo
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 1348 - 1358
  • [2] TIPTAP: Approximate Mining of Frequent k-Subgraph Patterns in Evolving Graphs
    Nasir, Muhammad Anis Uddin
    Aslay, Cigdem
    Morales, Gianmarco De Francisci
    Riondato, Matteo
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2021, 15 (03)
  • [3] Image Clustering Based on Frequent Approximate Subgraph Mining
    Acosta-Mendoza, Niusvel
    Ariel Carrasco-Ochoa, Jesus
    Martinez-Trinidad, Jose Fco.
    Gago-Alonso, Andres
    Medina-Pagola, Jose E.
    PATTERN RECOGNITION, 2018, 10880 : 189 - 198
  • [4] Large Graph Sampling Algorithm for Frequent Subgraph Mining
    Zheng, Tianyu
    Li Wang
    IEEE ACCESS, 2021, 9 : 88970 - 88980
  • [5] Structural-Semantic Approach for Approximate Frequent Subgraph Mining
    Moussaoui, Mohamed
    Zaghdoud, Montaceur
    Akaichi, Jalel
    2015 IEEE/ACS 12TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2015,
  • [6] Approximate mining of frequent patterns on streams
    Silvestri, Claudio
    Orlando, Salvatore
    INTELLIGENT DATA ANALYSIS, 2007, 11 (01) : 49 - 73
  • [7] Multi-graph Frequent Approximate Subgraph Mining for Image Clustering
    Acosta-Mendoza, Niusvel
    Ariel Carrasco-Ochoa, Jesus
    Gago-Alonso, Andres
    Francisco Martinez-Trinidad, Jose
    Eladio Medina-Pagola, Jose
    PROGRESS IN ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION, IWAIPR 2018, 2018, 11047 : 133 - 140
  • [8] Mining frequent approximate patterns in large networks
    Driss, Kaouthar
    Boulila, Wadii
    Leborgne, Aurelie
    Gancarski, Pierre
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2021, 31 (03) : 1265 - 1279
  • [9] Mining Frequent Subgraph Patterns from Uncertain Graph Data
    Zou, Zhaonian
    Li, Jianzhong
    Gao, Hong
    Zhang, Shuo
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (09) : 1203 - 1218
  • [10] Generalization for frequent subgraph mining
    Inokuchi, Akihiro
    Washio, Takashi
    Motoda, Hiroshi
    Transactions of the Japanese Society for Artificial Intelligence, 2004, 19 (05) : 368 - 378