Mining patterns in graphs with multiple weights

被引:0
|
作者
Giulia Preti
Matteo Lissandrini
Davide Mottin
Yannis Velegrakis
机构
[1] University of Trento,
[2] Aalborg University,undefined
[3] Aarhus University,undefined
来源
关键词
Multi-weighted graphs; Graph mining; Weighted pattern mining; Personalized patterns;
D O I
暂无
中图分类号
学科分类号
摘要
Graph pattern mining aims at identifying structures that appear frequently in large graphs, under the assumption that frequency signifies importance. In real life, there are many graphs with weights on nodes and/or edges. For these graphs, it is fair that the importance (score) of a pattern is determined not only by the number of its appearances, but also by the weights on the nodes/edges of those appearances. Scoring functions based on the weights do not generally satisfy the apriori property, which guarantees that the number of appearances of a pattern cannot be larger than the frequency of any of its sub-patterns, and hence allows faster pruning. Therefore, existing approaches employ other, less efficient, pruning strategies. The problem becomes even more challenging in the case of multiple weighting functions that assign different weights to the same nodes/edges. In this work we propose a new family of scoring functions that respects the apriori property, and thus can rely on effective pruning strategies. We provide efficient and effective techniques for mining patterns in multi-weighted graphs, and we devise both an exact and an approximate solution. In addition, we propose a distributed version of our approach, which distributes the appearances of the patterns to examine among multiple workers. Extensive experiments on both real and synthetic datasets prove that the presence of edge weights and the choice of scoring function affect the patterns mined, and the quality of the results returned to the user. Moreover, we show that, even when the performance of the exact algorithm degrades because of an increasing number of weighting functions, the approximate algorithm performs well and with fairly good quality. Finally, the distributed algorithm proves to be the best choice for mining large and rich input graphs.
引用
收藏
页码:281 / 319
页数:38
相关论文
共 50 条
  • [41] Collaborative mining of graph patterns from multiple sources
    Levchuk, Georgiy
    Colonna-Romano, John
    SIGNAL PROCESSING, SENSOR/INFORMATION FUSION, AND TARGET RECOGNITION XXV, 2016, 9842
  • [42] Mining relational patterns from multiple relational tables
    Tsechansky, Maytal Saar
    Pliskin, Nava
    Rabinowitz, Gadi
    Porath, Avi
    Decision Support Systems, 1999, 27 (01): : 177 - 195
  • [43] A parallel algorithm for mining multiple partial periodic patterns
    Lee, Guanling
    Yang, Wenpo
    Lee, Jia-Min
    INFORMATION SCIENCES, 2006, 176 (24) : 3591 - 3609
  • [44] Mining relational patterns from multiple relational tables
    Tsechansky, MS
    Pliskin, N
    Rabinowitz, G
    Porath, A
    DECISION SUPPORT SYSTEMS, 1999, 27 (1-2) : 177 - 195
  • [45] Role Mining Based on Weights
    Ma, Xiaopu
    Li, Ruixuan
    Lu, Zhengding
    SACMAT 2010: PROCEEDINGS OF THE 15TH ACM SYMPOSIUM ON ACCESS CONTROL MODELS AND TECHNOLOGIES, 2010, : 65 - 74
  • [46] Interplay between topology and edge weights in real-world graphs: concepts, patterns, and an algorithm
    Fanchen Bu
    Shinhwan Kang
    Kijung Shin
    Data Mining and Knowledge Discovery, 2023, 37 : 2139 - 2191
  • [47] Interplay between topology and edge weights in real-world graphs: concepts, patterns, and an algorithm
    Bu, Fanchen
    Kang, Shinhwan
    Shin, Kijung
    DATA MINING AND KNOWLEDGE DISCOVERY, 2023, 37 (06) : 2139 - 2191
  • [48] Case Studies and Evaluation of Green Mining considering Uncertainty Factors and Multiple Indicator Weights
    Wang, J.
    Hu, B.
    Chang, J.
    Wang, W. P.
    Li, H. L.
    GEOFLUIDS, 2020, 2020
  • [49] TIPTAP: Approximate Mining of Frequent k-Subgraph Patterns in Evolving Graphs
    Nasir, Muhammad Anis Uddin
    Aslay, Cigdem
    Morales, Gianmarco De Francisci
    Riondato, Matteo
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2021, 15 (03)
  • [50] Flexible and Feasible Support Measures for Mining Frequent Patterns in Large Labeled Graphs
    Meng, Jinghan
    Tu, Yi-Cheng
    SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, : 391 - 402