Efficient structure similarity searches: a partition-based approach

被引:0
|
作者
Xiang Zhao
Chuan Xiao
Xuemin Lin
Wenjie Zhang
Yang Wang
机构
[1] National University of Defense Technology,
[2] Collaborative Innovation Center of Geospatial Technology,undefined
[3] Nagoya University,undefined
[4] The University of New South Wales,undefined
来源
The VLDB Journal | 2018年 / 27卷
关键词
Graph database; Similarity query; Graph edit distance; Top-; search;
D O I
暂无
中图分类号
学科分类号
摘要
Graphs are widely used to model complex data in many applications, such as bioinformatics, chemistry, social networks, pattern recognition. A fundamental and critical query primitive is to efficiently search similar structures in a large collection of graphs. This article mainly studies threshold-based graph similarity search with edit distance constraints. Existing solutions to the problem utilize fixed-size overlapping substructures to generate candidates, and thus become susceptible to large vertex degrees and distance thresholds. In this article, we present a partition-based approach to tackle the problem. By dividing data graphs into variable-size non-overlapping partitions, the edit distance constraint is converted to a graph containment constraint for candidate generation. We develop efficient query processing algorithms based on the novel paradigm. Moreover, candidate-pruning techniques and an improved graph edit distance verification algorithm are developed to boost the performance. In addition, a cost-aware graph partitioning method is devised to optimize the index. Extending the partition-based filtering paradigm, we present a solution to the top-k\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k$$\end{document} graph similarity search problem, where tailored filtering, look-ahead and computation-sharing strategies are exploited. Using both public real-life and synthetic datasets, extensive experiments demonstrate that our approaches significantly outperform the baseline and its alternatives.
引用
收藏
页码:53 / 78
页数:25
相关论文
共 50 条
  • [41] A partition-based global optimization algorithm
    Liuzzi, Giampaolo
    Lucidi, Stefano
    Piccialli, Veronica
    [J]. JOURNAL OF GLOBAL OPTIMIZATION, 2010, 48 (01) : 113 - 128
  • [42] Equal Area Partition-Based Energy Efficient Routing Algorithm for Circular WSN
    Hu Liqin
    Wang Sanyou
    Ma Fujun
    Zhang Shubo
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [43] PARTITION-BASED BAYESIAN OPTIMIZATION FOR STOCHASTIC SIMULATIONS
    Wang, Songhao
    Ng, Szu Hui
    [J]. 2020 WINTER SIMULATION CONFERENCE (WSC), 2020, : 2832 - 2843
  • [44] Fast CU partition-based machine learning approach for reducing HEVC complexity
    Bouaafia, Soulef
    Khemiri, Randa
    Sayadi, Fatma Ezahra
    Atri, Mohamed
    [J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2020, 17 (01) : 185 - 196
  • [45] K-maximin clustering: a maximin correlation approach to partition-based clustering
    Lee, Taehoon
    Kim, Seung Jean
    Chung, Eui-Young
    Yoon, Sungroh
    [J]. IEICE ELECTRONICS EXPRESS, 2009, 6 (17): : 1205 - 1211
  • [46] Fast CU partition-based machine learning approach for reducing HEVC complexity
    Soulef Bouaafia
    Randa Khemiri
    Fatma Ezahra Sayadi
    Mohamed Atri
    [J]. Journal of Real-Time Image Processing, 2020, 17 : 185 - 196
  • [47] ParRouting: An Efficient Area Partition-Based Congestion-Aware Routing Algorithm for NoCs
    Fang, Juan
    Zhang, Di
    Li, Xiaqing
    [J]. MICROMACHINES, 2020, 11 (12) : 1 - 17
  • [48] Partition-Based Faults Diagnosis of a VLIW Processor
    Sabena, Davide
    Reorda, Matteo Sonza
    Sterpone, Luca
    [J]. VLSI-SOC: AT THE CROSSROADS OF EMERGING TRENDS, 2015, 461 : 208 - 226
  • [49] Partition-based and sharp uniform error bounds
    Bax, E
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (06): : 1315 - 1320
  • [50] A partition-based efficient algorithm for large scale mul tiple-strings matching
    Liu, Ping
    Liu, Yan-Bing
    Tan, Jian-Long
    [J]. String Processing and Information Retrieval, Proceedings, 2005, 3772 : 399 - 404