A Partition-Based Approach to Structure Similarity Search

被引:35
|
作者
Zhao, Xiang [1 ]
Xiao, Chuan
Lin, Xuemin [1 ,3 ]
Liu, Qing [4 ]
Zhang, Wenjie [2 ]
机构
[1] Univ New South Wales, Sydney, NSW, Australia
[2] Nagoya Univ, Nagoya, Aichi, Japan
[3] China Normal Univ, Shanghai Key Lab Trustworthy Comp, Shanghai, Peoples R China
[4] CSIRO, Sydney, NSW, Australia
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2013年 / 7卷 / 03期
基金
澳大利亚研究理事会;
关键词
D O I
10.14778/2732232.2732236
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Graphs are widely used to model complex data in many applications, such as bioinformatics, chemistry, social networks, pattern recognition, etc. A fundamental and critical query primitive is to efficiently search similar structures in a large collection of graphs. This paper studies the graph similarity queries with edit distance constraints. Existing solutions to the problem utilize fixed-size overlapping substructures to generate candidates, and thus become susceptible to large vertex degrees or large distance thresholds. In this paper, we present a partition-based approach to tackle the problem. By dividing data graphs into variable-size non-overlapping partitions, the edit distance constraint is converted to a graph containment constraint for candidate generation. We develop efficient query processing algorithms based on the new paradigm. A candidate pruning technique and an improved graph edit distance algorithm are also developed to further boost the performance. In addition, a cost-aware graph partitioning technique is devised to optimize the index. Extensive experiments demonstrate our approach significantly outperforms existing approaches.
引用
收藏
页码:169 / 180
页数:12
相关论文
共 50 条
  • [1] Efficient structure similarity searches: a partition-based approach
    Xiang Zhao
    Chuan Xiao
    Xuemin Lin
    Wenjie Zhang
    Yang Wang
    [J]. The VLDB Journal, 2018, 27 : 53 - 78
  • [2] Efficient structure similarity searches: a partition-based approach
    Zhao, Xiang
    Xiao, Chuan
    Lin, Xuemin
    Zhang, Wenjie
    Wang, Yang
    [J]. VLDB JOURNAL, 2018, 27 (01): : 53 - 78
  • [3] Load Balancing for Partition-based Similarity Search
    Tang, Xun
    Alabduljalil, Maha
    Jin, Xin
    Yang, Tao
    [J]. SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 193 - 202
  • [4] A Partition-Based Random Search Method for Multimodal Optimization
    Lin, Ziwei
    Matta, Andrea
    Du, Sichang
    Sahin, Evren
    [J]. MATHEMATICS, 2023, 11 (01)
  • [5] Pass-Join: A Partition-based Method for Similarity Joins
    Li, Guoliang
    Deng, Dong
    Wang, Jiannan
    Feng, Jianhua
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 5 (03): : 253 - 264
  • [6] FrepJoin: an efficient partition-based algorithm for edit similarity join
    Ji-zhou Luo
    Sheng-fei Shi
    Hong-zhi Wang
    Jian-zhong Li
    [J]. Frontiers of Information Technology & Electronic Engineering, 2017, 18 : 1499 - 1510
  • [7] FrepJoin:an efficient partition-based algorithm for edit similarity join
    Ji-zhou LUO
    Sheng-fei SHI
    Hong-zhi WANG
    Jian-zhong LI
    [J]. Frontiers of Information Technology & Electronic Engineering, 2017, 18 (10) : 1499 - 1510
  • [8] FrepJoin: an efficient partition-based algorithm for edit similarity join
    Luo, Ji-zhou
    Shi, Sheng-fei
    Wang, Hong-zhi
    Li, Jian-zhong
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2017, 18 (10) : 1499 - 1510
  • [9] A practical partition-based approach for ontology version
    Wang, ZJ
    Zhang, SS
    Wang, YL
    Du, T
    [J]. CURRENT TRENDS IN HIGH PERFORMANCE COMPUTING AND ITS APPLICATIONS, PROCEEDINGS, 2005, : 495 - 499
  • [10] An Optimal Sample Allocation Strategy for Partition-Based Random Search
    Chen, Weiwei
    Gao, Siyang
    Chen, Chun-Hung
    Shi, Leyuan
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2014, 11 (01) : 177 - 186