Efficient processing of graph similarity queries with edit distance constraints

Cited by: 39
Authors
Zhao, Xiang [1 ,2 ]
Xiao, Chuan [3 ]
Lin, Xuemin [1 ]
Wang, Wei [1 ]
Ishikawa, Yoshiharu [3 ]
Affiliations
[1] Univ New S Wales, Sydney, NSW, Australia
[2] NICTA, Sydney, NSW, Australia
[3] Nagoya Univ, Nagoya, Aichi 4648601, Japan
Source
VLDB JOURNAL | 2013 / Vol. 22 / Issue 06
Keywords
Graph similarity query; Edit distance; q-Gram; JOINS;
DOI
10.1007/s00778-013-0306-1
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract
Graphs are widely used to model complicated data semantics in many applications in bioinformatics, chemistry, social networks, pattern recognition, etc. A recent trend is to tolerate noise arising from various sources, such as erroneous data entries, and to find similarity matches. In this paper, we study graph similarity queries with edit distance constraints. Inspired by the q-gram idea for string similarity problems, our solution extracts paths from graphs as features for indexing. We establish a lower bound on the number of common features to generate candidates. Efficient algorithms are proposed to handle three types of graph similarity queries by exploiting both matching and mismatching features, as well as degree information, to improve the filtering and verification of candidates. Extensive experiments on real and synthetic datasets demonstrate that the proposed algorithms significantly outperform existing approaches.
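The path-based q-gram idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it ignores edge labels, enumerates simple paths naively, and the function names and the `affected_per_edit` parameter (an assumed upper bound on how many q-grams a single edit operation can destroy in either graph) are hypothetical. Under that assumption, two graphs within edit distance `tau` must share at least `max(|Qa|, |Qb|) - tau * affected_per_edit` q-grams, so any pair below that count can be discarded before verification.

```python
from collections import Counter


def path_qgrams(adj, labels, q):
    """Multiset of vertex-label sequences over simple paths with q edges (q >= 1)."""
    if q < 1:
        raise ValueError("q must be at least 1 in this sketch")
    grams = Counter()

    def extend(path):
        if len(path) == q + 1:
            seq = tuple(labels[v] for v in path)
            # Store one canonical orientation of the undirected path.
            grams[min(seq, seq[::-1])] += 1
            return
        for nxt in adj[path[-1]]:
            if nxt not in path:  # simple paths only
                extend(path + [nxt])

    for start in adj:
        extend([start])
    # Every undirected path was walked once from each end, so halve the counts.
    return Counter({gram: n // 2 for gram, n in grams.items()})


def common_qgrams(qa, qb):
    """Size of the multiset intersection of two q-gram multisets."""
    return sum((qa & qb).values())


def passes_count_filter(qa, qb, tau, affected_per_edit):
    """Keep a pair only if it shares enough q-grams to possibly be within tau edits.

    affected_per_edit is an assumed bound, not a value taken from the paper.
    """
    larger = max(sum(qa.values()), sum(qb.values()))
    lower_bound = larger - tau * affected_per_edit
    return common_qgrams(qa, qb) >= lower_bound
```

Graphs are passed as an adjacency dictionary (vertex to list of neighbours) plus a dictionary of string vertex labels. A pair that passes the filter is only a candidate; it would still need an exact edit distance check.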
Pages: 727 - 752
Page count: 26