RTED: A Robust Algorithm for the Tree Edit Distance

Cited by: 113
Authors
Pawlik, Mateusz [1]
Augsten, Nikolaus [1]
Affiliations
[1] Free Univ Bozen Bolzano, Bolzano, Italy
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2011 / Vol. 5 / No. 4
DOI
10.14778/2095686.2095692
Chinese Library Classification (CLC) code
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
We consider the classical tree edit distance between ordered labeled trees, which is defined as the minimum-cost sequence of node edit operations that transform one tree into another. The state-of-the-art solutions for the tree edit distance are not satisfactory. The main competitors in the field either have optimal worst-case complexity, but the worst case happens frequently, or they are very efficient for some tree shapes, but degenerate for others. This leads to unpredictable and often infeasible runtimes. There is no obvious way to choose between the algorithms. In this paper we present RTED, a robust tree edit distance algorithm. The asymptotic complexity of RTED is smaller than or equal to the complexity of the best competitors for any input instance, i.e., RTED is both efficient and worst-case optimal. We introduce the class of LRH (Left-Right-Heavy) algorithms, which includes RTED and the fastest tree edit distance algorithms presented in the literature. We prove that RTED outperforms all previously proposed LRH algorithms in terms of runtime complexity. In our experiments on synthetic and real-world data we empirically evaluate our solution and compare it to the state of the art.
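To make the definition in the abstract concrete, the following is a minimal sketch of the textbook recursive formulation of the edit distance between ordered labeled trees. It is not RTED and not any of the LRH algorithms mentioned above; it always decomposes at the rightmost root and relies on memoization. The unit cost model (1 per insert, delete, or rename) and the names `tree`, `forest_dist`, and `tree_edit_distance` are assumptions made purely for illustration.

```python
from functools import lru_cache


def tree(label, *children):
    """Build an ordered labeled tree as a hashable (label, children) tuple."""
    return (label, tuple(children))


@lru_cache(maxsize=None)
def forest_dist(f, g):
    """Edit distance between two ordered forests (tuples of trees).

    Assumed cost model: delete = insert = 1, rename = 1 if labels differ.
    """
    if not f and not g:
        return 0
    if not g:
        # Only deletions remain: remove the rightmost root of f,
        # promoting its children into the forest.
        _, v_children = f[-1]
        return forest_dist(f[:-1] + v_children, g) + 1
    if not f:
        # Only insertions remain: symmetric to the case above.
        _, w_children = g[-1]
        return forest_dist(f, g[:-1] + w_children) + 1

    v_label, v_children = f[-1]
    w_label, w_children = g[-1]
    rename_cost = 0 if v_label == w_label else 1
    return min(
        # Delete the rightmost root v of f.
        forest_dist(f[:-1] + v_children, g) + 1,
        # Insert the rightmost root w of g.
        forest_dist(f, g[:-1] + w_children) + 1,
        # Map v to w: match their subtrees, then the remaining forests.
        forest_dist(v_children, w_children)
        + forest_dist(f[:-1], g[:-1])
        + rename_cost,
    )


def tree_edit_distance(t1, t2):
    """Edit distance between two single trees under the assumed unit costs."""
    return forest_dist((t1,), (t2,))


if __name__ == "__main__":
    t1 = tree("a", tree("b", tree("c")))
    t2 = tree("a", tree("c"))
    print(tree_edit_distance(t1, t2))
```

Always picking the rightmost root is only one possible decomposition strategy. According to the abstract, the choice of strategy is what makes existing algorithms fast on some tree shapes and slow on others, which is the problem RTED addresses.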
Pages: 334-345
Page count: 12