RTED: A Robust Algorithm for the Tree Edit Distance

被引:113
|
作者
Pawlik, Mateusz [1 ]
Augsten, Nikolaus [1 ]
机构
[1] Free Univ Bozen Bolzano, Bolzano, Italy
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2011年 / 5卷 / 04期
关键词
D O I
10.14778/2095686.2095692
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the classical tree edit distance between ordered labeled trees, which is defined as the minimum-cost sequence of node edit operations that transform one tree into another. The state-of-the-art solutions for the tree edit distance are not satisfactory. The main competitors in the field either have optimal worst-case complexity, but the worst case happens frequently, or they are very efficient for some tree shapes, but degenerate for others. This leads to unpredictable and often infeasible runtimes. There is no obvious way to choose between the algorithms. In this paper we present RTED, a robust tree edit distance algorithm. The asymptotic complexity of RTED is smaller or equal to the complexity of the best competitors for any input instance, i.e., RTED is both efficient and worst -case optimal. We introduce the class of LRH (Left -Right -Heavy) algorithms, which includes RTED and the fastest tree edit distance algorithms presented in literature. We prove that RTED outperforms all previously proposed LRH algorithms in terms of runtime complexity. In our experiments on synthetic and real world data we empirically evaluate our solution and compare it to the state-of-the-art.
引用
下载
收藏
页码:334 / 345
页数:12
相关论文
共 50 条
  • [21] An improved algorithm for tree edit distance with applications for RNA secondary structure comparison
    Chen, Shihyen
    Zhang, Kaizhong
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2014, 27 (04) : 778 - 797
  • [22] An improved algorithm for tree edit distance with applications for RNA secondary structure comparison
    Shihyen Chen
    Kaizhong Zhang
    Journal of Combinatorial Optimization, 2014, 27 : 778 - 797
  • [23] Decomposition algorithms for the tree edit distance problem
    Dulucq, Serge
    Touzet, Helene
    JOURNAL OF DISCRETE ALGORITHMS, 2005, 3 (2-4) : 448 - 471
  • [24] Graph Edit Distance Compacted Search Tree
    Chegrane, Ibrahim
    Hocine, Imane
    Yahiaoui, Said
    Bendjoudi, Ahcene
    Nouali-Taboudjemat, Nadia
    SIMILARITY SEARCH AND APPLICATIONS (SISAP 2022), 2022, 13590 : 181 - 189
  • [25] Analyzing edit distance on trees: Tree swap distance is intractable
    Department of Computing Science, Umeå University, 90187 Umeå, Sweden
    Proc. Prag. Str. Conf., (59-73):
  • [26] Tree edit distance from information theory
    Torsello, A
    Hancock, ER
    GRAPH BASED REPRESENTATIONS IN PATTERN RECOGNITION, PROCEEDINGS, 2003, 2726 : 71 - 82
  • [27] Tree Edit Distance and Maximum Agreement Subtree
    Shin, Kilho
    INFORMATION PROCESSING LETTERS, 2015, 115 (01) : 69 - 73
  • [28] A survey on tree edit distance and related problems
    Bille, P
    THEORETICAL COMPUTER SCIENCE, 2005, 337 (1-3) : 217 - 239
  • [29] Learning probabilistic models of tree edit distance
    Bernard, Marc
    Boyer, Laurent
    Habrard, Amaury
    Sebban, Marc
    PATTERN RECOGNITION, 2008, 41 (08) : 2611 - 2629
  • [30] Analysis of tree edit distance on XML data
    Wu, Yu-Fang
    Lin, Shu-Fen
    Yen, Hsu-Chun
    PROCEEDINGS OF THE SIXTH IASTED INTERNATIONAL CONFERENCE ON COMMUNICATIONS, INTERNET, AND INFORMATION TECHNOLOGY, 2007, : 5 - 10