SuMoTED: An intuitive edit distance between rooted unordered uniquely-labelled trees

被引:6
|
作者
McVicar, Matt [1 ]
Sach, Benjamin [2 ]
Mesnage, Cedric [1 ]
Lijffijt, Jefrey [1 ,3 ]
Spyropoulou, Eirini [1 ]
De Bie, Tijl [1 ,3 ]
机构
[1] Univ Bristol, Dept Engn Math, Woodland Rd, Bristol BS8 1UB, England
[2] Univ Bristol, Dept Comp Sci, Woodland Rd, Bristol BS8 1UB, England
[3] Univ Ghent, Data Sci Lab, B-9000 Ghent, Belgium
基金
欧洲研究理事会; 英国工程与自然科学研究理事会;
关键词
Tree edit distance; Taxonomies; ALGORITHMS;
D O I
10.1016/j.patrec.2016.04.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Defining and computing distances between tree structures is a classical area of study in theoretical computer science, with practical applications in the areas of computational biology, information retrieval, text analysis, and many others. In this paper, we focus on rooted, unordered, uniquely-labelled trees such as taxonomies and other hierarchies. For trees as these, we introduce the intuitive concept of a 'local move' operation as an atomic edit of a tree. We then introduce SuMoTED, a new edit distance measure between such trees, defined as the minimal number of local moves required to convert one tree into another. We show how SuMoTED can be computed using a scalable algorithm with quadratic time complexity. Finally, we demonstrate its use on a collection of music genre taxonomies. (C) 2016TheAuthors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:52 / 59
页数:8
相关论文
共 31 条
  • [1] Designing an A* algorithm for calculating edit distance between rooted-unordered trees
    Horesh, Yair
    Mehr, Ramit
    Unger, Ron
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2006, 13 (06) : 1165 - 1176
  • [2] A constrained edit distance between unordered labeled trees
    Zhang, KZ
    ALGORITHMICA, 1996, 15 (03) : 205 - 222
  • [3] Anchored Alignment Distance between Rooted Labeled Unordered Trees
    Yoshino, Takuya
    Ishizaka, Yuma
    Hirata, Kouichi
    PROCEEDINGS OF THE 2017 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2017, : 433 - 440
  • [4] Exact algorithms for computing the tree edit distance between unordered trees
    Akutsu, Tatsuya
    Fukagawa, Daiji
    Takasu, Atsuhiro
    Tamura, Takeyuki
    THEORETICAL COMPUTER SCIENCE, 2011, 412 (4-5) : 352 - 364
  • [5] Distance between rooted and unordered trees based on vertex and edge mappings
    Liu, SM
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2004, E87A (05) : 1034 - 1041
  • [6] Approximation and parameterized algorithms for common subtrees and edit distance between unordered trees
    Akutsu, Tatsuya
    Fukagawa, Daiji
    Halldorsson, Magnus M.
    Takasu, Atsuhiro
    Tanaka, Keisuke
    THEORETICAL COMPUTER SCIENCE, 2013, 470 : 10 - 22
  • [7] Efficient exponential-time algorithms for edit distance between unordered trees
    Akutsu, Tatsuya
    Tamura, Takeyuki
    Fukagawa, Daiji
    Takasu, Atsuhiro
    JOURNAL OF DISCRETE ALGORITHMS, 2014, 25 : 79 - 93
  • [8] Fast Computation of the Tree Edit Distance between Unordered Trees Using IP Solvers
    Kondo, Seiichi
    Otaki, Keisuke
    Ikeda, Madori
    Yamamoto, Akihiro
    DISCOVERY SCIENCE, DS 2014, 2014, 8777 : 156 - 167
  • [9] Constant Factor Approximation of Edit Distance of Bounded Height Unordered Trees
    Fukagawa, Daiji
    Akutsu, Tatsuya
    Takasu, Atsuhiro
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5721 : 7 - +
  • [10] Improved MAX SNP-Hard Results for Finding an Edit Distance between Unordered Trees
    Hirata, Kouichi
    Yamamoto, Yoshiyuki
    Kuboyama, Tetsuji
    COMBINATORIAL PATTERN MATCHING, 22ND ANNUAL SYMPOSIUM, CPM 2011, 2011, 6661 : 402 - 415