On distances between phylogenetic trees

被引:0
|
作者
He, X
Jiang, T
Li, M
Tromp, J
机构
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Different phylogenetic trees for the same group of species are often produced either by procedures that use diverse optimality criteria [18] or from different genes [12] in the study of molecular evolution. Comparing these trees to find their similarities(e.g. agreement or consensus) and dissimilarities, i.e. distance, is thus an important issue in computational molecular biology. The nearest neighbor interchange (nni) distance [25, 24, 32, 4, 5, 3, 16, 17, 19, 29, 20, 21, 23] and the subtree-transfer distance [12, 13, 15] are two major distance metrics that have been proposed and extensively studied for different reasons. Despite their many appealing aspects such as simplicity and sensitivity to tree topologies, computing these distances has remained very challenging. This article studies the complexity and efficient approximation algorithms for computing the nni distance and a natural extension of the subtree-transfer distance, called the linear-cost subtree-transfer distance. The linear-cost subtree-transfer model is more logical than the (unit-cost) subtree-transfer model and in fact coincides with the nni model under certain conditions. The following results have been obtained as part of our project of building a comprehensive software package for computing distances between phylogenies. 1. Computing the nni distance is NP-complete. This solves a 25 year old open question appearing again and again in, for example, [25, 32, 4, 5, 3, 16, 17, 19, 20, 21, 23] under the complexity-theoretic assumption of P not equal NP. We also answer an open question [4] regarding the nni distance between unlabeled trees for which an erroneous proof appeared in [19]. We give an algorithm to compute the optimal nni sequence in time O(n(2) log n+n . 2(O(d))), where the nni distance is at most d. The algorithm allows us to implement practical programs when d is small. All above results also hold for linear-cost subtree-transfer. 2. Biological applications require us to extend the nni and linear-cost subtree-transfer models to weighted phylogenies, where edge weights indicate the length of evolution along each edge. We present a logarithmic ratio approximation algorithm for nni and a ratio 2 approximation algorithm for linear-cost subtree-transfer, on weighted trees.
引用
收藏
页码:427 / 436
页数:10
相关论文
共 50 条
  • [21] On the distribution of distances between specified nodes in increasing trees
    Kuba, Markus
    Panholzer, Alois
    DISCRETE APPLIED MATHEMATICS, 2010, 158 (05) : 489 - 506
  • [22] ON A MATCHING DISTANCE BETWEEN ROOTED PHYLOGENETIC TREES
    Bogdanowicz, Damian
    Giaro, Krzysztof
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2013, 23 (03) : 669 - 684
  • [23] Genetic distances and phylogenetic trees of different Awassi sheep populations based on DNA sequencing
    Al-Atiyat, R. M.
    Aljumaah, R. S.
    GENETICS AND MOLECULAR RESEARCH, 2014, 13 (03): : 6557 - 6568
  • [24] On the Maximum Parsimony Distance Between Phylogenetic Trees
    Mareike Fischer
    Steven Kelk
    Annals of Combinatorics, 2016, 20 : 87 - 113
  • [25] ON THE MAXIMUM QUARTET DISTANCE BETWEEN PHYLOGENETIC TREES
    Alon, Noga
    Naves, Humberto
    Sudakov, Benny
    SIAM JOURNAL ON DISCRETE MATHEMATICS, 2016, 30 (02) : 718 - 735
  • [26] On the Maximum Parsimony Distance Between Phylogenetic Trees
    Fischer, Mareike
    Kelk, Steven
    ANNALS OF COMBINATORICS, 2016, 20 (01) : 87 - 113
  • [27] RELATIONSHIP BETWEEN MINIMUM SPANNING-TREES AND ADDITIVE PHYLOGENETIC TREES
    ZONTASGARAMELLA, L
    ATTI ASSOCIAZIONE GENETICA ITALIANA, 1981, 27 : 414 - 415
  • [28] THE DISTANCES BETWEEN UNROOTED AND CYCLICALLY ORDERED TREES AND THEIR COMPUTING METHODS
    LIU, SM
    TANAKA, E
    MASUDA, S
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1994, E77D (10) : 1094 - 1105
  • [29] Majorization and distances in trees
    Dahl, Geir
    NETWORKS, 2007, 50 (04) : 251 - 257
  • [30] An exact algorithm for the geodesic distance between phylogenetic trees
    Kupczok, Anne
    Von Haeseler, Arndt
    Klaere, Steffen
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2008, 15 (06) : 577 - 591