On distances between phylogenetic trees

Authors
He, X
Jiang, T
Li, M
Tromp, J
Chinese Library Classification
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
Different phylogenetic trees for the same group of species are often produced, either by procedures that use diverse optimality criteria [18] or from different genes [12], in the study of molecular evolution. Comparing these trees to find their similarities (e.g., agreement or consensus) and dissimilarities (i.e., distance) is thus an important issue in computational molecular biology. The nearest neighbor interchange (nni) distance [25, 24, 32, 4, 5, 3, 16, 17, 19, 29, 20, 21, 23] and the subtree-transfer distance [12, 13, 15] are two major distance metrics that have been proposed and extensively studied for different reasons. Despite their many appealing aspects, such as simplicity and sensitivity to tree topologies, computing these distances has remained very challenging.

This article studies the complexity of, and efficient approximation algorithms for, computing the nni distance and a natural extension of the subtree-transfer distance called the linear-cost subtree-transfer distance. The linear-cost subtree-transfer model is more logical than the (unit-cost) subtree-transfer model and in fact coincides with the nni model under certain conditions. The following results have been obtained as part of our project of building a comprehensive software package for computing distances between phylogenies.

1. Computing the nni distance is NP-complete. Under the complexity-theoretic assumption that P ≠ NP, this resolves a 25-year-old open question that has appeared repeatedly in, for example, [25, 32, 4, 5, 3, 16, 17, 19, 20, 21, 23]. We also answer an open question [4] regarding the nni distance between unlabeled trees, for which an erroneous proof appeared in [19]. We give an algorithm that computes an optimal nni sequence in time $O(n^2 \log n + n \cdot 2^{O(d)})$, where the nni distance is at most d. The algorithm allows us to implement practical programs when d is small. All of the above results also hold for linear-cost subtree-transfer.

2. Biological applications require us to extend the nni and linear-cost subtree-transfer models to weighted phylogenies, where edge weights indicate the length of evolution along each edge. We present a logarithmic-ratio approximation algorithm for nni and a ratio-2 approximation algorithm for linear-cost subtree-transfer on weighted trees.
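To make the nni distance concrete, the following is a minimal Python sketch that computes it by brute-force breadth-first search over tree topologies. It is not the algorithm from the article: it takes time exponential in the number of leaves, which is consistent with the NP-completeness result, and is only usable on very small trees. The representation (an adjacency dict with string leaf labels and integer internal nodes) and the function names `leaf_side`, `splits`, `nni_neighbors`, and `nni_distance` are assumptions made for illustration.

```python
from collections import deque

# Assumed representation: an unrooted binary tree is a dict mapping each node
# to the set of its neighbours. Leaves are strings, internal nodes are ints.

def leaf_side(adj, start, blocked):
    """Leaves reachable from `start` without passing through `blocked`."""
    seen, stack, out = {blocked, start}, [start], set()
    while stack:
        x = stack.pop()
        if isinstance(x, str):
            out.add(x)
        for y in adj[x]:
            if y not in seen:
                seen.add(y)
                stack.append(y)
    return frozenset(out)

def internal_edges(adj):
    """Yield each edge joining two internal nodes exactly once."""
    for u in adj:
        for v in adj[u]:
            if isinstance(u, int) and isinstance(v, int) and u < v:
                yield u, v

def splits(adj):
    """Canonical form of a topology: its set of nontrivial leaf bipartitions."""
    leaves = frozenset(n for n in adj if isinstance(n, str))
    ref = min(leaves)
    result = set()
    for u, v in internal_edges(adj):
        side = leaf_side(adj, v, u)
        # Store the side that does not contain the reference leaf.
        result.add(leaves - side if ref in side else side)
    return frozenset(result)

def nni_neighbors(adj):
    """Yield every tree one nni move away (two per internal edge)."""
    for u, v in internal_edges(adj):
        _, b = [x for x in adj[u] if x != v]
        c, d = [x for x in adj[v] if x != u]
        for x, y in ((b, c), (b, d)):
            new = {k: set(vs) for k, vs in adj.items()}
            # Exchange subtree x (hanging off u) with subtree y (hanging off v).
            new[u].remove(x); new[x].remove(u)
            new[v].remove(y); new[y].remove(v)
            new[u].add(y); new[y].add(u)
            new[v].add(x); new[x].add(v)
            yield new

def nni_distance(t1, t2):
    """Brute-force BFS; both trees must have the same leaf set."""
    start, target = splits(t1), splits(t2)
    if start == target:
        return 0
    seen, queue = {start}, deque([(t1, 0)])
    while queue:
        tree, dist = queue.popleft()
        for nb in nni_neighbors(tree):
            key = splits(nb)
            if key == target:
                return dist + 1
            if key not in seen:
                seen.add(key)
                queue.append((nb, dist + 1))
    return None

# Quartets AB|CD and AC|BD differ by a single nni move.
q1 = {"A": {0}, "B": {0}, "C": {1}, "D": {1}, 0: {"A", "B", 1}, 1: {"C", "D", 0}}
q2 = {"A": {0}, "C": {0}, "B": {1}, "D": {1}, 0: {"A", "C", 1}, 1: {"B", "D", 0}}
print(nni_distance(q1, q2))
```

Comparing trees through their split sets means that two adjacency dicts with different internal node numbering but the same leaf-labeled topology are treated as the same tree.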
Pages: 427-436
Number of pages: 10