On distances between phylogenetic trees

Citations: 0
Authors
He, X
Jiang, T
Li, M
Tromp, J
DOI
Not available
Chinese Library Classification
TP31 [Computer software];
Discipline classification codes
081202; 0835;
Abstract
Different phylogenetic trees for the same group of species are often produced either by procedures that use diverse optimality criteria [18] or from different genes [12] in the study of molecular evolution. Comparing these trees to find their similarities (e.g., agreement or consensus) and dissimilarities (i.e., distance) is thus an important issue in computational molecular biology. The nearest neighbor interchange (nni) distance [25, 24, 32, 4, 5, 3, 16, 17, 19, 29, 20, 21, 23] and the subtree-transfer distance [12, 13, 15] are two major distance metrics that have been proposed and extensively studied for different reasons. Despite their many appealing aspects, such as simplicity and sensitivity to tree topologies, computing these distances has remained very challenging. This article studies the complexity of, and efficient approximation algorithms for, computing the nni distance and a natural extension of the subtree-transfer distance, called the linear-cost subtree-transfer distance. The linear-cost subtree-transfer model is more logical than the (unit-cost) subtree-transfer model and in fact coincides with the nni model under certain conditions. The following results have been obtained as part of our project of building a comprehensive software package for computing distances between phylogenies.
1. Computing the nni distance is NP-complete. This solves a 25-year-old open question appearing again and again in, for example, [25, 32, 4, 5, 3, 16, 17, 19, 20, 21, 23] under the complexity-theoretic assumption of P ≠ NP. We also answer an open question [4] regarding the nni distance between unlabeled trees, for which an erroneous proof appeared in [19]. We give an algorithm to compute the optimal nni sequence in time $O(n^2 \log n + n \cdot 2^{O(d)})$, where the nni distance is at most d. The algorithm allows us to implement practical programs when d is small. All of the above results also hold for linear-cost subtree-transfer.
2. Biological applications require us to extend the nni and linear-cost subtree-transfer models to weighted phylogenies, where edge weights indicate the length of evolution along each edge. We present a logarithmic-ratio approximation algorithm for nni and a ratio-2 approximation algorithm for linear-cost subtree-transfer on weighted trees.
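To make the nni operation concrete, here is a minimal Python sketch that computes the nni distance between two very small trees by breadth-first search over all trees reachable by single nni moves. It is not the algorithm of the abstract (which runs in $O(n^2 \log n + n \cdot 2^{O(d)})$ time); it is a brute-force illustration whose cost grows exponentially with the number of leaves. The representation is an assumption made for this sketch: an unrooted binary tree is written as a nested tuple over all leaves except one implicit reference leaf, which hangs off the root. The function names `leafset`, `clusters`, `nni_neighbors`, and `nni_distance` are hypothetical.

```python
from collections import deque


def leafset(t):
    """Return the set of leaf labels in a nested-tuple tree."""
    if not isinstance(t, tuple):
        return frozenset([t])
    return frozenset().union(*(leafset(c) for c in t))


def clusters(t):
    """Return the leaf sets of all internal nodes except the root.

    With the implicit reference leaf attached at the root, each such
    cluster corresponds to one internal edge of the unrooted tree.
    """
    out = set()

    def walk(node):
        if not isinstance(node, tuple):
            return frozenset([node])
        s = frozenset().union(*(walk(c) for c in node))
        out.add(s)
        return s

    out.discard(walk(t))
    return frozenset(out)


def children(x, cl):
    """Return the two child leaf sets of cluster x in a binary tree."""
    inside = [y for y in cl if y < x]
    maximal = [y for y in inside if not any(y < z for z in inside)]
    covered = frozenset().union(*maximal)
    return maximal + [frozenset([leaf]) for leaf in x - covered]


def nni_neighbors(cl, leaves):
    """Yield every cluster set that is one nni move away from cl."""
    for x in cl:
        # Parent: smallest cluster strictly containing x (or the root).
        parent = min((y for y in cl if x < y), key=len, default=leaves)
        sibling = parent - x
        # Swap the sibling subtree with either child of x.
        for child in children(x, cl):
            yield (cl - {x}) | {child | sibling}


def nni_distance(t1, t2):
    """Brute-force nni distance via BFS; only sensible for tiny trees."""
    leaves = leafset(t1)
    if leaves != leafset(t2):
        raise ValueError("trees must have the same leaf set")
    start, goal = clusters(t1), clusters(t2)
    queue, seen = deque([(start, 0)]), {start}
    while queue:
        cur, dist = queue.popleft()
        if cur == goal:
            return dist
        for nb in nni_neighbors(cur, leaves):
            if nb not in seen:
                seen.add(nb)
                queue.append((nb, dist + 1))
    raise RuntimeError("goal not reached; are both trees binary?")
```

For example, with four explicit leaves plus the implicit reference leaf, `nni_distance(((1, 2), (3, 4)), (1, (2, (3, 4))))` explores a space of only 15 trees. The search space grows super-exponentially with the number of leaves, which is consistent with the hardness result stated in the abstract.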
Pages: 427-436
Page count: 10