HYBRIDIZATION NUMBER ON THREE ROOTED BINARY TREES IS EPT

被引:12
|
作者
Van Iersel, Leo [1 ]
Kelk, Steven [2 ]
Lekic, Nela [2 ]
Whidden, Chris [3 ,4 ]
Zeh, Norbert [5 ]
机构
[1] Delft Univ Technol, Delft Inst Appl Math, POB 5, NL-2600 AA Delft, Netherlands
[2] Maastricht Univ, DKE, POB 616, NL-6200 MD Maastricht, Netherlands
[3] Fred Hutchinson Canc Res Ctr, Seattle, WA 98109 USA
[4] Life Sci Res Fdn, Princeton, NJ USA
[5] Dalhousie Univ, Fac Comp Sci, 6050 Univ Ave, Halifax, NS B3H 1W5, Canada
基金
加拿大自然科学与工程研究理事会; 美国国家科学基金会;
关键词
hybridization number; rooted phylogenetic tree; rooted phylogenetic network; reticulate evolution; agreement forest; fixed parameter tractability; NETWORKS; ALGORITHMS; EVENTS;
D O I
10.1137/15M1036579
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Phylogenetic networks are leaf-labeled directed acyclic graphs that are used to describe nontreelike evolutionary histories and are thus a generalization of phylogenetic trees. The hybridization number of a phylogenetic network is the sum of all in-degrees minus the number of nodes plus one. The hybridization number problem takes as input a collection of rooted binary phylogenetic trees and asks to construct a phylogenetic network that contains an embedding of each of the input trees and has the smallest possible hybridization number. We present an algorithm for the hybridization number problem on three binary phylogenetic trees on n leaves that runs in time O(c(k)poly(n)) with k the hybridization number of an optimal network and c some (astronomical) constant. For the case of two trees, an algorithm with running time O(3.18(k)n) was proposed before, whereas an algorithm with running time O(c(k)poly(n)), also called an EPT algorithm, had prior to this article remained elusive for more than two trees. The algorithm for two trees uses the close connection to acyclic agreement forests to achieve a linear exponent in the running time, while previous algorithms for more than two trees (explicitly or implicitly) relied on a brute force search through all possible underlying network topologies, leading to running times that are not O(c(k)poly(n)) for any c. The connection to acyclic agreement forests is much weaker for more than two trees, so even given the right agreement forest, the reconstruction of the network poses major challenges. We prove novel structural results that allow us to reconstruct a network without having to guess the underlying topology. Our techniques generalize to more than three input trees with the exception of one key lemma that maps nodes in the network to tree nodes in order to minimize the amount of guessing involved in constructing the network. The main open problem therefore is to prove results that establish such a mapping for more than three trees.
引用
收藏
页码:1607 / 1631
页数:25
相关论文
共 50 条
  • [41] The distribution of the number of the infected individuals in a stochastic SIR model on regular rooted trees
    Jamshidi, Babak
    Alavi, Sayed Mohammad Reza
    Parham, Gholam Ali
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2021, 50 (05) : 1427 - 1444
  • [42] L-Functions of Carlitz modules, resultantal varieties and rooted binary trees, II
    Grishkov, A.
    Logachev, D.
    Zobnin, A.
    JOURNAL OF ALGEBRA AND ITS APPLICATIONS, 2023, 22 (06)
  • [43] Improved Practical Algorithms for Rooted Subtree Prune and Regraft (rSPR) Distance and Hybridization Number
    Yamada, Kohei
    Chen, Zhi-Zhong
    Wang, Lusheng
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2020, 27 (09) : 1422 - 1432
  • [44] On locally finite ordered rooted trees and their rooted subtrees
    Agnarsson, Geir
    Alhajjar, Elie
    Dawkins, Aleyah
    DISCRETE APPLIED MATHEMATICS, 2025, 360 : 1 - 21
  • [45] Improved approximation algorithm for maximum agreement forest of two rooted binary phylogenetic trees
    Shi, Feng
    Feng, Qilong
    You, Jie
    Wang, Jianxin
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2016, 32 (01) : 111 - 143
  • [46] L-functions of Carlitz modules, resultantal varieties and rooted binary trees - I
    Grishkov, A.
    Logachev, D.
    Zobnin, A.
    JOURNAL OF NUMBER THEORY, 2022, 238 : 269 - 312
  • [47] Fast Computation of the Exact Hybridization Number of Two Phylogenetic Trees
    Wu, Yufeng
    Wang, Jiayin
    BIOINFORMATICS RESEARCH AND APPLICATIONS, PROCEEDINGS, 2010, 6053 : 203 - 214
  • [48] Improved approximation algorithm for maximum agreement forest of two rooted binary phylogenetic trees
    Feng Shi
    Qilong Feng
    Jie You
    Jianxin Wang
    Journal of Combinatorial Optimization, 2016, 32 : 111 - 143
  • [49] COMPUTER ENUMERATION AND GENERATION OF TREES AND ROOTED TREES
    KNOP, JV
    MULLER, WR
    JERICEVIC, Z
    TRINAJSTIC, N
    JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1981, 21 (02): : 91 - 99
  • [50] Three-dimensional embedding of binary trees
    Bein, WW
    Larmore, LL
    Shields, C
    Sudborough, IH
    I-SPAN 2000: INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES ALGORITHMS AND NETWORKS, PROCEEDINGS, 2000, : 140 - 147