Robinson-Foulds Reticulation Networks

被引:7
|
作者
Markin, Alexey [1 ]
Anderson, Tavis K. [2 ]
Vadali, Venkata Sai Krishna Teja [1 ]
Eulenstein, Oliver [1 ]
机构
[1] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
[2] USDA ARS, Natl Anim Dis Ctr, Washington, DC 20250 USA
基金
美国国家科学基金会;
关键词
INFLUENZA-A VIRUSES; HYBRIDIZATION NUMBER; UNITED-STATES; SWINE; GENE; EVOLUTION; TREES; TRANSMISSION; ALGORITHMS; INFERENCE;
D O I
10.1145/3307339.3342151
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Phylogenetic (hybridization) networks allow investigation of evolutionary species histories that involve complex phylogenetic events other than speciation, such as reassortment in virus evolution or introgressive hybridization in invertebrates and mammals. Reticulation networks can be inferred by solving the reticulation network problem, typically known as the hybridization network problem. Given a collection of phylogenetic input trees, this problem seeks a minimum reticulation network with the smallest number of reticulation vertices into which the input trees can be embedded exactly. Unfortunately, this problem is limited in practice, since minimum reticulation networks can be easily obfuscated by even small topological errors that typically occur in input trees inferred from biological data. We adapt the reticulation network problem to address erroneous input trees using the classic Robinson-Foulds distance. The RF embedding cost allows trees to be embedded into reticulation networks inexactly, but up to a measurable error. The adapted problem, called the Robinson-Foulds reticulation network (RF-Network) problem is, as we show and like many other problems applied in molecular biology, NP-hard. To address this, we employ local search strategies that have been successfully applied in other NP-hard phylogenetic problems. Our local search method benefits from recent theoretical advancements in this area. Further, we introduce in-practice effective algorithms for the computational challenges involved in our local search approach. Using simulations we experimentally validate the ability of our method, RF-Net, to reconstruct correct phylogenetic networks in the presence of error in input data. Finally, we demonstrate how RF-networks can help identify reassortment in influenza A viruses, and provide insight into the evolutionary history of these viruses. RF-Net was able to estimate a large and credible reassortment network with 164 taxa.
引用
收藏
页码:77 / 86
页数:10
相关论文
共 50 条
  • [1] Robinson-Foulds Supertrees
    Bansal, Mukul S.
    Burleigh, J. Gordon
    Eulenstein, Oliver
    Fernandez-Baca, David
    [J]. ALGORITHMS FOR MOLECULAR BIOLOGY, 2010, 5
  • [2] Robinson-Foulds Supertrees
    Mukul S Bansal
    J Gordon Burleigh
    Oliver Eulenstein
    David Fernández-Baca
    [J]. Algorithms for Molecular Biology, 5
  • [3] Faster computation of the Robinson-Foulds distance between phylogenetic networks
    Asano, Tetsuo
    Jansson, Jesper
    Sadakane, Kunihiko
    Uehara, Ryuhei
    Valiente, Gabriel
    [J]. INFORMATION SCIENCES, 2012, 197 : 77 - 90
  • [4] Computing the distribution of the Robinson-Foulds distance
    Hayati, Maryam
    Chindelevitch, Leonid
    [J]. Chindelevitch, Leonid (leonid_chindelevitch@sfu.ca), 1600, Elsevier Ltd (87):
  • [5] Metrics for Phylogenetic Networks I: Generalizations of the Robinson-Foulds Metric
    Cardona, Gabriel
    Llabres, Merce
    Rossello, Francesc
    Valiente, Gabriel
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2009, 6 (01) : 46 - 61
  • [6] Properties of the generalized Robinson-Foulds metric
    Borozan, L.
    Matijevic, D.
    Canzar, S.
    [J]. 2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, : 330 - 335
  • [7] Efficiently computing the Robinson-Foulds metric
    Pattengale, Nicholas D.
    Gottlieb, Eric J.
    Moret, Bernard M. E.
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2007, 14 (06) : 724 - 735
  • [8] Faster Computation of the Robinson-Foulds Distance between Phylogenetic Networks
    Asano, Tetsuo
    Jansson, Jesper
    Sadakane, Kunihiko
    Uehara, Ryuhei
    Valiente, Gabriel
    [J]. COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2010, 6129 : 190 - +
  • [9] Computing the distribution of the Robinson-Foulds distance
    Hayati, Maryam
    Chindelevitch, Leonid
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2020, 87
  • [10] Scalable and Extensible Robinson-Foulds for Comparative Phylogenetics
    Chon, Alvin
    Gorecki, Pawel
    Eulenstein, Oliver
    Huang, Xiaoqiu
    Jannesari, Ali
    [J]. 2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 166 - 175