Phylogenetic Detection of Recombination with a Bayesian Prior on the Distance between Trees

被引:21
|
作者
Martins, Leonardo de Oliveira [1 ]
Leal, Elcio [2 ]
Kishino, Hirohisa [1 ]
机构
[1] Univ Tokyo, Grad Sch Agr & Life Sci, Tokyo, Japan
[2] Univ Fed Sao Paulo, Sao Paulo, Brazil
来源
PLOS ONE | 2008年 / 3卷 / 07期
基金
日本学术振兴会;
关键词
D O I
10.1371/journal.pone.0002651
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genomic regions participating in recombination events may support distinct topologies, and phylogenetic analyses should incorporate this heterogeneity. Existing phylogenetic methods for recombination detection are challenged by the enormous number of possible topologies, even for a moderate number of taxa. If, however, the detection analysis is conducted independently between each putative recombinant sequence and a set of reference parentals, potential recombinations between the recombinants are neglected. In this context, a recombination hotspot can be inferred in phylogenetic analyses if we observe several consecutive breakpoints. We developed a distance measure between unrooted topologies that closely resembles the number of recombinations. By introducing a prior distribution on these recombination distances, a Bayesian hierarchical model was devised to detect phylogenetic inconsistencies occurring due to recombinations. This model relaxes the assumption of known parental sequences, still common in HIV analysis, allowing the entire dataset to be analyzed at once. On simulated datasets with up to 16 taxa, our method correctly detected recombination breakpoints and the number of recombination events for each breakpoint. The procedure is robust to rate and transition: transversion heterogeneities for simulations with and without recombination. This recombination distance is related to recombination hotspots. Applying this procedure to a genomic HIV-1 dataset, we found evidence for hotspots and de novo recombination.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Distances Between Phylogenetic Trees: A Survey
    Feng Shi
    Qilong Feng
    Jianer Chen
    Lusheng Wang
    Jianxin Wang
    TsinghuaScienceandTechnology, 2013, 18 (05) : 490 - 499
  • [42] The persistence of clade prior influence in Bayesian phylogenetic analyses.
    Pickett, KM
    Randle, CP
    CLADISTICS, 2004, 20 (06) : 602 - 602
  • [43] Prior Density Learning in Variational Bayesian Phylogenetic Parameters Inference
    Remita, Amine M.
    Vitae, Golrokh
    Diallo, Abdoulaye Banire
    COMPARATIVE GENOMICS, RECOMB-CG 2023, 2023, 13883 : 112 - 130
  • [44] SIMPLE METHOD FOR CONSTRUCTING PHYLOGENETIC TREES FROM DISTANCE MATRICES
    LI, WH
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCES, 1981, 78 (02): : 1085 - 1089
  • [45] Tool that Integrates Distance Based Programs for Reconstructing Phylogenetic Trees
    Torres, M.
    Dias, G.
    Goncalves, G.
    Vieira, C.
    IEEE LATIN AMERICA TRANSACTIONS, 2011, 9 (05) : 895 - 901
  • [46] Effects of sequence diversity and recombination on the accuracy of phylogenetic trees estimated by kSNP
    Hall, Barry G.
    CLADISTICS, 2016, 32 (01) : 90 - 99
  • [47] Phylogenetic inference under recombination using Bayesian stochastic topology selection
    Webb, Alex
    Hancock, John M.
    Holmes, Chris C.
    BIOINFORMATICS, 2009, 25 (02) : 197 - 203
  • [49] Detecting contact in language trees: a Bayesian phylogenetic model with horizontal transfer
    Nico Neureiter
    Peter Ranacher
    Nour Efrat-Kowalsky
    Gereon A. Kaiping
    Robert Weibel
    Paul Widmer
    Remco R. Bouckaert
    Humanities and Social Sciences Communications, 9
  • [50] An Edit Distance between Quotiented Trees
    Algorithmica, 2003, 36 : 1 - 39