Computing the probability of gene trees concordant with the species tree in the multispecies coalescent

被引:1
|
作者
Truszkowski, Jakub [1 ,4 ]
Scornavacca, Celine [2 ,3 ]
Pardi, Fabio [1 ,3 ]
机构
[1] Univ Montpellier, CNRS, LIRMM, Montpellier, France
[2] Univ Montpellier, CNRS, ISEM, Montpellier, France
[3] Inst Biol Computat, Montpellier, France
[4] RBC Borealis AI, Waterloo, ON, Canada
关键词
Multispecies coalescent; Gene tree; Species tree; Coalescent; Dynamic programming; Incomplete lineage sorting; INFERENCE; GENEALOGY; ALGORITHM;
D O I
10.1016/j.tpb.2020.12.002
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
The multispecies coalescent process models the genealogical relationships of genes sampled from several species, enabling useful predictions about phenomena such as the discordance between a gene tree and the species phylogeny due to incomplete lineage sorting. Conversely, knowledge of large collections of gene trees can inform us about several aspects of the species phylogeny, such as its topology and ancestral population sizes. A fundamental open problem in this context is how to efficiently compute the probability of a gene tree topology, given the species phylogeny. Although a number of algorithms for this task have been proposed, they either produce approximate results, or, when they are exact, they do not scale to large data sets. In this paper, we present some progress towards exact and efficient computation of the probability of a gene tree topology. We provide a new algorithm that, given a species tree and the number of genes sampled for each species, calculates the probability that the gene tree topology will be concordant with the species tree. Moreover, we provide an algorithm that computes the probability of any specific gene tree topology concordant with the species tree. Both algorithms run in polynomial time and have been implemented in Python. Experiments show that they are able to analyze data sets where thousands of genes are sampled in a matter of minutes to hours. (c) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:22 / 31
页数:10
相关论文
共 50 条
  • [31] STELLS2: fast and accurate coalescent-based maximum likelihood inference of species trees from gene tree topologies
    Pei, Jingwen
    Wu, Yufeng
    BIOINFORMATICS, 2017, 33 (12) : 1789 - 1797
  • [32] Hierarchical Heuristic Species Delimitation Under the Multispecies Coalescent Model with Migration
    Kornai, Daniel
    Jiao, Xiyun
    Ji, Jiayi
    Flouri, Tomas
    Yang, Ziheng
    SYSTEMATIC BIOLOGY, 2024,
  • [33] NANUQ: a method for inferring species networks from gene trees under the coalescent model
    Allman, Elizabeth S.
    Banos, Hector
    Rhodes, John A.
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2019, 14 (01)
  • [34] Algorithmic improvements to species delimitation and phylogeny estimation under the multispecies coalescent
    Jones, Graham
    JOURNAL OF MATHEMATICAL BIOLOGY, 2017, 74 (1-2) : 447 - 467
  • [35] On the Robustness to Gene Tree Estimation Error (or lack thereof) of Coalescent-Based Species Tree Methods
    Roch, Sebastien
    Warnow, Tandy
    SYSTEMATIC BIOLOGY, 2015, 64 (04) : 663 - 676
  • [36] NANUQ: a method for inferring species networks from gene trees under the coalescent model
    Elizabeth S. Allman
    Hector Baños
    John A. Rhodes
    Algorithms for Molecular Biology, 14
  • [37] Inferring Species Trees Directly from Biallelic Genetic Markers: Bypassing Gene Trees in a Full Coalescent Analysis
    Bryant, David
    Bouckaert, Remco
    Felsenstein, Joseph
    Rosenberg, Noah A.
    RoyChoudhury, Arindam
    MOLECULAR BIOLOGY AND EVOLUTION, 2012, 29 (08) : 1917 - 1932
  • [38] Algorithmic improvements to species delimitation and phylogeny estimation under the multispecies coalescent
    Graham Jones
    Journal of Mathematical Biology, 2017, 74 : 447 - 467
  • [39] Impact of Model Violations on the Inference of Species Boundaries Under the Multispecies Coalescent
    Barley, Anthony J.
    Brown, Jeremy M.
    Thomson, Robert C.
    SYSTEMATIC BIOLOGY, 2018, 67 (02) : 269 - 284
  • [40] From gene to organismal phylogeny: Reconciled trees and the gene tree species tree problem
    Page, RDM
    Charleston, MA
    MOLECULAR PHYLOGENETICS AND EVOLUTION, 1997, 7 (02) : 231 - 240