Enumeration of compact coalescent histories for matching gene trees and species trees

被引:7
|
作者
Disanto, Filippo [1 ]
Rosenberg, Noah A. [2 ]
机构
[1] Univ Pisa, Dept Math, I-56126 Pisa, Italy
[2] Stanford Univ, Dept Biol, Stanford, CA 94305 USA
基金
美国国家卫生研究院;
关键词
Compact coalescent histories; Gene trees; Generating functions; Phylogenetics; Species trees; PROBABILITY; INFERENCE; NUMBER;
D O I
10.1007/s00285-018-1271-5
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Compact coalescent histories are combinatorial structures that describe for a given gene tree G and species tree S possibilities for the numbers of coalescences of G that take place on the various branches of S. They have been introduced as a data structure for evaluating probabilities of gene tree topologies conditioning on species trees, reducing computation time compared to standard coalescent histories. When gene trees and species trees have a matching labeled topology , the compact coalescent histories of t are encoded by particular integer labelings of the branches of t, each integer specifying the number of coalescent events of G present in a branch of S. For matching gene trees and species trees, we investigate enumerative properties of compact coalescent histories. We report a recursion for the number of compact coalescent histories for matching gene trees and species trees, using it to study the numbers of compact coalescent histories for small trees. We show that the number of compact coalescent histories equals the number of coalescent histories if and only if the labeled topology is a caterpillar or a bicaterpillar. The number of compact coalescent histories is seen to increase with tree imbalance: we prove that as the number of taxa n increases, the exponential growth of the number of compact coalescent histories follows in the case of caterpillar or bicaterpillar labeled topologies and approximately and for lodgepole and balanced topologies, respectively. We prove that the mean number of compact coalescent histories of a labeled topology of size n selected uniformly at random grows with . Our results contribute to the analysis of the computational complexity of algorithms for computing gene tree probabilities, and to the combinatorial study of gene trees and species trees more generally.
引用
收藏
页码:155 / 188
页数:34
相关论文
共 50 条
  • [31] RELATIONSHIPS BETWEEN GENE TREES AND SPECIES TREES
    PAMILO, P
    NEI, M
    MOLECULAR BIOLOGY AND EVOLUTION, 1988, 5 (05) : 568 - 583
  • [32] Gene trees, species and species trees in the Ctenosaura palearis clade
    Pasachnik, Stesha A.
    Echternacht, Arthur C.
    Fitzpatrick, Benjamin M.
    CONSERVATION GENETICS, 2010, 11 (05) : 1767 - 1781
  • [33] Gene trees, species and species trees in the Ctenosaura palearis clade
    Stesha A. Pasachnik
    Arthur C. Echternacht
    Benjamin M. Fitzpatrick
    Conservation Genetics, 2010, 11 : 1767 - 1781
  • [34] BEST: Bayesian estimation of species trees under the coalescent model
    Liu, Liang
    BIOINFORMATICS, 2008, 24 (21) : 2542 - 2543
  • [35] Coalescent methods for estimating species trees from phylogenomic data
    Liu, Liang
    Wu, Shaoyuan
    Yu, Lili
    JOURNAL OF SYSTEMATICS AND EVOLUTION, 2015, 53 (05) : 380 - 390
  • [36] Gene trees, species trees, and systematics: A cladistic perspective
    Brower, AVZ
    DeSalle, R
    Vogler, A
    ANNUAL REVIEW OF ECOLOGY AND SYSTEMATICS, 1996, 27 : 423 - 450
  • [37] Discordance of species trees with their most likely gene trees
    Degnan, James H.
    Rosenberg, Noah A.
    PLOS GENETICS, 2006, 2 (05): : 762 - 768
  • [38] GENE TREES, SPECIES TREES AND THE SEGREGATION OF ANCESTRAL ALLELES
    HUDSON, RR
    GENETICS, 1992, 131 (02) : 509 - 512
  • [39] Enumeration and Coding of Compact Code Trees for Binary AIFV Codes
    Hashimoto, Kengo
    Iwata, Ken-ichi
    Yamamoto, Hirosuke
    2019 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2019, : 1527 - 1531
  • [40] The probability of topological concordance of gene trees and species trees
    Rosenberg, NA
    THEORETICAL POPULATION BIOLOGY, 2002, 61 (02) : 225 - 247