A Simulation Study to Examine the Information Content in Phylogenomic Data Sets under the Multispecies Coalescent Model

Cited by: 16
Authors
Huang, Jun [1 ,2 ]
Flouri, Tomas [1 ]
Yang, Ziheng [1 ]
Affiliations
[1] UCL, Dept Genet Evolut & Environm, London, England
[2] Beijing Jiaotong Univ, Dept Math, Beijing, Peoples R China
Funding
Biotechnology and Biological Sciences Research Council (BBSRC), UK;
Keywords
Bayesian inference; BPP; information content; multispecies coalescent; MSC; MSC with introgression; MSci; SPECIES TREE ESTIMATION; ANCESTRAL POPULATION SIZES; BAYESIAN-INFERENCE; EVOLUTIONARY RATE; SEQUENCE DATA; GENE TREES; IMPLEMENTATION; PROBABILITY; SYSTEMATICS; ALGORITHMS;
DOI
10.1093/molbev/msaa166
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
We use computer simulation to examine the information content in multilocus data sets for inference under the multispecies coalescent model. Inference problems considered include estimation of evolutionary parameters (such as species divergence times, population sizes, and cross-species introgression probabilities), species tree estimation, and species delimitation based on Bayesian comparison of delimitation models. We found that the number of loci is the most influential factor for almost all inference problems examined. Although the number of sequences per species does not appear to be important to species tree estimation, it is very influential to species delimitation. Increasing the number of sites and the per-site mutation rate both increase the mutation rate for the whole locus and these have the same effect on estimation of parameters, but the sequence length has a greater effect than the per-site mutation rate for species tree estimation. We discuss the computational costs when the data size increases and provide guidelines concerning the subsampling of genomic data to enable the application of full-likelihood methods of inference.
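The abstract's statement that the number of sites and the per-site mutation rate both raise the mutation rate for the whole locus can be illustrated with simple arithmetic. The sketch below is not taken from the paper: the function name `locus_mutation_rate` and the numeric values are hypothetical, and it assumes only that the locus-wide rate is the product of the number of sites and the per-site rate.

```python
import math


def locus_mutation_rate(n_sites: int, per_site_rate: float) -> float:
    """Locus-wide mutation rate, assumed to be n_sites * per_site_rate."""
    return n_sites * per_site_rate


# Hypothetical values chosen only for illustration.
base = locus_mutation_rate(n_sites=500, per_site_rate=1e-9)
longer = locus_mutation_rate(n_sites=1000, per_site_rate=1e-9)   # twice the sites
faster = locus_mutation_rate(n_sites=500, per_site_rate=2e-9)    # twice the rate

# Doubling either factor doubles the locus-wide rate by the same amount.
same_locus_rate = math.isclose(longer, faster)
doubled = math.isclose(longer, 2 * base)
print(same_locus_rate, doubled)
```

Under this assumption the two changes are interchangeable at the level of the whole locus, which matches the abstract's remark about parameter estimation. The abstract also reports that sequence length matters more than the per-site rate for species tree estimation, which this arithmetic does not capture.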
Pages: 3211-3224
Page count: 14
Related papers
50 in total
  • [1] A simulation study to examine the impact of recombination on phylogenomic inferences under the multispecies coalescent model
    Zhu, Tianqi
    Flouri, Tomas
    Yang, Ziheng
    [J]. MOLECULAR ECOLOGY, 2022, 31 (10) : 2814 - 2829
  • [2] The Multispecies Coalescent Model Outperforms Concatenation Across Diverse Phylogenomic Data Sets
    Jiang, Xiaodong
    Edwards, Scott V.
    Liu, Liang
    [J]. SYSTEMATIC BIOLOGY, 2020, 69 (04) : 795 - 812
  • [3] Short branch attraction in phylogenomic inference under the multispecies coalescent
    Liu, Liang
    Yu, Lili
    Wu, Shaoyuan
    Arnold, Jonathan
    Whalen, Christopher
    Davis, Charles
    Edwards, Scott
    [J]. FRONTIERS IN ECOLOGY AND EVOLUTION, 2023, 11
  • [4] A Bayesian Implementation of the Multispecies Coalescent Model with Introgression for Phylogenomic Analysis
    Flouri, Tomas
    Jiao, Xiyun
    Rannala, Bruce
    Yang, Ziheng
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2020, 37 (04) : 1211 - 1223
  • [5] Phase Resolution of Heterozygous Sites in Diploid Genomes is Important to Phylogenomic Analysis under the Multispecies Coalescent Model
    Huang, Jun
    Bennett, Jeremy
    Flouri, Tomas
    Leache, Adam D.
    Yang, Ziheng
    [J]. SYSTEMATIC BIOLOGY, 2022, 71 (02) : 334 - 352
  • [6] Challenges in Species Tree Estimation Under the Multispecies Coalescent Model
    Xu, Bo
    Yang, Ziheng
    [J]. GENETICS, 2016, 204 (04) : 1353 - 1368
  • [7] A Bayesian Implementation of the Multispecies Coalescent Model with Introgression for Phylogenomic Analysis (vol 37, pg 1211, 2020)
    Flouri, Tomas
    Jiao, Xiyun
    Rannala, Bruce
    Yang, Ziheng
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2022, 39 (11)
  • [8] Hierarchical Heuristic Species Delimitation Under the Multispecies Coalescent Model with Migration
    Kornai, Daniel
    Jiao, Xiyun
    Ji, Jiayi
    Flouri, Tomas
    Yang, Ziheng
    [J]. SYSTEMATIC BIOLOGY, 2024,
  • [9] A stochastic Farris transform for genetic data under the multispecies coalescent with applications to data requirements
    Dasarathy, Gautam
    Mossel, Elchanan
    Nowak, Robert
    Roch, Sebastien
    [J]. JOURNAL OF MATHEMATICAL BIOLOGY, 2022, 84 (05)
  • [10] Impact of Model Violations on the Inference of Species Boundaries Under the Multispecies Coalescent
    Barley, Anthony J.
    Brown, Jeremy M.
    Thomson, Robert C.
    [J]. SYSTEMATIC BIOLOGY, 2018, 67 (02) : 269 - 284