Estimating species trees using multiple-allele DNA sequence data

被引:166
|
作者
Liu, Liang [1 ]
Pearl, Dennis K. [2 ]
Brumfield, Robb T. [3 ,4 ]
Edwards, Scott V. [1 ]
机构
[1] Harvard Univ, Museum Comparat Zool, Cambridge, MA 02138 USA
[2] Ohio State Univ, Dept Stat, Columbus, OH 43210 USA
[3] Louisiana State Univ, Museum Nat Sci, Baton Rouge, LA 70803 USA
[4] Louisiana State Univ, Dept Biol Sci, Baton Rouge, LA 70803 USA
关键词
Bayesian hierarchical model; coalescent theory; gene tree; species tree;
D O I
10.1111/j.1558-5646.2008.00414.x
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Several techniques, such as concatenation and consensus methods, are available for combining data from multiple loci to produce a single statement of phylogenetic relationships. However, when multiple alleles are sampled from individual species, it becomes more challenging to estimate relationships at the level of species, either because concatenation becomes inappropriate due to conflicts among individual gene trees, or because the species from which multiple alleles have been sampled may not form monophyletic groups in the estimated tree. We propose a Bayesian hierarchical model to reconstruct species trees from multipleallele, multilocus sequence data, building on a recently proposed method for estimating species trees from single allele multilocus data. A two-step Markov Chain Monte Carlo (MCMC) algorithm is adopted to estimate the posterior distribution of the species tree. The model is applied to estimate the posterior distribution of species trees for two multiple-allele datasets-yeast (Saccharomyces) and birds (Manacus-manakins). The estimates of the species trees using our method are consistent with those inferred from other methods and genetic markers, but in contrast to other species tree methods, it provides credible regions for the species tree. The Bayesian approach described here provides a powerful framework for statistical testing and integration of population genetics and phylogenetics.
引用
收藏
页码:2080 / 2091
页数:12
相关论文
共 50 条
  • [1] Unguided Species Delimitation Using DNA Sequence Data from Multiple Loci
    Yang, Ziheng
    Rannala, Bruce
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2014, 31 (12) : 3125 - 3135
  • [2] ESTIMATING NUCLEOTIDE SUBSTITUTION RATES USING DNA-SEQUENCE DATA
    KAPLAN, N
    [J]. BIOMETRICS, 1981, 37 (03) : 614 - 614
  • [3] A METHOD FOR ESTIMATING RATES OF NUCLEOTIDE SUBSTITUTION USING DNA-SEQUENCE DATA
    KAPLAN, N
    RISKO, K
    [J]. THEORETICAL POPULATION BIOLOGY, 1982, 21 (03) : 318 - 328
  • [4] Fast individual ancestry inference from DNA sequence data leveraging allele frequencies for multiple populations
    Vikas Bansal
    Ondrej Libiger
    [J]. BMC Bioinformatics, 16
  • [5] Fast individual ancestry inference from DNA sequence data leveraging allele frequencies for multiple populations
    Bansal, Vikas
    Libiger, Ondrej
    [J]. BMC BIOINFORMATICS, 2015, 16
  • [6] Estimating species trees using approximate Bayesian computation
    Fan, Helen Hang
    Kubatko, Laura S.
    [J]. MOLECULAR PHYLOGENETICS AND EVOLUTION, 2011, 59 (02) : 354 - 363
  • [7] Coalescent methods for estimating species trees from phylogenomic data
    Liu, Liang
    Wu, Shaoyuan
    Yu, Lili
    [J]. JOURNAL OF SYSTEMATICS AND EVOLUTION, 2015, 53 (05) : 380 - 390
  • [8] Heuristic optimization for global species clustering of DNA sequence data from multiple loci
    Chesters, Douglas
    Yu, Fang
    Cao, Huan-Xi
    Dai, Qing-Yan
    Wu, Qing-Tao
    Shi, Weifeng
    Zheng, Weimin
    Zhu, Chao-Dong
    [J]. METHODS IN ECOLOGY AND EVOLUTION, 2013, 4 (10): : 961 - 970
  • [9] EVALUATION OF THE RESTRICTED MAXIMUM-LIKELIHOOD METHOD FOR ESTIMATING PHYLOGENETIC TREES USING SIMULATED ALLELE-FREQUENCY DATA
    ROHLF, FJ
    WOOTEN, MC
    [J]. EVOLUTION, 1988, 42 (03) : 581 - 595
  • [10] Mutation parameters from DNA sequence data using graph theoretic measures on lineage trees
    Magori-Cohen, Reuma
    Louzoun, Yoram
    Kleinstein, Steven H.
    [J]. BIOINFORMATICS, 2006, 22 (14) : E332 - E340