Assessing what is needed to resolve a molecular phylogeny: simulations and empirical data from emydid turtles

被引:49
|
作者
Spinks, Phillip Q. [1 ,2 ]
Thomson, Robert C. [1 ,2 ]
Lovely, Geoff A. [1 ]
Shaffer, H. Bradley [1 ,2 ]
机构
[1] Dept Ecol & Evolut, Davis, CA USA
[2] Univ Calif Davis, Ctr Populat Biol, Davis, CA 95616 USA
来源
基金
美国国家科学基金会;
关键词
DNA-SEQUENCE; GENE TREES; TAXA; CHARACTERS; EVOLUTION; NUCLEOTIDES; SYSTEMATICS; RESOLUTION; PARSIMONY; INFERENCE;
D O I
10.1186/1471-2148-9-56
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Phylogenies often contain both well-supported and poorly supported nodes. Determining how much additional data might be required to eventually recover most or all nodes with high support is an important pragmatic goal, and simulations have been used to examine this question. Most simulations have been based on few empirical loci, and suggest that well supported phylogenies can be determined with a very modest amount of data. Here we report the results of an empirical phylogenetic analysis of all 10 genera and 25 of 48 species of the new world pond turtles (family Emydidae) based on one mitochondrial (1070 base pairs) and seven nuclear loci (5961 base pairs), and a more biologically realistic simulation analysis incorporating variation among gene trees, aimed at determining how much more data might be necessary to recover weakly-supported nodes with strong support. Results: Our mitochondrial-based phylogeny was well resolved, and congruent with some previous mitochondrial results. For example, all genera, and all species except Pseudemys concinna, P. peninsularis, and Terrapene carolina were monophyletic with strong support from at least one analytical method. The Emydinae was recovered as monophyletic, but the Deirochelyinae was not. Based on nuclear data, all genera were monophyletic with strong support except Trachemys, and all species except Graptemys pseudogeographica, P. concinna, T. carolina, and T. coahuila were monophyletic, generally with strong support. However, the branches subtending most genera were relatively short, and intergeneric relationships within subfamilies were mostly unsupported. Our simulations showed that relatively high bootstrap support values (i.e. >= 70) for all nodes were reached in all datasets, but an increase in data did not necessarily equate to an increase in support values. However, simulations based on a single empirical locus reached higher overall levels of support with less data than did the simulations that were based on all seven empirical nuclear loci, and symmetric tree distances were much lower for single versus multiple gene simulation analyses. Conclusion: Our empirical results provide new insights into the phylogenetics of the Emydidae, but the short branches recovered deep in the tree also indicate the need for additional work on this clade to recover all intergeneric relationships with confidence and to delimit species for some problematic groups. Our simulation results suggest that moderate (in the few-to-tens of kb range) amounts of data are necessary to recover most emydid relationships with high support values. They also suggest that previous simulations that do not incorporate among-gene tree topological variance probably underestimate the amount of data needed to recover well supported phylogenies.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Assessing what is needed to resolve a molecular phylogeny: simulations and empirical data from emydid turtles
    Phillip Q Spinks
    Robert C Thomson
    Geoff A Lovely
    H Bradley Shaffer
    BMC Evolutionary Biology, 9
  • [2] In Silico AFLP: An Application to Assess What Is Needed to Resolve a Phylogeny
    Jesus Garcia-Pereira, Maria
    Caballero, Armando
    Quesada, Humberto
    ADVANCES IN BIOINFORMATICS, 2010, 74 : 137 - 141
  • [3] How much data are needed to resolve a difficult phylogeny? Case study in Lamiales
    Wortley, AH
    Rudall, PJ
    Harris, DJ
    Scotland, RW
    SYSTEMATIC BIOLOGY, 2005, 54 (05) : 697 - 709
  • [4] A large phylogeny of turtles (Testudines) using molecular data
    Guillon, Jean-Michel
    Guery, Lorelei
    Hulin, Vincent
    Girondot, Marc
    CONTRIBUTIONS TO ZOOLOGY, 2012, 81 (03) : 147 - 158
  • [5] Estimating the phylogeny of geoemydid turtles (Cryptodira) from landmark data: an assessment of different methods
    Ascarrunz, Eduardo
    Claude, Julien
    Joyce, Walter G.
    PEERJ, 2019, 7
  • [6] Sharing Data from Molecular Simulations
    Abraham, Mark
    Apostolov, Rossen
    Barnoud, Jonathan
    Bauer, Paul
    Blau, Christian
    Bonvin, Alexandre M. J. J.
    Chavent, Matthieu
    Chodera, John
    Condic-Jurkic, Karmen
    Delemotte, Lucie
    Grubmueller, Helmut
    Howard, Rebecca J.
    Jordan, E. Joseph
    Lindahl, Erik
    Ollila, O. H. Samuli
    Selent, Jana
    Smith, Daniel G. A.
    Stansfeld, Phillip J.
    Tiemann, Johanna K. S.
    Trellet, Mikael
    Woods, Christopher
    Zhmurov, Artem
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (10) : 4093 - 4099
  • [7] Phylogeny of the Falconidae inferred from molecular and morphological data
    Griffiths, CS
    AUK, 1999, 116 (01): : 116 - 130
  • [8] Phylogeny of Euonymus inferred from molecular and morphological data
    Li, Yan-Nan
    Xie, Lei
    Li, Jin-Yu
    Zhang, Zhi-Xiang
    JOURNAL OF SYSTEMATICS AND EVOLUTION, 2014, 52 (02) : 149 - 160
  • [9] The phylogeny of Alyssum (Brassicaceae) inferred from molecular data
    Li, Yan
    Feng, Ying
    Lv, Guanghui
    Liu, Bin
    Qi, Aladaer
    NORDIC JOURNAL OF BOTANY, 2015, 33 (06) : 715 - 721
  • [10] ASSESSING IMPLICIT ATTITUDES: WHAT CAN BE LEARNED FROM SIMULATIONS?
    Quek, Boon-Kiat
    Ortony, Andrew
    SOCIAL COGNITION, 2012, 30 (05) : 610 - 630