Assessing what is needed to resolve a molecular phylogeny: simulations and empirical data from emydid turtles

Cited by: 49
Authors
Spinks, Phillip Q. [1, 2]
Thomson, Robert C. [1, 2]
Lovely, Geoff A. [1]
Shaffer, H. Bradley [1, 2]
Affiliations
[1] Dept Ecol & Evolut, Davis, CA USA
[2] Univ Calif Davis, Ctr Populat Biol, Davis, CA 95616 USA
Source
Funding
National Science Foundation (USA);
Keywords
DNA-SEQUENCE; GENE TREES; TAXA; CHARACTERS; EVOLUTION; NUCLEOTIDES; SYSTEMATICS; RESOLUTION; PARSIMONY; INFERENCE;
DOI
10.1186/1471-2148-9-56
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Background: Phylogenies often contain both well-supported and poorly supported nodes. Determining how much additional data might be required to eventually recover most or all nodes with high support is an important pragmatic goal, and simulations have been used to examine this question. Most simulations have been based on few empirical loci and suggest that well-supported phylogenies can be determined with a very modest amount of data. Here we report the results of an empirical phylogenetic analysis of all 10 genera and 25 of 48 species of the New World pond turtles (family Emydidae) based on one mitochondrial locus (1070 base pairs) and seven nuclear loci (5961 base pairs), and a more biologically realistic simulation analysis incorporating variation among gene trees, aimed at determining how much more data might be necessary to recover weakly supported nodes with strong support.
Results: Our mitochondrial-based phylogeny was well resolved and congruent with some previous mitochondrial results. For example, all genera, and all species except Pseudemys concinna, P. peninsularis, and Terrapene carolina, were monophyletic with strong support from at least one analytical method. The Emydinae was recovered as monophyletic, but the Deirochelyinae was not. Based on nuclear data, all genera except Trachemys were monophyletic with strong support, and all species except Graptemys pseudogeographica, P. concinna, T. carolina, and T. coahuila were monophyletic, generally with strong support. However, the branches subtending most genera were relatively short, and intergeneric relationships within subfamilies were mostly unsupported. Our simulations showed that relatively high bootstrap support values (i.e., ≥ 70) for all nodes were reached in all datasets, but an increase in data did not necessarily equate to an increase in support values. However, simulations based on a single empirical locus reached higher overall levels of support with less data than did the simulations based on all seven empirical nuclear loci, and symmetric tree distances were much lower for single-gene than for multiple-gene simulation analyses.
Conclusion: Our empirical results provide new insights into the phylogenetics of the Emydidae, but the short branches recovered deep in the tree also indicate the need for additional work on this clade to recover all intergeneric relationships with confidence and to delimit species for some problematic groups. Our simulation results suggest that moderate amounts of data (in the few-to-tens of kb range) are necessary to recover most emydid relationships with high support values. They also suggest that previous simulations that do not incorporate among-gene-tree topological variance probably underestimate the amount of data needed to recover well-supported phylogenies.
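The symmetric tree distance mentioned in the Results counts the groupings that appear in one tree but not in the other. The following is a minimal sketch of that idea, assuming rooted trees written as nested tuples and hypothetical taxon labels A to E (none of these topologies come from the study). The function names are illustrative, and the abstract does not say which software the authors used.

```python
def clades(tree):
    """Return the non-trivial clades of a rooted tree as frozensets of leaf names.

    A tree is either a leaf name (str) or a tuple of subtrees.
    """
    found = set()

    def leaves(node):
        # A leaf contributes only itself and is not recorded as a clade.
        if isinstance(node, str):
            return frozenset([node])
        # An internal node's clade is the union of its children's leaves.
        members = frozenset().union(*(leaves(child) for child in node))
        found.add(members)
        return members

    all_leaves = leaves(tree)
    # The root clade (all taxa) is shared by every tree on the same taxa.
    found.discard(all_leaves)
    return found


def symmetric_distance(tree_a, tree_b):
    """Count clades present in exactly one of the two trees."""
    return len(clades(tree_a) ^ clades(tree_b))


# Hypothetical five-taxon topologies, for illustration only.
tree_1 = ((("A", "B"), "C"), ("D", "E"))
tree_2 = ((("A", "C"), "B"), ("D", "E"))
tree_3 = (("A", "B"), ("C", ("D", "E")))

print(symmetric_distance(tree_1, tree_1))
print(symmetric_distance(tree_1, tree_2))
print(symmetric_distance(tree_1, tree_3))
```

A distance of zero means the two trees imply the same set of clades; larger values mean more conflicting groupings. For unrooted trees the same comparison is usually made on bipartitions rather than clades.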
Pages: 17