The impact of GC bias on phylogenetic accuracy using targeted enrichment phylogenomic data

被引:41
|
作者
Bossert, Silas [1 ]
Murray, Elizabeth A. [1 ]
Blaimer, Bonnie B. [2 ]
Danforth, Bryan N. [1 ]
机构
[1] Cornell Univ, Dept Entomol, Ithaca, NY 14853 USA
[2] Smithsonian Inst, Natl Museum Nat Hist, Dept Entomol, Washington, DC 20560 USA
基金
美国国家科学基金会;
关键词
Gene tree incongruence; Ultraconserved elements; GC biased gene conversion; Gene trees; Sociality; Corbiculates; ULTRACONSERVED ELEMENTS; GENE CONVERSION; RECOMBINATION; EVOLUTION; LINEAGE; INCONGRUENCE; HYMENOPTERA; PARSIMONY; SEQUENCES; THOUSANDS;
D O I
10.1016/j.ympev.2017.03.022
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The field of sequence based phylogenetic analyses is currently being transformed by novel hybrid-based targeted enrichment methods, such as the use of ultraconserved elements (UCEs). Rather than analyzing relationships among organisms using a small number of genes, these methods now allow us to evaluate relationships with many hundreds to thousands of individual gene loci. However, the inclusion of thousands of loci does not necessarily overcome the long-standing challenge of incongruence among phylogenetic trees derived from different genes or gene regions. One factor that impacts the level of incongruence in phylogenomic data sets is the level of GC bias. GC rich gene regions are prone to higher recombination rates than AT rich regions, driven by a process referred to as "GC biased gene conversion". As a result, high GC content can be negatively associated with phylogenetic accuracy, but the extent to which this impacts incongruence among UCEs is currently unstudied. We investigated the impact of GC content on phylogeny reconstruction using in silico captured UCE data for the corbiculate bees (Hymenoptera: Apidae). The phylogeny of this group has been the subject of extensive study, and incongruence among gene trees is thought to be a source of phylogenetic error. We conducted coalescent- and concatenation-based analyses of 810 individual gene loci from all 13 currently available bee genomes, including 8 corbiculate taxa. Both coalescent- and concatenation-based methods converged on a single topology for the corbiculate tribes. In contrast to concatenation, the coalescent-based methods revealed significant topological conflict at nodes involving the orchid bees (Euglossini) and honeybees (Apini). Partitioning the loci by GC content reveals decreasing support for the inferred topology with increasing GC bias. Based on the results of this study, we report the first evidence that GC biased gene conversion may contribute to topological incongruence in studies based on ultraconserved elements. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:149 / 157
页数:9
相关论文
共 43 条
  • [1] Evaluating Fast Maximum Likelihood-Based Phylogenetic Programs Using Empirical Phylogenomic Data Sets
    Zhou, Xiaofan
    Shen, Xing-Xing
    Hittinger, Chris Todd
    Rokas, Antonis
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2018, 35 (02) : 486 - 503
  • [2] The impact of probe sample bias on the accuracy of commercial floating car data speeds
    Bruwer, Megan M.
    Walker, Ian
    Andersen, Simen J.
    [J]. TRANSPORTATION PLANNING AND TECHNOLOGY, 2022, 45 (08) : 611 - 628
  • [3] The accuracy of crime statistics: assessing the impact of police data bias on geographic crime analysis
    Buil-Gil, David
    Moretti, Angelo
    Langton, Samuel H.
    [J]. JOURNAL OF EXPERIMENTAL CRIMINOLOGY, 2022, 18 (03) : 515 - 541
  • [4] The accuracy of crime statistics: assessing the impact of police data bias on geographic crime analysis
    David Buil-Gil
    Angelo Moretti
    Samuel H. Langton
    [J]. Journal of Experimental Criminology, 2022, 18 : 515 - 541
  • [5] Phylogenomic analysis of brachyuran crabs using transcriptome data reveals possible sources of conflicting phylogenetic relationships within the group
    Pan, Da
    Sun, Yunlong
    Shi, Boyang
    Wang, Ruxiao
    Ng, Peter K. L.
    Guinot, Daniele
    Cumberlidge, Neil
    Sun, Hongying
    [J]. MOLECULAR PHYLOGENETICS AND EVOLUTION, 2024, 201
  • [6] A phylogenomic perspective on gene tree conflict and character evolution in Caprifoliaceae using target enrichment data, with Zabelioideae recognized as a new subfamily
    Wang, Hong-Xin
    Morales-Briones, Diego F.
    Moore, Michael J.
    Wen, Jun
    Wang, Hua-Feng
    [J]. JOURNAL OF SYSTEMATICS AND EVOLUTION, 2021, 59 (05) : 897 - 914
  • [7] Using targeted enrichment of nuclear genes to increase phylogenetic resolution in the neotropical rain forest genus Inga (Leguminosae: Mimosoideae)
    Nicholls, James A.
    Pennington, R. Toby
    Koenen, Erik J. M.
    Hughes, Colin E.
    Hearn, Jack
    Bunnefeld, Lynsey
    Dexter, Kyle G.
    Stone, Graham N.
    Kidner, Catherine A.
    [J]. FRONTIERS IN PLANT SCIENCE, 2015, 6
  • [8] IMPACT OF USING OLDER DATA ON THE ACCURACY OF CARDIOVASCULAR RISK SCORES
    Sussman, Jeremy
    Wiitala, Wyndy L.
    Levine, Deborah A.
    Bentley, Douglas R.
    Youles, Bradley
    Hofer, TImothy
    Hayward, Rodney A.
    [J]. JOURNAL OF GENERAL INTERNAL MEDICINE, 2018, 33 : S230 - S231
  • [9] The Impact of Targeted Data Collection on Nonresponse Bias in an Establishment Survey: A Simulation Study of Adaptive Survey Design
    McCarthy, Jaki
    Wagner, James
    Sanders, Herschel Lisette
    [J]. JOURNAL OF OFFICIAL STATISTICS, 2017, 33 (03) : 857 - 871
  • [10] Patching rainfall data using regression methods .2. Comparisons of accuracy, bias and efficiency
    Makhuvha, T
    Pegram, G
    Sparks, R
    Zucchini, W
    [J]. JOURNAL OF HYDROLOGY, 1997, 198 (1-4) : 308 - 318