Analysis of phylogenomic datasets reveals conflict, concordance, and gene duplications with examples from animals and plants

被引:291
|
作者
Smith, Stephen A. [1 ]
Moore, Michael J. [2 ]
Brown, Joseph W. [1 ]
Yang, Ya [1 ]
机构
[1] Univ Michigan, Dept Ecol & Evolutionary Biol, Ann Arbor, MI 48109 USA
[2] Oberlin Coll, Dept Biol, Oberlin, OH 44074 USA
来源
BMC EVOLUTIONARY BIOLOGY | 2015年 / 15卷
基金
美国国家科学基金会;
关键词
Phylogenomics; Incomplete lineage sorting; Transcriptome; Gene tree conflict; Gene duplication; EVOLUTIONARY RELATIONSHIPS; SPECIES TREES; DRAFT GENOME; CARYOPHYLLALES; INCONGRUENCE; COALESCENT; DIVERSIFICATION; PHYLOGENETICS; DIVERGENCES; INSIGHTS;
D O I
10.1186/s12862-015-0423-0
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The use of transcriptomic and genomic datasets for phylogenetic reconstruction has become increasingly common as researchers attempt to resolve recalcitrant nodes with increasing amounts of data. The large size and complexity of these datasets introduce significant phylogenetic noise and conflict into subsequent analyses. The sources of conflict may include hybridization, incomplete lineage sorting, or horizontal gene transfer, and may vary across the phylogeny. For phylogenetic analysis, this noise and conflict has been accommodated in one of several ways: by binning gene regions into subsets to isolate consistent phylogenetic signal; by using gene-tree methods for reconstruction, where conflict is presumed to be explained by incomplete lineage sorting (ILS); or through concatenation, where noise is presumed to be the dominant source of conflict. The results provided herein emphasize that analysis of individual homologous gene regions can greatly improve our understanding of the underlying conflict within these datasets. Results: Here we examined two published transcriptomic datasets, the angiosperm group Caryophyllales and the aculeate Hymenoptera, for the presence of conflict, concordance, and gene duplications in individual homologs across the phylogeny. We found significant conflict throughout the phylogeny in both datasets and in particular along the backbone. While some nodes in each phylogeny showed patterns of conflict similar to what might be expected with ILS alone, the backbone nodes also exhibited low levels of phylogenetic signal. In addition, certain nodes, especially in the Caryophyllales, had highly elevated levels of strongly supported conflict that cannot be explained by ILS alone. Conclusion: This study demonstrates that phylogenetic signal is highly variable in phylogenomic data sampled across related species and poses challenges when conducting species tree analyses on large genomic and transcriptomic datasets. Further insight into the conflict and processes underlying these complex datasets is necessary to improve and develop adequate models for sequence analysis and downstream applications. To aid this effort, we developed the open source software phyparts (https://bitbucket.org/blackrim/phyparts), which calculates unique, conflicting, and concordant bipartitions, maps gene duplications, and outputs summary statistics such as internode certainy (ICA) scores and node-specific counts of gene duplications.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Phylogenomic Analysis of the α Proteasome Gene Family from Early-Diverging Eukaryotes
    Juan L. Bouzatrid="*"rid="†"
    Leslie K. McNeilrid="*"
    Hugh M. Robertson
    Leellen F. Solter
    Julie E. Nixon
    Jonathan E. Beever
    H. Rex Gaskins
    Gary Olsen
    Shankar Subramaniam
    Mitchell L. Sogin
    Harris A. Lewin
    [J]. Journal of Molecular Evolution, 2000, 51 : 532 - 543
  • [32] Phylogenomic analysis of vertebrate thrombospondins reveals fish-specific paralogues, ancestral gene relationships and a tetrapod innovation
    McKenzie, Patrick
    Chadalavada, Seetharam C.
    Bohrer, Justin
    Adams, Josephine C.
    [J]. BMC EVOLUTIONARY BIOLOGY, 2006, 6 (1)
  • [33] | Evolutionary analysis of the LORELEI gene family in plants reveals regulatory subfunctionalization
    Noble, Jennifer A.
    Bielski, Nicholas, V
    Liu, Ming-Che James
    DeFalco, Thomas A.
    Stegmann, Martin
    Nelson, Andrew D. L.
    McNamara, Kara
    Sullivan, Brooke
    Dinh, Khanhlinh K.
    Khuu, Nicholas
    Hancock, Sarah
    Shiu, Shin-Han
    Zipfel, Cyril
    Cheung, Alice Y.
    Beilstein, Mark A.
    Palanivelu, Ravishankar
    [J]. PLANT PHYSIOLOGY, 2022, 190 (04) : 2539 - 2556
  • [34] Phylogenomic analysis of vertebrate thrombospondins reveals fish-specific paralogues, ancestral gene relationships and a tetrapod innovation
    Patrick McKenzie
    Seetharam C Chadalavada
    Justin Bohrer
    Josephine C Adams
    [J]. BMC Evolutionary Biology, 6
  • [35] Analysis of evolution of carbonic anhydrases IV and XV reveals a rich history of gene duplications and a new group of isozymes
    Tolvanen, Martti E. E.
    Ortutay, Csaba
    Barker, Harlan R.
    Aspatwar, Ashok
    Patrikainen, Maarit
    Parkkila, Seppo
    [J]. BIOORGANIC & MEDICINAL CHEMISTRY, 2013, 21 (06) : 1503 - 1510
  • [36] A haplotype-level analysis reveals adaptive polymorphic gene duplications in humans affecting pigmentation and hair morphology
    Saitou, Marie
    Gokcumen, Omer
    [J]. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY, 2019, 168 : 214 - 214
  • [37] Phylogenomic Synteny Network Analysis of MADS-Box Transcription Factor Genes Reveals Lineage-Specific Transpositions, Ancient Tandem Duplications, and Deep Positional Conservation
    Zhao, Tao
    Holmer, Rens
    de Bruijn, Suzanne
    Angenent, Gerco C.
    van den Burg, Harrold A.
    Schranz, M. Eric
    [J]. PLANT CELL, 2017, 29 (06): : 1278 - 1292
  • [38] Phylogenetic analysis of three complete gap junction gene families reveals lineage-specific duplications and highly supported gene classes
    Eastman, SD
    Chen, THP
    Falk, MM
    Mendelson, TC
    Iovine, MK
    [J]. GENOMICS, 2006, 87 (02) : 265 - 274
  • [39] Phylogenomic Analysis Reveals Dynamic Evolutionary History of the Drosophila Heterochromatin Protein 1 (HP1) Gene Family
    Levine, Mia T.
    McCoy, Connor
    Vermaak, Danielle
    Lee, Yuh Chwen G.
    Hiatt, Mary Alice
    Matsen, Frederick A.
    Malik, Harmit S.
    [J]. PLOS GENETICS, 2012, 8 (06):
  • [40] Cell signalling and gene regulation signalling mechanisms in plants: examples from the present and the future
    Coupland, G
    Monguio, SP
    [J]. CURRENT OPINION IN PLANT BIOLOGY, 2005, 8 (05) : 457 - 461