A New Orthology Assessment Method for Phylogenomic Data: Unrooted Phylogenetic Orthology

被引:47
|
作者
Ballesteros, Jesus A. [1 ]
Hormiga, Gustavo [1 ]
机构
[1] George Washington Univ, Dept Biol Sci, Washington, DC 20052 USA
基金
美国国家科学基金会;
关键词
Markov cluster; protein homology; spiders; Araneae; transcriptomics; genomics; COMPARATIVE TRANSCRIPTOMICS; QUALITY ASSESSMENT; TREE; EVOLUTION; GENOMICS; SYSTEMATICS; PREDICTION; SEQUENCES; ALGORITHM; ALIGNMENT;
D O I
10.1093/molbev/msw069
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Current sequencing technologies are making available unprecedented amounts of genetic data for a large variety of species including nonmodel organisms. Although many phylogenomic surveys spend considerable time finding orthologs from the wealth of sequence data, these results do not transcend the original study and after being processed for specific phylogenetic purposes these orthologs do not become stable orthology hypotheses. We describe a procedure to detect and document the phylogenetic distribution of orthologs allowing researchers to use this information to guide selection of loci best suited to test specific evolutionary questions. At the core of this pipeline is a new phylogenetic orthology method that is neither affected by the position of the root nor requires explicit assignment of outgroups. We discuss the properties of this new orthology assessment method and exemplify its utility for phylogenomics using a small insects dataset. In addition, we exemplify the pipeline to identify and document stable orthologs for the group of orb-weaving spiders (Araneoidea) using RNAseq data. The scripts used in this study, along with sample files and additional documentation, are available at https://github.com/ballesterus/UPhO.
引用
收藏
页码:2117 / 2134
页数:18
相关论文
共 50 条
  • [21] OrthoPhyl-streamlining large-scale, orthology-based phylogenomic studies of bacteria at broad evolutionary scales
    Middlebrook, Earl A.
    Katani, Robab
    Fair, Jeanne M.
    [J]. G3-GENES GENOMES GENETICS, 2024, 14 (08):
  • [22] OrthoClust: an orthology-based network framework for clustering data across multiple species
    Koon-Kiu Yan
    Daifeng Wang
    Joel Rozowsky
    Henry Zheng
    Chao Cheng
    Mark Gerstein
    [J]. Genome Biology, 15
  • [23] A Novel Method for Predicting Essential Proteins Based on Subcellular Localization, Orthology and PPI Networks
    Li, Gaoshi
    Li, Min
    Wang, Jianxin
    Pan, Yi
    [J]. BIOINFORMATICS RESEARCH AND APPLICATIONS (ISBRA 2015), 2015, 9096 : 427 - 428
  • [24] OrthoClust: an orthology-based network framework for clustering data across multiple species
    Yan, Koon-Kiu
    Wang, Daifeng
    Rozowsky, Joel
    Zheng, Henry
    Cheng, Chao
    Gerstein, Mark
    [J]. GENOME BIOLOGY, 2014, 15 (08): : R100
  • [25] MetaPhOrs: orthology and paralogy predictions from multiple phylogenetic evidence using a consistency-based confidence score
    Pryszcz, Leszek P.
    Huerta-Cepas, Jaime
    Gabaldon, Toni
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 (05) : e32
  • [26] TreeFam v9: a new website, more species and orthology-on-the-fly
    Schreiber, Fabian
    Patricio, Mateus
    Muffato, Matthieu
    Pignatelli, Miguel
    Bateman, Alex
    [J]. NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) : D922 - D925
  • [27] Iteration method for predicting essential proteins based on orthology and protein-protein interaction networks
    Peng, Wei
    Wang, Jianxin
    Wang, Weiping
    Liu, Qing
    Wu, Fang-Xiang
    Pan, Yi
    [J]. BMC SYSTEMS BIOLOGY, 2012, 6
  • [28] Phylogenetic position of nemertea derived from phylogenomic data
    Struck, Torsten H.
    Fisse, Frauke
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2008, 25 (04) : 728 - 736
  • [29] Species-agnostic transfer learning for cross-species transcriptomics data integration without gene orthology
    Park, Youngjun
    Muttray, Nils P.
    Hauschild, Anne-Christin
    [J]. BRIEFINGS IN BIOINFORMATICS, 2024, 25 (02)
  • [30] Ortho-proteogenomics: Multiple proteomes investigation through orthology and a new MS-based protocol
    Gallien, Sebastien
    Perrodou, Emmanuel
    Carapito, Christine
    Deshayes, Caroline
    Reyrat, Jean-Marc
    Van Dorsselaer, Alain
    Poch, Olivier
    Schaeffer, Christine
    Lecompte, Odile
    [J]. GENOME RESEARCH, 2009, 19 (01) : 128 - 135