OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups

Cited by: 638
Authors
Chen, Feng
Mackey, Aaron J.
Stoeckert, Christian J., Jr.
Roos, David S. [1]
Affiliations
[1] Univ Penn, Dept Chem, Philadelphia, PA 19104 USA
[2] Univ Penn, Dept Biol, Philadelphia, PA 19104 USA
[3] Univ Penn, Dept Genet, Ctr Bioinformat, Penn Genom Inst, Philadelphia, PA 19104 USA
DOI
10.1093/nar/gkj123
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The OrthoMCL database (http://orthomcl.cbil.upenn.edu) houses ortholog group predictions for 55 species, including 16 bacterial and 4 archaeal genomes representing phylogenetically diverse lineages, and most currently available complete eukaryotic genomes: 24 unikonts (12 animals, 9 fungi, microsporidium, Dictyostelium, Entamoeba), 4 plants/algae and 7 apicomplexan parasites. OrthoMCL software was used to cluster proteins based on sequence similarity, using an all-against-all BLAST search of each species' proteome, followed by normalization of inter-species differences, and Markov clustering. A total of 511,797 proteins (81.6% of the total dataset) were clustered into 70,388 ortholog groups. The ortholog database may be queried based on protein or group accession numbers, keyword descriptions or BLAST similarity. Ortholog groups exhibiting specific phyletic patterns may also be identified, using either a graphical interface or a text-based Phyletic Pattern Expression grammar. Information for ortholog groups includes the phyletic profile, the list of member proteins and a multiple sequence alignment, a statistical summary and graphical view of similarities, and a graphical representation of domain architecture. OrthoMCL software, the entire FASTA dataset employed and clustering results are available for download. OrthoMCL-DB provides a centralized warehouse for orthology prediction among multiple species, and will be updated and expanded as additional genome sequence data become available.
Pages: D363-D368
Page count: 6
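The counts quoted in the abstract can be cross-checked with a little arithmetic. The sketch below is not part of the OrthoMCL software; it only re-adds the species breakdown and derives two figures the abstract does not state directly (an approximate size for the full protein dataset and the mean number of proteins per ortholog group). It assumes that "microsporidium", "Dictyostelium" and "Entamoeba" each count as a single species, and that 81.6% is a rounded value, so the derived dataset size is only an estimate.

```python
# Figures quoted in the abstract.
bacteria, archaea = 16, 4
plants_algae, apicomplexa = 4, 7

# Assumption: the three named unikont lineages contribute one species each.
unikont_parts = {
    "animals": 12,
    "fungi": 9,
    "microsporidium": 1,
    "Dictyostelium": 1,
    "Entamoeba": 1,
}
unikonts = sum(unikont_parts.values())
species_total = bacteria + archaea + unikonts + plants_algae + apicomplexa

clustered_proteins = 511_797
clustered_fraction = 0.816  # rounded percentage from the abstract
ortholog_groups = 70_388

# Derived values (not stated in the abstract; estimates only).
estimated_total_proteins = round(clustered_proteins / clustered_fraction)
mean_group_size = clustered_proteins / ortholog_groups

print(f"unikonts: {unikonts}, species in total: {species_total}")
print(f"estimated proteins in the full dataset: {estimated_total_proteins:,}")
print(f"mean proteins per ortholog group: {mean_group_size:.2f}")
```

Under these assumptions the breakdown sums to the 24 unikonts and 55 species given in the abstract, the full dataset comes out at roughly 627,000 proteins, and an average group holds about seven proteins.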