ProtoMap: automatic classification of protein sequences and hierarchy of protein families

Cited by: 108
Authors
Yona, G
Linial, N
Linial, M
Affiliations
[1] Stanford Univ, Dept Biol Struct, Stanford, CA 94305 USA
[2] Hebrew Univ Jerusalem, Inst Comp Sci, IL-91904 Jerusalem, Israel
[3] Hebrew Univ Jerusalem, Inst Life Sci, Dept Biol Chem, IL-91904 Jerusalem, Israel
DOI
10.1093/nar/28.1.49
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification code
071010; 081704
Abstract
The ProtoMap site offers an exhaustive classification of all proteins in the SWISS-PROT database into groups of related proteins. The classification is based on analysis of all pairwise similarities among protein sequences. The analysis makes essential use of transitivity to identify homologies among proteins. Within each group of the classification, every two members are either directly or transitively related. However, transitivity is applied restrictively in order to prevent unrelated proteins from clustering together. The classification is done at different levels of confidence, and yields a hierarchical organization of all proteins. The resulting classification splits the protein space into well-defined groups of proteins, which are closely correlated with natural biological families and superfamilies. Many clusters contain protein sequences that are not classified by other databases. The hierarchical organization suggested by our analysis may help in detecting finer subfamilies in families of known proteins. In addition, it brings forth interesting relationships between protein families, upon which local maps for the neighborhood of protein families can be sketched. The ProtoMap web server can be accessed at http://www.protomap.cs.huji.ac.il.
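The idea of grouping proteins by direct or transitive similarity at several confidence levels can be illustrated with a minimal sketch. This is not the ProtoMap algorithm: the abstract does not specify how transitivity is restricted, so the sketch below uses plain connected components (unrestricted transitivity) at each threshold. The function names, the use of E-value-like scores, the threshold values, and the toy data are all assumptions made for illustration.

```python
from collections import defaultdict


def cluster_at_threshold(proteins, scores, max_score):
    """Return connected components, linking a pair only if its
    (hypothetical) E-value-like score is at or below max_score."""
    parent = {p: p for p in proteins}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for (a, b), score in scores.items():
        if score <= max_score:
            root_a, root_b = find(a), find(b)
            if root_a != root_b:
                parent[root_b] = root_a

    groups = defaultdict(set)
    for p in proteins:
        groups[find(p)].add(p)
    return sorted(sorted(g) for g in groups.values())


def hierarchy(proteins, scores, thresholds):
    """Cluster at each threshold, from strictest to most permissive.
    Clusters at a stricter level are nested inside those at looser levels."""
    return {t: cluster_at_threshold(proteins, scores, t)
            for t in sorted(thresholds)}


# Toy data (invented): lower score means stronger similarity.
proteins = ["A", "B", "C", "D", "E"]
scores = {
    ("A", "B"): 1e-50,
    ("B", "C"): 1e-20,
    ("C", "D"): 1e-3,
    ("D", "E"): 1e-40,
}
levels = hierarchy(proteins, scores, [1e-30, 1e-10, 1e-1])
for threshold, clusters in levels.items():
    print(threshold, clusters)
```

At the strictest level only the strongest pairs are grouped; relaxing the threshold merges groups through chains of weaker similarities. The most permissive level merges everything into one cluster, which shows why an unrestricted transitive closure would need the kind of restriction the abstract mentions.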
Pages: 49-55
Page count: 7
Related papers
50 in total
  • [31] Automatic Classification of Enzyme Family in Protein Annotation
    dos Santos, Cassia T.
    Bazzan, Ana L. C.
    Lemke, Ney
    ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, PROCEEDINGS, 2009, 5676 : 86 - +
  • [32] Automatic classification of protein functions from the literature
    Blaschke, C
    Valencia, A
    COMPARATIVE AND FUNCTIONAL GENOMICS, 2003, 4 (01): : 75 - 79
  • [33] Automatic Classification of Protein Sequences into Structure/Function Groups via Parallel Cascade Identification: A Feasibility Study
    Michael J. Korenberg
    Robert David
    Ian W. Hunter
    Jerry E. Solomon
    Annals of Biomedical Engineering, 2000, 28 : 803 - 811
  • [34] Automatic classification of protein sequences into structure/function groups via parallel cascade identification: A feasibility study
    Korenberg, MJ
    David, R
    Hunter, IW
    Solomon, JE
    ANNALS OF BIOMEDICAL ENGINEERING, 2000, 28 (07) : 803 - 811
  • [35] Evolutionary hierarchy of vertebrate-like heterotrimeric G protein families
    Krishnan, Arunkumar
    Mustafa, Arshi
    Almen, Markus Sallman
    Fredriksson, Robert
    Williams, Michael J.
    Schioth, Helgi B.
    MOLECULAR PHYLOGENETICS AND EVOLUTION, 2015, 91 : 27 - 40
  • [36] Identification of latent periodicity in amino acid sequences of protein families
    Turutina, VP
    Laskin, AA
    Kudryashov, NA
    Skryabin, KG
    Korotkov, EV
    BIOCHEMISTRY-MOSCOW, 2006, 71 (01) : 18 - 31
  • [37] UPSEC: An algorithm for classifying unaligned protein sequences into functional families
    Ma, Patrick C. H.
    Chan, Keith C. C.
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2008, 15 (04) : 431 - 443
  • [38] IMAGES OF PROTEIN FAMILIES FOR COMPARISON WITH AMINO-ACID-SEQUENCES
    BACHINSKII, AG
    KULICHKOV, VA
    YARYGIN, AA
    MOLECULAR BIOLOGY, 1994, 28 (04) : 608 - 613
  • [39] Identification of latent periodicity in amino acid sequences of protein families
    V. P. Turutina
    A. A. Laskin
    N. A. Kudryashov
    K. G. Skryabin
    E. V. Korotkov
    Biochemistry (Moscow), 2006, 71 : 18 - 31
  • [40] Exploring microbial genome sequences to identify protein families on the grid
    Sun, Yudong
    Wipat, Anil
    Pocock, Matthew
    Lee, Peter A.
    Flanagan, Keith
    Worthington, James T.
    IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2007, 11 (04): : 435 - 442