ProtoMap: automatic classification of protein sequences and hierarchy of protein families

被引:108
|
作者
Yona, G
Linial, N
Linial, M
机构
[1] Stanford Univ, Dept Biol Struct, Stanford, CA 94305 USA
[2] Hebrew Univ Jerusalem, Inst Comp Sci, IL-91904 Jerusalem, Israel
[3] Hebrew Univ Jerusalem, Inst Life Sci, Dept Biol Chem, IL-91904 Jerusalem, Israel
关键词
D O I
10.1093/nar/28.1.49
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The ProtoMap site offers an exhaustive classification of all proteins in the SWISS-PROT database, into groups of related proteins. The classification is based on analysis of all pairwise similarities among protein sequences, The analysis makes essential use of transitivity to identify homologies among proteins. Within each group of the classification, every two members are either directly or transitively related. However, transitivity is applied restrictively in order to prevent unrelated proteins from clustering together, The classification is done at different levels of confidence, and yields a hierarchical organization of all:proteins. The resulting classification splits the protein space into well-defined groups of proteins, which are closely correlated with natural biological families and superfamilies. Many clusters contain protein sequences that are not classified by other databases. The hierarchical organization suggested by our analysis may help in detecting finer subfamilies in families of known proteins. In addition it brings forth interesting relationships between protein families, upon which local maps for the neighborhood of protein families can be sketched. The ProtoMap web server can be accessed at http://www.protomap.cs.huji.ac.il.
引用
收藏
页码:49 / 55
页数:7
相关论文
共 50 条
  • [41] Identifying representative sequences of protein families using submodular optimization
    Nguyen, Ha
    Nguyen, Hung
    Nguyen, Phuong
    Luu, Anh N.
    Cantu, David C.
    Nguyen, Tin
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [42] On the Natural Structure of Amino Acid Patterns in Families of Protein Sequences
    Turjanski, Pablo
    Ferreiro, Diego U.
    JOURNAL OF PHYSICAL CHEMISTRY B, 2018, 122 (49): : 11295 - 11301
  • [43] An alignment-free method for classification of protein sequences
    Deshmukh, Sandeep
    Khaitan, Sanjeet
    Das, Debasish
    Gupta, Manish
    Wangikar, Pramod P.
    PROTEIN AND PEPTIDE LETTERS, 2007, 14 (07): : 647 - 657
  • [44] VECTOR QUANTIZATION KERNELS FOR THE CLASSIFICATION OF PROTEIN SEQUENCES AND STRUCTURES
    Clark, Wyatt T.
    Radivojac, Predrag
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2014, 2014, : 316 - 327
  • [45] An Efficient Computational Intelligence Technique for Classification of Protein Sequences
    Iqbal, Muhammad Javed
    Faye, Ibrahima
    Said, Abas Md
    Samir, Brahim Belhaouari
    2014 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCOINS), 2014,
  • [46] Moment Vector Encoding of Protein Sequences for Supervised Classification
    Altartouri, Haneen
    Glasmachers, Tobias
    PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 1005 : 25 - 35
  • [47] From the similarity analysis of protein cavities to the functional classification of protein families using Cavbase
    Kuhn, Daniel
    Weskamp, Nils
    Schmitt, Stefan
    Huellermeier, Eyke
    Klebe, Gerhard
    JOURNAL OF MOLECULAR BIOLOGY, 2006, 359 (04) : 1023 - 1044
  • [48] PeCoP: automatic determination of persistently conserved positions in protein families
    Friedberg, I
    Margalit, H
    BIOINFORMATICS, 2002, 18 (09) : 1276 - 1277
  • [49] Discovering protein function classification rules from reduced alphabet representations of protein sequences
    Andorf, CM
    Dobbs, DL
    Honavar, VG
    PROCEEDINGS OF THE 6TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2002, : 1200 - 1206
  • [50] A Simple Protein Evolutionary Classification Method Based on the Mutual Relations Between Protein Sequences
    Wan, Xiaogeng
    Tan, Xinying
    CURRENT BIOINFORMATICS, 2020, 15 (10) : 1113 - 1129