Template-based recognition of protein fold within the midnight and twilight zones of protein sequence similarity

被引:3
|
作者
Pirun, M
Babnigg, G
Stevens, FJ [1 ]
机构
[1] Argonne Natl Lab, Biosci Div, Argonne, IL 60439 USA
[2] Univ Illinois, Dept Bioengn, Chicago, IL 60607 USA
关键词
homology; fold recognition; evolution; statistical insignificance;
D O I
10.1002/jmr.728
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Most homologous pairs of proteins have no significant sequence similarity to each other and are not identified by direct sequence comparison or profile-based strategies. However, multiple sequence alignments of low similarity homologues typically reveal a limited number of positions that are well conserved despite diversity of function. It may be inferred that conservation at most of these positions is the result of the importance of the contribution of these amino acids to the folding and stability of the protein. As such, these amino acids and their relative positions may define a structural signature. We demonstrate that extraction of this fold template provides the basis for the sequence database to be searched for patterns consistent with the fold, enabling identification of homologs that are not recognized by global sequence analysis. The fold template method was developed to address the need for a tool that could comprehensively search the midnight and twilight zones of protein sequence similarity without reliance on global statistical significance. Manual implementations of the fold template method were performed on three folds-immunoglobulin, c-lectin and TIM barrel. Following proof of concept of the template method, an automated version of the approach was developed. This automated fold template method was used to develop fold templates for 10 of the more populated folds in the SCOP database. The fold template method developed three-dimensional structural motifs or signatures that were able to return a diverse collection of proteins, while maintaining a low false positive rate. Although the results of the manual fold template method were more comprehensive than the automated fold template method, the diversity of the results from the automated fold template method surpassed those of current methods that rely on statistical significance to infer evolutionary relationships among divergent proteins. Copyright (c) 2004 John Wiley & Sons, Ltd.
引用
收藏
页码:203 / 212
页数:10
相关论文
共 50 条
  • [11] Protein Fold Recognition by Combining Support Vector Machines and Pairwise Sequence Similarity Scores
    Yan, Ke
    Wen, Jie
    Liu, Jin-Xing
    Xu, Yong
    Liu, Bin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (05) : 2008 - 2016
  • [12] Deep template-based protein structure prediction
    Wu, Fandi
    Xu, Jinbo
    PLOS COMPUTATIONAL BIOLOGY, 2021, 17 (05)
  • [13] Sequence alignment using machine learning for accurate template-based protein structure prediction
    Makigaki, Shuichiro
    Ishida, Takashi
    BIOINFORMATICS, 2020, 36 (01) : 104 - 111
  • [14] Template-based identification of protein-protein interfaces using eFindSitePPI
    Maheshwari, Surabhi
    Brylinski, Michal
    METHODS, 2016, 93 : 64 - 71
  • [15] Sequence Alignment Using Machine Learning for Accurate Template-based Protein Structure Prediction
    Makigaki, Shuichiro
    Ishida, Takashi
    BIO-PROTOCOL, 2020, 10 (09):
  • [16] Context similarity scoring improves protein sequence alignments in the midnight zone
    Meier, Armin
    Soeding, Johannes
    BIOINFORMATICS, 2015, 31 (05) : 674 - 681
  • [17] An ensemble approach to protein fold classification by integration of template-based assignment and support vector machine classifier
    Xia, Jiaqi
    Peng, Zhenling
    Qi, Dawei
    Mu, Hongbo
    Yang, Jianyi
    BIOINFORMATICS, 2017, 33 (06) : 863 - 870
  • [18] Evaluating template-based and template-free protein-protein complex structure prediction
    Vreven, Thom
    Hwang, Howook
    Pierce, Brian G.
    Weng, Zhiping
    BRIEFINGS IN BIOINFORMATICS, 2014, 15 (02) : 169 - 176
  • [19] Template-based prediction of protein structure with deep learning
    Zhang, Haicang
    Shen, Yufeng
    BMC GENOMICS, 2020, 21 (Suppl 11)
  • [20] Template-based prediction of protein structure with deep learning
    Haicang Zhang
    Yufeng Shen
    BMC Genomics, 21