Improving protein secondary structure prediction with aligned homologous sequences

被引:0
|
作者
DiFrancesco, V
Garnier, J
Munson, PJ
机构
[1] NIH, DIV COMP RES & TECHNOL,STRUCT BIOL LAB, ANALYT BIOSTAT SECT,12 S MSC 5626, BETHESDA, MD 20892 USA
[2] NIH, FOGARTY INT CTR, BETHESDA, MD 20892 USA
关键词
prediction; protein secondary structure; sequence alignments;
D O I
暂无
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Most recent protein secondary structure prediction methods use sequence alignments to improve the prediction quality. We investigate the relationship between the location of secondary structural elements, gaps, and variable residue positions in multiple sequence alignments. We further investigate how these relationships compare with those found in structurally aligned protein families. We show how such associations may be used to improve the quality of prediction of the secondary structure elements, using the Quadratic-Logistic method with profiles. Furthermore, we analyze the extent to which the number of homologous sequences influences the quality of prediction. The analysis of variable residue positions shows that surprisingly, helical regions exhibit greater variability than do coil regions, which are generally thought to be the most common secondary structure elements in loops. However, the correlation between variability and the presence of helices does not significantly improve prediction quality. Gaps are a distinct signal for coil regions. Increasing the coil propensity for those residues occurring in gap regions enhances the overall prediction quality. Prediction accuracy increases initially with the number of homologues, but changes negligibly as the number of homologues exceeds about 14. The alignment quality affects the prediction more than other factors, hence a careful selection and alignment of even a small number of homologues can lead to significant improvements in prediction accuracy.
引用
收藏
页码:106 / 113
页数:8
相关论文
共 50 条
  • [1] USE OF HOMOLOGOUS SEQUENCES TO IMPROVE PROTEIN SECONDARY STRUCTURE PREDICTION
    NIERMANN, T
    KIRSCHNER, K
    [J]. METHODS IN ENZYMOLOGY, 1991, 202 : 45 - 59
  • [2] Secondary structure prediction for aligned RNA sequences
    Hofacker, IL
    Fekete, M
    Stadler, PF
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2002, 319 (05) : 1059 - 1066
  • [3] PREDICTION OF PROTEIN SECONDARY STRUCTURE AND ACTIVE-SITES USING THE ALIGNMENT OF HOMOLOGOUS SEQUENCES
    ZVELEBIL, MJ
    BARTON, GJ
    TAYLOR, WR
    STERNBERG, MJE
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1987, 195 (04) : 957 - 961
  • [4] Improving the accuracy of predicting secondary structure for aligned RNA sequences
    Hamada, Michiaki
    Sato, Kengo
    Asai, Kiyoshi
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 (02) : 393 - 402
  • [5] Improving protein secondary structure predictions by prediction fusion
    Palopoli, Luigi
    Rombo, Simona E.
    Terracina, Giorgio
    Tradigo, Giuseppe
    Veltri, Pierangelo
    [J]. INFORMATION FUSION, 2009, 10 (03) : 217 - 232
  • [6] BIOSITE - A PROGRAM FOR THE INTERACTIVE COMPARISON OF ALIGNED HOMOLOGOUS PROTEIN SEQUENCES
    COCKCROFT, VB
    PEDERSEN, JT
    LUNT, GG
    OSGUTHORPE, DJ
    [J]. COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1992, 8 (01): : 71 - 73
  • [7] Prediction of protein secondary structure content for the twilight zone sequences
    Homaeian, Leila
    Kurgan, Lukasz A.
    Ruan, Jishou
    Cios, Krzysztof J.
    Chen, Ke
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2007, 69 (03) : 486 - 498
  • [8] A SIMPLE AND FAST APPROACH TO PREDICTION OF PROTEIN SECONDARY STRUCTURE FROM MULTIPLY ALIGNED SEQUENCES WITH ACCURACY ABOVE 70-PERCENT
    MEHTA, PK
    HERINGA, J
    ARGOS, P
    [J]. PROTEIN SCIENCE, 1995, 4 (12) : 2517 - 2525
  • [9] The estimation of relative site variability among aligned homologous protein sequences
    Horner, DS
    Pesole, G
    [J]. BIOINFORMATICS, 2003, 19 (05) : 600 - 606
  • [10] Improving protein secondary-structure prediction by predicting ends of secondary-structure segments
    Midic, U
    Dunker, AK
    Obradovic, Z
    [J]. Proceedings of the 2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, 2005, : 490 - 497