Prediction of protein secondary structure by mining structural fragment database

Cited by: 20
Authors
Cheng, HT
Sen, TZ
Kloczkowski, A
Margaritis, D
Jernigan, RL
Affiliations
[1] Iowa State Univ, LH Baker Ctr Bioinformat & Biol Stat, Dept Biochem Biophys & Mol Biol, Ames, IA 50011 USA
[2] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
Keywords
secondary structure; sequence; cut-off
DOI
10.1016/j.polymer.2005.02.040
Chinese Library Classification (CLC) number
O63 [Polymer chemistry (high polymers)]
Discipline classification codes
070305; 080501; 081704
Abstract
A new method for predicting protein secondary structure from amino acid sequence has been developed. The method is based on multiple sequence alignment, performed with BLAST, of the query sequence against all sequences of known structure in the Protein Data Bank (PDB). The fragments of the alignments that belong to PDB proteins are then used for further analysis. We studied various schemes for assigning weights to matching segments and calculated normalized scores to predict one of three secondary structure states: alpha-helix, beta-sheet, or coil. We applied several artificial intelligence techniques, namely decision trees (DT), neural networks (NN), and support vector machines (SVM), to improve prediction accuracy, and found that SVM gave the best performance. Preliminary data show that combining the fragment mining approach with GOR V (Kloczkowski et al., Proteins 49 (2002) 154-166) for regions of low sequence similarity improves prediction accuracy. (c) 2005 Elsevier Ltd. All rights reserved.
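The weighted-fragment scoring described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `predict_from_fragments`, the `(start, states, weight)` fragment format, and the simple additive weighting are all assumptions made for illustration, since the abstract does not specify the weighting schemes that were tested. Each aligned fragment votes for a state (H = helix, E = strand, C = coil) at every query position it covers, the votes are normalized per position, and the highest-scoring state is predicted.

```python
STATES = ("H", "E", "C")  # helix, strand, coil


def predict_from_fragments(query_length, fragments):
    """Predict a secondary structure string from weighted fragment votes.

    fragments: iterable of (start, states, weight) tuples (hypothetical
    format), where `start` is the 0-based query position of the fragment,
    `states` is its known secondary structure string, and `weight` is a
    non-negative alignment-derived weight.
    """
    scores = [dict.fromkeys(STATES, 0.0) for _ in range(query_length)]
    for start, states, weight in fragments:
        for offset, state in enumerate(states):
            pos = start + offset
            if 0 <= pos < query_length and state in STATES:
                scores[pos][state] += weight

    prediction = []
    normalized = []
    for position_scores in scores:
        total = sum(position_scores.values())
        if total == 0:
            # No fragment covers this position; mark it so a separate
            # predictor could fill it in (an assumption of this sketch).
            normalized.append(None)
            prediction.append("-")
        else:
            norm = {s: v / total for s, v in position_scores.items()}
            normalized.append(norm)
            # Ties are broken by the order of STATES.
            prediction.append(max(STATES, key=norm.get))
    return "".join(prediction), normalized


if __name__ == "__main__":
    example = [(0, "HHHH", 2.0), (2, "EEEE", 1.0), (2, "EE", 0.5)]
    predicted, per_position = predict_from_fragments(8, example)
    print(predicted)
```

Positions marked `-` have no fragment coverage. In the setting the abstract describes, such low-similarity regions are where a sequence-based method like GOR V would be combined with the fragment approach.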
Pages: 4314-4321
Page count: 8
Related papers
50 in total
  • [31] Extraction of Prediction Rules: Protein Secondary Structure Prediction
    Muhamud, Ahmed I.
    Abdelhalim, M. B.
    Mabrouk, Mai S.
    2014 10TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO), 2014, : 21 - 25
  • [32] Critical Features of Fragment Libraries for Protein Structure Prediction
    Trevizani, Raphael
    Custodio, Fabio Lima
    dos Santos, Karina Baptista
    Dardenne, Laurent Emmanuel
    PLOS ONE, 2017, 12 (01):
  • [33] Protein structure mining using a structural alphabet
    Tyagi, M.
    De Brevern, A. G.
    Srinivasan, N.
    Offmann, B.
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 71 (02) : 920 - 937
  • [34] Neuro-fuzzy structural classification of proteins for improved protein secondary structure prediction
    Hering, JA
    Innocent, PR
    Haris, PI
    PROTEOMICS, 2003, 3 (08) : 1464 - 1475
  • [35] Sequence Representation and Prediction of Protein Secondary Structure for Structural Motifs in Twilight Zone Proteins
    Lukasz Kurgan
    Kanaka Durga Kedarisetti
    The Protein Journal, 2006, 25 : 463 - 474
  • [36] Sequence representation and prediction of protein secondary structure for structural motifs in twilight zone proteins
    Kurgan, Lukasz
    Kedarisetti, Kanaka Durga
    PROTEIN JOURNAL, 2006, 25 (7-8): : 463 - 474
  • [37] Protein structure prediction and structural genomics
    Baker, D
    Sali, A
    SCIENCE, 2001, 294 (5540) : 93 - 96
  • [38] The BAD project: data mining, database and prediction of protein adsorption on surfaces
    Vasina, Elena N.
    Paszek, Ewa
    Nicolau, Dan V., Jr.
    Nicolau, Dan V.
    LAB ON A CHIP, 2009, 9 (07) : 891 - 900
  • [39] Single Chain Fragment Variable (scFv) Secondary Structure Prediction and Evaluation
    Mahgoub, I. O.
    Ali, A. M.
    Hamid, M.
    Alitheen, N. M.
    FASEB JOURNAL, 2011, 25
  • [40] Impact of protein dynamics on secondary structure prediction
    de Brevern, Alexandre G.
    BIOCHIMIE, 2020, 179 : 14 - 22