Prediction of protein secondary structure by mining structural fragment database

Cited by: 20
Authors
Cheng, HT
Sen, TZ
Kloczkowski, A
Margaritis, D
Jernigan, RL
Affiliations
[1] Iowa State Univ, LH Baker Ctr Bioinformat & Biol Stat, Dept Biochem Biophys & Mol Biol, Ames, IA 50011 USA
[2] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
Keywords
secondary structure; sequence; cut-off
DOI
10.1016/j.polymer.2005.02.040
Chinese Library Classification (CLC) number
O63 [Polymer chemistry (high polymers)]
Subject classification codes
070305; 080501; 081704
Abstract
A new method for predicting protein secondary structure from amino acid sequence has been developed. The method is based on multiple sequence alignment, using BLAST, of the query sequence against all sequences of known structure in the Protein Data Bank (PDB). The fragments of the alignments that belong to proteins from the PDB are then used for further analysis. We studied various schemes for assigning weights to matching segments and calculated normalized scores to predict one of three secondary structure states: alpha-helix, beta-sheet, or coil. We applied several artificial intelligence techniques, namely decision trees (DT), neural networks (NN), and support vector machines (SVM), to improve the accuracy of the predictions, and found that SVM gave the best performance. Preliminary data show that combining the fragment mining approach with GOR V (Kloczkowski et al., Proteins 49 (2002) 154-166) for regions of low sequence similarity improves the prediction accuracy. © 2005 Elsevier Ltd. All rights reserved.
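As a rough illustration of the idea of weighting matching segments and normalizing scores, the following minimal sketch accumulates a weighted vote per residue over aligned fragments and picks the highest-scoring state. It is not the authors' implementation: the `Fragment` record, the use of a single `weight` per fragment, the `predict_from_fragments` function, and the coil fallback for uncovered residues are all assumptions made for illustration. The weighting schemes and the SVM step studied in the paper are not reproduced here.

```python
from collections import namedtuple

# Hypothetical record for one aligned PDB fragment: where it starts on the
# query, its known secondary structure string, and an alignment weight.
Fragment = namedtuple("Fragment", ["start", "states", "weight"])

STATES = "HEC"  # alpha-helix, beta-sheet, coil


def predict_from_fragments(query_length, fragments, fallback="C"):
    """Weighted per-residue vote over aligned fragments (illustrative only)."""
    # One score accumulator per query position and state.
    scores = [{s: 0.0 for s in STATES} for _ in range(query_length)]
    for frag in fragments:
        for offset, state in enumerate(frag.states):
            pos = frag.start + offset
            if 0 <= pos < query_length and state in STATES:
                scores[pos][state] += frag.weight

    prediction = []
    for pos_scores in scores:
        total = sum(pos_scores.values())
        if total == 0.0:
            # No fragment covers this residue. The abstract suggests a
            # separate predictor (GOR V) for such regions; this sketch
            # simply emits a fixed fallback state instead.
            prediction.append(fallback)
            continue
        # Normalize so the three scores at this position sum to 1.
        normalized = {s: v / total for s, v in pos_scores.items()}
        prediction.append(max(STATES, key=lambda s: normalized[s]))
    return "".join(prediction)


example_fragments = [
    Fragment(start=0, states="HHHH", weight=2.0),
    Fragment(start=2, states="EEEE", weight=1.0),
    Fragment(start=3, states="EEE", weight=1.5),
]
example_prediction = predict_from_fragments(8, example_fragments)
```

In this toy input, the helix fragment dominates the first three positions, the two overlapping sheet fragments outweigh it from the fourth position on, and the last two residues have no coverage and receive the fallback state.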
Pages: 4314-4321
Number of pages: 8