GenTHREADER: An efficient and reliable protein fold recognition method for genomic sequences

被引:699
|
作者
Jones, DT [1 ]
机构
[1] Univ Warwick, Dept Biol Sci, Coventry CV4 7AL, W Midlands, England
关键词
genome; protein structure prediction; fold recognition; threading; sequence alignment;
D O I
10.1006/jmbi.1999.2583
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A new protein fold recognition method is described which is both fast and reliable. The method uses a traditional sequence alignment algorithm to generate alignments which are then evaluated by a method derived from threading techniques. As a final step, each threaded model is evaluated by a neural network in order to produce a single measure of confidence in the proposed prediction. The speed of the method, along with its sensitivity and very low false-positive rate makes it ideal for automatically predicting the structure of all the proteins in a translated bacterial genome (proteome). The method has been applied to the genome of Mycoplasma genitalium, and analysis of the results shows that as many as 46% of the proteins derived from the predicted protein coding regions have a significant relationship to a protein of known structure. Ln some cases, however, only one domain of the protein can be predicted, giving a total coverage of 30 % when calculated as a fraction of the number of amino acid residues in the whole proteome. (C) 1999 Academic Press.
引用
收藏
页码:797 / 815
页数:19
相关论文
共 50 条
  • [21] Ensemble of classifiers for protein fold recognition
    Nanni, L
    NEUROCOMPUTING, 2006, 69 (7-9) : 850 - 853
  • [22] Cooperative approach for the protein fold recognition
    Ota, M
    Kawabata, T
    Kinjo, AR
    Nishikawa, K
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1999, : 126 - 132
  • [23] What are the baselines for protein fold recognition?
    McGuffin, LJ
    Bryson, K
    Jones, DT
    BIOINFORMATICS, 2001, 17 (01) : 63 - 72
  • [24] Fusion of classifiers for protein fold recognition
    Nanni, L
    NEUROCOMPUTING, 2005, 68 : 315 - 321
  • [25] A NEW APPROACH TO PROTEIN FOLD RECOGNITION
    JONES, DT
    TAYLOR, WR
    THORNTON, JM
    NATURE, 1992, 358 (6381) : 86 - 89
  • [26] Confidence measures for protein fold recognition
    Sommer, I
    Zien, A
    von Öhsen, N
    Zimmer, R
    Lengauer, T
    BIOINFORMATICS, 2002, 18 (06) : 802 - 812
  • [27] An Efficient Multi-class Support Vector Machine Classifier for Protein Fold Recognition
    Chmielnicki, Wieslaw
    Stapor, Katarzyna
    Roterman-Konieczna, Irena
    ADVANCES IN BIOINFORMATICS, 2010, 74 : 77 - 84
  • [28] Efficient computation of absent words in genomic sequences
    Herold, Julia
    Kurtz, Stefan
    Giegerich, Robert
    BMC BIOINFORMATICS, 2008, 9 (1)
  • [29] Efficient computation of absent words in genomic sequences
    Julia Herold
    Stefan Kurtz
    Robert Giegerich
    BMC Bioinformatics, 9
  • [30] DELIMINATE-a fast and efficient method for loss-less compression of genomic sequences
    Mohammed, Monzoorul Haque
    Dutta, Anirban
    Bose, Tungadri
    Chadaram, Sudha
    Mande, Sharmila S.
    BIOINFORMATICS, 2012, 28 (19) : 2527 - 2529