Development and benchmarking of TASSERiter for the iterative improvement of protein structure predictions

Cited by: 7
Authors
Lee, Seung Yup [1]
Skolnick, Jeffrey [1]
Affiliations
[1] Georgia Inst Technol, Ctr Study Syst Biol, Atlanta, GA 30318 USA
Keywords
TASSER(iter); protein structure prediction; TASSER; protein structure refinement; threading;
DOI
10.1002/prot.21440
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification code
071010 ; 081704 ;
Abstract
To improve the accuracy of TASSER models, especially in the limit where threading-provided template alignments are of poor quality, we have developed the TASSER(iter) algorithm, which uses the templates and contact restraints from TASSER-generated models for iterative structure refinement. We apply TASSER(iter) to a large benchmark set of 2,773 nonhomologous single-domain proteins that are at most 200 residues in length and that cover the PDB at the level of 35% pairwise sequence identity. Overall, TASSER(iter) models have a smaller global average RMSD of 5.48 angstrom, compared to 5.81 angstrom for the original TASSER models. Targets are classified by level of prediction difficulty: Easy targets have a good template with a corresponding good threading alignment, Medium targets have a good template but a poor alignment, and Hard targets have an incorrectly identified template. TASSER(iter) (TASSER) models have an average RMSD of 4.15 angstrom (4.35 angstrom) for the Easy set and 9.05 angstrom (9.52 angstrom) for the Hard set. The largest reduction in average RMSD is for the Medium set, where the TASSER(iter) models have an average global RMSD of 5.67 angstrom compared to 6.72 angstrom for the TASSER models. Seventy percent of the Medium set TASSER(iter) models have a smaller RMSD than the TASSER models, while 63% of the Easy and 60% of the Hard TASSER models are improved by TASSER(iter). For the foldable cases, where the targets have an RMSD to the native <6.5 angstrom, TASSER(iter) shows a clear improvement over TASSER models: for the Medium set, it improves the success rate from 57.0 to 67.2%, followed by the Hard targets, where the success rate improves from 32.0 to 34.8%, with the smallest improvement in the Easy targets, from 82.6 to 84.0%. These results suggest that TASSER(iter) can provide more reliable predictions for targets of Medium difficulty, a range that had resisted improvement in the quality of protein structure predictions.
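The three summary statistics quoted in the abstract (average RMSD, the fraction of targets whose RMSD decreases, and the success rate under the 6.5 angstrom foldability cutoff) can be illustrated with a minimal sketch. This is not the authors' evaluation code: the function name `summarize_rmsd` and the toy RMSD values are hypothetical, and only the 6.5 angstrom cutoff is taken from the abstract.

```python
from statistics import mean

# Foldability cutoff quoted in the abstract (RMSD to native, in angstrom).
FOLDABLE_RMSD_CUTOFF = 6.5


def summarize_rmsd(baseline, refined, cutoff=FOLDABLE_RMSD_CUTOFF):
    """Summarize paired per-target RMSD values (hypothetical helper).

    baseline: RMSD of the original models, one value per target.
    refined:  RMSD of the refined models for the same targets, same order.
    """
    if not baseline or len(baseline) != len(refined):
        raise ValueError("need two non-empty RMSD lists of equal length")
    n = len(baseline)
    return {
        "mean_baseline": mean(baseline),
        "mean_refined": mean(refined),
        # Fraction of targets whose refined model has a strictly smaller RMSD.
        "fraction_improved": sum(r < b for b, r in zip(baseline, refined)) / n,
        # Success rate: fraction of targets with RMSD below the cutoff.
        "success_baseline": sum(b < cutoff for b in baseline) / n,
        "success_refined": sum(r < cutoff for r in refined) / n,
    }


# Toy values for illustration only; these are not data from the paper.
baseline = [4.0, 7.0, 6.0, 10.0]
refined = [3.5, 6.0, 6.2, 9.0]
stats = summarize_rmsd(baseline, refined)
for name, value in stats.items():
    print(f"{name}: {value:.3f}")
```

Applying such a summary separately to the Easy, Medium, and Hard subsets would give per-difficulty figures of the kind reported above, assuming the per-target RMSD values were available.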
Pages: 39-47
Page count: 9