Improving protein tertiary structure prediction by deep learning and distance prediction in CASP14

被引:18
|
作者
Liu, Jian [1 ]
Wu, Tianqi [1 ]
Guo, Zhiye [1 ]
Hou, Jie [2 ]
Cheng, Jianlin [1 ]
机构
[1] Univ Missouri, Dept Elect Engn & Comp Sci, Columbia, MO 65211 USA
[2] St Louis Univ, Dept Comp Sci, St Louis, MO USA
关键词
inter-residue distance prediction; protein quality assessment; protein structure prediction; QUALITY ASSESSMENT; WEB SERVER; MODELS;
D O I
10.1002/prot.26186
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Substantial progresses in protein structure prediction have been made by utilizing deep-learning and residue-residue distance prediction since CASP13. Inspired by the advances, we improve our CASP14 MULTICOM protein structure prediction system by incorporating three new components: (a) a new deep learning-based protein inter-residue distance predictor to improve template-free (ab initio) tertiary structure prediction, (b) an enhanced template-based tertiary structure prediction method, and (c) distance-based model quality assessment methods empowered by deep learning. In the 2020 CASP14 experiment, MULTICOM predictor was ranked seventh out of 146 predictors in tertiary structure prediction and ranked third out of 136 predictors in inter-domain structure prediction. The results demonstrate that the template-free modeling based on deep learning and residue-residue distance prediction can predict the correct topology for almost all template-based modeling targets and a majority of hard targets (template-free targets or targets whose templates cannot be recognized), which is a significant improvement over the CASP13 MULTICOM predictor. Moreover, the template-free modeling performs better than the template-based modeling on not only hard targets but also the targets that have homologous templates. The performance of the template-free modeling largely depends on the accuracy of distance prediction closely related to the quality of multiple sequence alignments. The structural model quality assessment works well on targets for which enough good models can be predicted, but it may perform poorly when only a few good models are predicted for a hard target and the distribution of model quality scores is highly skewed. MULTICOM is available at and .
引用
收藏
页码:58 / 72
页数:15
相关论文
共 50 条
  • [1] Improving deep learning-based protein distance prediction in CASP14
    Guo, Zhiye
    Wu, Tianqi
    Liu, Jian
    Hou, Jie
    Cheng, Jianlin
    [J]. BIOINFORMATICS, 2021, 37 (19) : 3190 - 3196
  • [2] Protein tertiary structure prediction and refinement using deep learning and Rosetta in CASP14
    Anishchenko, Ivan
    Baek, Minkyung
    Park, Hahnbeom
    Hiranuma, Naozumi
    Kim, David E.
    Dauparas, Justas
    Mansoor, Sanaa
    Humphreys, Ian R.
    Baker, David
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2021, 89 (12) : 1722 - 1733
  • [3] Protein structure prediction using deep learning distance and hydrogen-bonding restraints in CASP14
    Zheng, Wei
    Li, Yang
    Zhang, Chengxin
    Zhou, Xiaogen
    Pearce, Robin
    Bell, Eric W.
    Huang, Xiaoqiang
    Zhang, Yang
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2021, 89 (12) : 1734 - 1751
  • [4] High-accuracy protein structure prediction in CASP14
    Pereira, Joana
    Simpkin, Adam J.
    Hartmann, Marcus D.
    Rigden, Daniel J.
    Keegan, Ronan M.
    Lupas, Andrei N.
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2021, 89 (12) : 1687 - 1699
  • [5] Protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in CASP14
    Xiao Chen
    Jian Liu
    Zhiye Guo
    Tianqi Wu
    Jie Hou
    Jianlin Cheng
    [J]. Scientific Reports, 11
  • [6] Protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in CASP14
    Chen, Xiao
    Liu, Jian
    Guo, Zhiye
    Wu, Tianqi
    Hou, Jie
    Cheng, Jianlin
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [7] Protein oligomer structure prediction using GALAXY in CASP14
    Park, Taeyong
    Woo, Hyeonuk
    Yang, Jinsol
    Kwon, Sohee
    Won, Jonghun
    Seok, Chaok
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2021, 89 (12) : 1844 - 1851
  • [8] Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13
    Hou, Jie
    Wu, Tianqi
    Cao, Renzhi
    Cheng, Jianlin
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2019, 87 (12) : 1165 - 1178
  • [9] Protein Tertiary Structure Modeling Driven by Deep Learning and Contact Distance Prediction in CASP13
    Cheng, Jianlin
    [J]. ACM-BCB'19: PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND HEALTH INFORMATICS, 2019, : 551 - 551
  • [10] Modeling of protein complexes in CASP14 with emphasis on the interaction interface prediction
    Dapkunas, Justas
    Olechnovic, Kliment
    Venclovas, Ceslovas
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2021, 89 (12) : 1834 - 1843