Improved protein structure prediction by deep learning irrespective of co-evolution information

被引:117
|
作者
Xu, Jinbo [1 ]
McPartlon, Matthew [1 ,2 ]
Li, Jin [1 ,2 ]
机构
[1] Toyota Technol Inst, Chicago, IL 60637 USA
[2] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
RESIDUE-RESIDUE CONTACTS; COMPUTATIONAL DESIGN; SEQUENCE;
D O I
10.1038/s42256-021-00348-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predicting the tertiary structure of a protein from its primary sequence has been greatly improved by integrating deep learning and co-evolutionary analysis, as shown in CASP13 and CASP14. We describe our latest study of this idea, analysing the efficacy of network size and co-evolution data and its performance on both natural and designed proteins. We show that a large ResNet (convolutional residual neural networks) can predict structures of correct folds for 26 out of 32 CASP13 free-modelling targets and L/5 long-range contacts with precision over 80%. When co-evolution is not used, ResNet can still predict structures of correct folds for 18 CASP13 free-modelling targets, greatly exceeding previous methods that do not use co-evolution either. Even with only the primary sequence, ResNet can predict the structures of correct folds for all tested human-designed proteins. In addition, ResNet may fare better for the designed proteins when trained without co-evolution than with co-evolution. These results suggest that ResNet does not simply de-noise co-evolution signals, but instead may learn important protein sequence-structure relationships. This has important implications for protein design and engineering, especially when co-evolutionary data are unavailable. In the last few years, computational protein structure prediction has greatly advanced by combining deep learning including convolutional residual networks (ResNet) with co-evolution data. A new study finds that using deeper and wider ResNets improves predictions in the absence of co-evolution information, suggesting that the ResNets do not not simply de-noise co-evolution signals, but instead may learn important protein sequence-structure relationships.
引用
收藏
页码:601 / +
页数:10
相关论文
共 50 条
  • [1] Improved protein structure prediction by deep learning irrespective of co-evolution information
    Jinbo Xu
    Matthew McPartlon
    Jin Li
    [J]. Nature Machine Intelligence, 2021, 3 : 601 - 609
  • [2] Membrane protein contact and structure prediction using co-evolution in conjunction with machine learning
    Teixeira, Pedro L.
    Mendenhall, Jeff L.
    Heinze, Sten
    Weiner, Brian
    Skwark, Marcin J.
    Meiler, Jens
    [J]. PLOS ONE, 2017, 12 (05):
  • [3] Inserting Co-evolution Information from Contact Maps into a Multiobjective Genetic Algorithm for Protein Structure Prediction
    Rocha, Gregorio K.
    dos Santos, Karina B.
    Angelo, Jaqueline S.
    Custodio, Fabio L.
    Barbosa, Helio J. C.
    Dardenne, Laurent E.
    [J]. 2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 957 - 964
  • [4] Co-evolution Transformer for Protein Contact Prediction
    Zhang, He
    Ju, Fusong
    Zhu, Jianwei
    He, Liang
    Shao, Bin
    Zheng, Nanning
    Liu, Tie-Yan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [5] CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction
    Ju, Fusong
    Zhu, Jianwei
    Shao, Bin
    Kong, Lupeng
    Liu, Tie-Yan
    Zheng, Wei-Mou
    Bu, Dongbo
    [J]. NATURE COMMUNICATIONS, 2021, 12 (01)
  • [6] CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction
    Fusong Ju
    Jianwei Zhu
    Bin Shao
    Lupeng Kong
    Tie-Yan Liu
    Wei-Mou Zheng
    Dongbo Bu
    [J]. Nature Communications, 12
  • [7] Comparing co-evolution methods and their application to template-free protein structure prediction
    de Oliveira, Saulo Henrique Pires
    Shi, Jiye
    Deane, Charlotte M.
    [J]. BIOINFORMATICS, 2017, 33 (03) : 373 - 381
  • [8] Improved protein structure prediction using potentials from deep learning
    Senior, Andrew W.
    Evans, Richard
    Jumper, John
    Kirkpatrick, James
    Sifre, Laurent
    Green, Tim
    Qin, Chongli
    Zidek, Augustin
    Nelson, Alexander W. R.
    Bridgland, Alex
    Penedones, Hugo
    Petersen, Stig
    Simonyan, Karen
    Crossan, Steve
    Kohli, Pushmeet
    Jones, David T.
    Silver, David
    Kavukcuoglu, Koray
    Hassabis, Demis
    [J]. NATURE, 2020, 577 (7792) : 706 - +
  • [9] Improved protein structure prediction using potentials from deep learning
    Andrew W. Senior
    Richard Evans
    John Jumper
    James Kirkpatrick
    Laurent Sifre
    Tim Green
    Chongli Qin
    Augustin Žídek
    Alexander W. R. Nelson
    Alex Bridgland
    Hugo Penedones
    Stig Petersen
    Karen Simonyan
    Steve Crossan
    Pushmeet Kohli
    David T. Jones
    David Silver
    Koray Kavukcuoglu
    Demis Hassabis
    [J]. Nature, 2020, 577 : 706 - 710
  • [10] Co-evolution of (Information) System Models
    Dahanayake, Ajantha
    Thalheim, Bernhard
    [J]. ENTERPRISE, BUSINESS-PROCESS AND INFORMATION SYSTEMS MODELING, 2010, 50 : 314 - +