Deep Learning to Predict Protein Backbone Structure from High-Resolution Cryo-EM Density Maps

被引:52
|
作者
Si, Dong [1 ]
Moritz, Spencer A. [1 ]
Pfab, Jonas [1 ]
Hou, Jie [2 ,3 ]
Cao, Renzhi [4 ]
Wang, Liguo [5 ]
Wu, Tianqi [6 ]
Cheng, Jianlin [6 ]
机构
[1] Univ Washington, Div Comp & Software Syst, Bothell, WA 98011 USA
[2] St Louis Univ, Dept Comp Sci, St Louis, MO 63103 USA
[3] St Louis Univ, Program Bioinformat & Computat Biol, St Louis, MO 63103 USA
[4] Pacific Lutheran Univ, Dept Comp Sci, Tacoma, WA 98447 USA
[5] Univ Washington, Dept Biol Struct, Seattle, WA 98185 USA
[6] Univ Missouri, Dept Elect Engn & Comp Sci, Columbia, MO 65211 USA
关键词
VISUALIZATION; SLO2.2;
D O I
10.1038/s41598-020-60598-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Cryo-electron microscopy (cryo-EM) has become a leading technology for determining protein structures. Recent advances in this field have allowed for atomic resolution. However, predicting the backbone trace of a protein has remained a challenge on all but the most pristine density maps (<2.5 angstrom resolution). Here we introduce a deep learning model that uses a set of cascaded convolutional neural networks (CNNs) to predict Ca atoms along a protein's backbone structure. The cascaded-CNN (C-CNN) is a novel deep learning architecture comprised of multiple CNNs, each predicting a specific aspect of a protein's structure. This model predicts secondary structure elements (SSEs), backbone structure, and Ca atoms, combining the results of each to produce a complete prediction map. The cascaded-CNN is a semantic segmentation image classifier and was trained using thousands of simulated density maps. This method is largely automatic and only requires a recommended threshold value for each protein density map. A specialized tabu-search path walking algorithm was used to produce an initial backbone trace with Ca placements. A helix-refinement algorithm made further improvements to the a-helix SSEs of the backbone trace. Finally, a novel quality assessment-based combinatorial algorithm was used to effectively map protein sequences onto Ca traces to obtain full-atom protein structures. This method was tested on 50 experimental maps between 2.6 angstrom and 4.4 angstrom resolution. It outperformed several state-of-the-art prediction methods including Rosetta de-novo, MAINMAST, and a Phenix based method by producing the most complete predicted protein structures, as measured by percentage of found Ca atoms. This method accurately predicted 88.9% (mean) of the Ca atoms within 3 A of a protein's backbone structure surpassing the 66.8% mark achieved by the leading alternate method (Phenix based fully automatic method) on the same set of density maps. The C-CNN also achieved an average root-mean-square deviation (RMSD) of 1.24 angstrom on a set of 50 experimental density maps which was tested by the Phenix based fully automatic method. The source code and demo of this research has been published at https://github.com/DrDongSi/Ca-Backbone-Prediction.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Deep Learning to Predict Protein Backbone Structure from High-Resolution Cryo-EM Density Maps
    Dong Si
    Spencer A. Moritz
    Jonas Pfab
    Jie Hou
    Renzhi Cao
    Liguo Wang
    Tianqi Wu
    Jianlin Cheng
    Scientific Reports, 10
  • [2] CryoRes: Local Resolution Estimation of Cryo-EM Density Maps by Deep Learning
    Dai, Muzhi
    Dong, Zhuoer
    Xu, Kui
    Zhang, Qiangfeng Cliff
    JOURNAL OF MOLECULAR BIOLOGY, 2023, 435 (09)
  • [3] Advancing structure modeling from cryo-EM maps with deep learning
    Li, Shu
    Terashi, Genki
    Zhang, Zicong
    Kihara, Daisuke
    BIOCHEMICAL SOCIETY TRANSACTIONS, 2025,
  • [4] Protein secondary structure detection in intermediate-resolution cryo-EM maps using deep learning
    Subramaniya, Sai Raghavendra Maddhuri Venkata
    Terashi, Genki
    Kihara, Daisuke
    NATURE METHODS, 2019, 16 (09) : 911 - +
  • [5] Protein secondary structure detection in intermediate-resolution cryo-EM maps using deep learning
    Sai Raghavendra Maddhuri Venkata Subramaniya
    Genki Terashi
    Daisuke Kihara
    Nature Methods, 2019, 16 : 911 - 917
  • [6] Protein Secondary Structure Detection in Intermediate-Resolution Cryo-EM Maps using Deep Learning
    Subramaniya, Sai Raghavendra Maddhuri Venkata
    Terashi, Genki
    Kihara, Daisuke
    BIOPHYSICAL JOURNAL, 2020, 118 (03) : 43A - 43A
  • [7] Cryo-EM goes high-resolution
    Doerr, Allison
    NATURE METHODS, 2015, 12 (07) : 598 - 599
  • [8] Cryo-EM goes high-resolution
    Allison Doerr
    Nature Methods, 2015, 12 : 599 - 599
  • [9] ANALYSES OF SUBNANOMETER RESOLUTION CRYO-EM DENSITY MAPS
    Baker, Matthew L.
    Baker, Mariah R.
    Hryc, Corey F.
    DiMaio, Frank
    METHODS IN ENZYMOLOGY, VOL 483: CRYO-EM, PART C: ANALYSES, INTERPRETATION, AND CASE STUDIES, 2010, 483 : 1 - 29
  • [10] Quantifying the local resolution of cryo-EM density maps
    Kucukelbir A.
    Sigworth F.J.
    Tagare H.D.
    Nature Methods, 2014, 11 (1) : 63 - 65