Geometric deep learning of protein-DNA binding specificity

被引:0
|
作者
Mitra, Raktim [1 ]
Li, Jinsen [1 ]
Sagendorf, Jared M. [1 ,7 ]
Jiang, Yibei [1 ]
Cohen, Ari S. [1 ]
Chiu, Tsu-Pei [1 ]
Glasscock, Cameron J. [2 ,3 ]
Rohs, Remo [1 ,4 ,5 ,6 ]
机构
[1] Univ Southern Calif, Dept Quantitat & Computat Biol, Los Angeles, CA 90007 USA
[2] Univ Washington, Dept Biochem, Seattle, WA USA
[3] Univ Washington, Inst Prot Design, Seattle, WA USA
[4] Univ Southern Calif, Dept Chem, Los Angeles, CA 90007 USA
[5] Univ Southern Calif, Dept Phys & Astron, Los Angeles, CA 90007 USA
[6] Univ Southern Calif, Thomas Lord Dept Comp Sci, Los Angeles, CA 90007 USA
[7] Univ Calif San Francisco, Dept Bioengn & Therapeut Sci, San Francisco, CA USA
基金
美国国家卫生研究院;
关键词
TRANSCRIPTION FACTORS; RECOGNITION; PREDICTION; P53; MODELS; ORIGINS; BASES; SHAPE;
D O I
10.1038/s41592-024-02372-w
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Predicting protein-DNA binding specificity is a challenging yet essential task for understanding gene regulation. Protein-DNA complexes usually exhibit binding to a selected DNA target site, whereas a protein binds, with varying degrees of binding specificity, to a wide range of DNA sequences. This information is not directly accessible in a single structure. Here, to access this information, we present Deep Predictor of Binding Specificity (DeepPBS), a geometric deep-learning model designed to predict binding specificity from protein-DNA structure. DeepPBS can be applied to experimental or predicted structures. Interpretable protein heavy atom importance scores for interface residues can be extracted. When aggregated at the protein residue level, these scores are validated through mutagenesis experiments. Applied to designed proteins targeting specific DNA sequences, DeepPBS was demonstrated to predict experimentally measured binding specificity. DeepPBS offers a foundation for machine-aided studies that advance our understanding of molecular interactions and guide experimental designs and synthetic biology. DeepPBS is a deep-learning model designed to predict the binding specificity of protein-DNA interactions using physicochemical and geometric contexts. DeepPBS functions across protein families and on experimentally determined as well as predicted protein-DNA complex structures.
引用
收藏
页码:1674 / 1683
页数:15
相关论文
共 50 条
  • [31] Structure-specificity relationship in protein-DNA recognition
    Kono, H
    Sarai, A
    BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE, 2004, : 155 - 161
  • [32] AIDmut-Seq: a Three-Step Method for Detecting Protein-DNA Binding Specificity
    Li, Feixuan
    Liu, Xiao-Yu
    Ni, Lei
    Jin, Fan
    MICROBIOLOGY SPECTRUM, 2023, 11 (01):
  • [33] Dynamic and Structural Modeling of the Specificity in Protein-DNA Interactions Guided by Binding Assay and Structure Data
    Tan, Cheng
    Takada, Shoji
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2018, 14 (07) : 3877 - 3889
  • [34] New insights into protein-DNA binding specificity from hydrogen bond based comparative study
    Lin, Maoxuan
    Guo, Jun-tao
    NUCLEIC ACIDS RESEARCH, 2019, 47 (21) : 11103 - 11113
  • [35] Modeling in-vivo protein-DNA binding by combining multiple-instance learning with a hybrid deep neural network
    Zhang, Qinhu
    Shen, Zhen
    Huang, De-Shuang
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [36] Protein-DNA binding in high-resolution
    Mahony, Shaun
    Pugh, B. Franklin
    CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 2015, 50 (04) : 269 - 283
  • [37] Modeling in-vivo protein-DNA binding by combining multiple-instance learning with a hybrid deep neural network
    Qinhu Zhang
    Zhen Shen
    De-Shuang Huang
    Scientific Reports, 9
  • [38] Protein-DNA binding specificity: a grid-enabled computational approach applied to single and multiple protein assemblies
    Zakrzewska, Krystyna
    Bouvier, Benjamin
    Michon, Alexis
    Blanchet, Christophe
    Lavery, Richard
    PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2009, 11 (45) : 10712 - 10721
  • [39] Data Analytics for Protein-DNA Binding Interactions
    Wong, Ka-Chun
    2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 1573 - 1578
  • [40] Non-additivity in protein-DNA binding
    O'Flanagan, RA
    Paillard, G
    Lavery, R
    Sengupta, AM
    BIOINFORMATICS, 2005, 21 (10) : 2254 - 2263