Comprehensive review and empirical analysis of hallmarks of DNA-, RNA- and protein-binding residues in protein chains

Cited by: 81
Authors
Zhang, Jian [1]
Ma, Zhiqiang [2]
Kurgan, Lukasz [3]
Affiliations
[1] Xinyang Normal Univ, Sch Comp & Informat Technol, Xinyang, Peoples R China
[2] Northeast Normal Univ, Coll Humanities & Sci, Changchun, Jilin, Peoples R China
[3] Virginia Commonwealth Univ, Comp Sci, Richmond, VA USA
Keywords
protein-RNA interactions; protein-DNA interactions; protein-nucleic acid interactions; protein-protein interactions; DNA-binding residues; RNA-binding residues; STRUCTURALLY CONSERVED RESIDUES; STRUCTURE-BASED PREDICTION; AROMATIC-AMINO-ACIDS; EVOLUTIONARY CONSERVATION; SECONDARY STRUCTURE; INTERACTION SITES; WEB SERVER; HOT-SPOTS; SEQUENCE; RECOGNITION;
DOI
10.1093/bib/bbx168
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Proteins interact with a variety of molecules including proteins and nucleic acids. We review a comprehensive collection of over 50 studies that analyze and/or predict these interactions. While the majority of these studies address solely protein-DNA or protein-RNA binding, only a few have a wider scope that covers both protein-protein and protein-nucleic acid binding. Our analysis reveals that binding residues are typically characterized by three hallmarks: relative solvent accessibility (RSA), evolutionary conservation and propensity of amino acids (AAs) for binding. Motivated by drawbacks of the prior studies, we perform a large-scale analysis to quantify and contrast the three hallmarks for DNA-, RNA- and protein-binding residues and (for the first time) multi-ligand-binding residues that interact with DNA and proteins, and with RNA and proteins. Results generated on a well-annotated data set of over 23 000 proteins show that conservation is higher for nucleic acid-binding than for protein-binding residues. Multi-ligand-binding residues are more conserved and have higher RSA than single-ligand-binding residues. We empirically show that each hallmark, even predicted RSA, discriminates between binding and non-binding residues, and that combining them improves discriminatory power for each of the five types of interactions. Linear scoring functions that combine these hallmarks offer good predictive performance of residue-level propensity for binding and provide intuitive interpretation of predictions. Better understanding of these residue-level interactions will facilitate development of methods that accurately predict binding in the exponentially growing databases of protein sequences.
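The abstract mentions linear scoring functions that combine the three hallmarks. A minimal sketch of what such a function could look like is given below; it is an illustration only, not the authors' published model. The names `ResidueHallmarks`, `linear_binding_score` and `rank_residues`, the assumption that each hallmark has been scaled to [0, 1], and the weight values are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass(frozen=True)
class ResidueHallmarks:
    """Three per-residue hallmarks, each assumed to be pre-scaled to [0, 1]."""
    rsa: float           # relative solvent accessibility (native or predicted)
    conservation: float  # evolutionary conservation
    propensity: float    # propensity of the amino acid type for binding


# Hypothetical weights chosen for illustration; they are NOT taken from the paper.
# In practice they would be fitted separately for each interaction type.
WEIGHTS: Tuple[float, float, float] = (0.4, 0.3, 0.3)
BIAS: float = 0.0


def linear_binding_score(
    residue: ResidueHallmarks,
    weights: Tuple[float, float, float] = WEIGHTS,
    bias: float = BIAS,
) -> float:
    """Return a weighted sum of the three hallmarks plus a bias term."""
    w_rsa, w_cons, w_prop = weights
    return (
        bias
        + w_rsa * residue.rsa
        + w_cons * residue.conservation
        + w_prop * residue.propensity
    )


def rank_residues(residues: Sequence[ResidueHallmarks]) -> List[int]:
    """Return residue indices ordered from highest to lowest score."""
    scores = [linear_binding_score(r) for r in residues]
    return sorted(range(len(residues)), key=lambda i: scores[i], reverse=True)
```

Because the score is a plain weighted sum, each weight can be read directly as the contribution of one hallmark, which is the kind of interpretability the abstract attributes to linear scoring functions.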
Pages: 1250-1268
Number of pages: 19
Related papers
50 in total
  • [1] Comprehensive review and empirical analysis of hallmarks of DNA-, RNA- and protein-binding residues in protein chains (vol 20, pg 1250, 2019)
    Zhang, Jian
    Ma, Zhiqiang
    Kurgan, Lukasz
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (05) : 1856 - 1856
  • [2] DeepDISOBind: accurate prediction of RNA-, DNA- and protein-binding intrinsically disordered residues with deep multi-task learning
    Zhang, Fuhao
    Zhao, Bi
    Shi, Wenbo
    Li, Min
    Kurgan, Lukasz
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [3] DNA→DNA, AND DNA→RNA→PROTEIN - ORCHESTRATION BY A SINGLE COMPLEX OPERON
    LUPSKI, JR
    GODSON, GN
    BIOESSAYS, 1989, 10 (05) : 152 - 157
  • [4] Comprehensive analysis of RNA-chromatin, RNA-, and DNA-protein interactions
    Khlebnikov, Daniil A.
    Nikolskaya, Arina I.
    Zharikova, Anastasia A.
    Mironov, Andrey A.
    NAR GENOMICS AND BIOINFORMATICS, 2025, 7 (01)
  • [5] A comprehensive comparative review of sequence-based predictors of DNA- and RNA-binding residues
    Yan, Jing
    Friedrich, Stefanie
    Kurgan, Lukasz
    BRIEFINGS IN BIOINFORMATICS, 2016, 17 (01) : 88 - 105
  • [6] Solution structure of a multifunctional DNA- and protein-binding motif of human Werner syndrome protein
    Hu, JS
    Feng, HQ
    Zeng, WY
    Lin, GX
    Xi, XG
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (51) : 18379 - 18384
  • [8] Systematic domain-based aggregation of protein structures highlights DNA-, RNA- and other ligand-binding positions
    Kobren, Shilpa Nadimpalli
    Singh, Mona
    NUCLEIC ACIDS RESEARCH, 2019, 47 (02) : 582 - 593
  • [9] Investigating DNA-, RNA-, and protein-based features as a means to discriminate pathogenic synonymous variants
    Livingstone, Mark
    Folkman, Lukas
    Yang, Yuedong
    Zhang, Ping
    Mort, Matthew
    Cooper, David N.
    Liu, Yunlong
    Stantic, Bela
    Zhou, Yaoqi
    HUMAN MUTATION, 2017, 38 (10) : 1336 - 1347
  • [10] The DNA- and protein-binding properties and cytotoxicity of a new copper(II) hydrazone Schiff base complex
    Biswas, Niladri
    Saha, Sandeepta
    Biswas, Barun Kumar
    Chowdhury, Manas
    Rahaman, Ashikur
    Junghare, Vivek
    Mohapatra, Swati
    Hazra, Saugata
    Zangrando, Ennio
    Choudhury, Ruma Roy
    Choudhury, Chirantan Roy
    JOURNAL OF COORDINATION CHEMISTRY, 2021, 74 (9-10) : 1482 - 1504