共 50 条
Comprehensive review and empirical analysis of hallmarks of DNA-, RNA- and protein-binding residues in protein chains
被引:81
|作者:
Zhang, Jian
[1
]
Ma, Zhiqiang
[2
]
Kurgan, Lukasz
[3
]
机构:
[1] Xinyang Normal Univ, Sch Comp & Informat Technol, Xinyang, Peoples R China
[2] Northeast Normal Univ, Coll Humanities & Sci, Changchun, Jilin, Peoples R China
[3] Virginia Commonwealth Univ, Comp Sci, Richmond, VA USA
关键词:
protein-RNA interactions;
protein-DNA interactions;
protein-nucleic acid interactions;
protein-protein interactions;
DNA-binding residues;
RNA-binding residues;
STRUCTURALLY CONSERVED RESIDUES;
STRUCTURE-BASED PREDICTION;
AROMATIC-AMINO-ACIDS;
EVOLUTIONARY CONSERVATION;
SECONDARY STRUCTURE;
INTERACTION SITES;
WEB SERVER;
HOT-SPOTS;
SEQUENCE;
RECOGNITION;
D O I:
10.1093/bib/bbx168
中图分类号:
Q5 [生物化学];
学科分类号:
071010 ;
081704 ;
摘要:
Proteins interact with a variety of molecules including proteins and nucleic acids. We review a comprehensive collection of over 50 studies that analyze and/or predict these interactions. While majority of these studies address either solely protein-DNA or protein-RNA binding, only a few have a wider scope that covers both protein-protein and protein-nucleic acid binding. Our analysis reveals that binding residues are typically characterized with three hallmarks: relative solvent accessibility (RSA), evolutionary conservation and propensity of amino acids (AAs) for binding. Motivated by drawbacks of the prior studies, we perform a large-scale analysis to quantify and contrast the three hallmarks for residues that bind DNA-, RNA-, protein- and (for the first time) multi-ligand-binding residues that interact with DNA and proteins, and with RNA and proteins. Results generated on a well-annotated data set of over 23 000 proteins show that conservation of binding residues is higher for nucleic acid-than protein-binding residues. Multi-ligand-binding residues are more conserved and have higher RSA than single-ligand-binding residues. We empirically show that each hallmark discriminates between binding and non-binding residues, even predicted RSA, and that combining them improves discriminatory power for each of the five types of interactions. Linear scoring functions that combine these hallmarks offer good predictive performance of residue-level propensity for binding and provide intuitive interpretation of predictions. Better understanding of these residue-level interactions will facilitate development of methods that accurately predict binding in the exponentially growing databases of protein sequences.
引用
收藏
页码:1250 / 1268
页数:19
相关论文