DNABind: A hybrid algorithm for structure-based prediction of DNA-binding residues by combining machine learning- and template-based approaches

Cited by: 52
Authors
Liu, Rong [1, 2]
Hu, Jianjun [1 ]
Affiliations
[1] Univ S Carolina, Dept Comp Sci & Engn, Columbia, SC 29208 USA
[2] Huazhong Agr Univ, Coll Life Sci & Technol, Ctr Bioinformat, Wuhan 430070, Peoples R China
Funding
U.S. National Science Foundation
Keywords
protein-DNA interaction; DNA-binding residue; machine learning; template; structural analysis; conformational change; PROTEIN-STRUCTURE ALIGNMENT; AMINO-ACID-SEQUENCES; SECONDARY STRUCTURE; WEB SERVER; SITES; CONSERVATION; INFORMATION; EVOLUTIONARY; RECOGNITION; POTENTIALS;
DOI
10.1002/prot.24330
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Accurate prediction of DNA-binding residues has become a problem of increasing importance in structural bioinformatics. Here, we present DNABind, a novel hybrid algorithm for identifying these crucial residues by exploiting the complementarity between machine learning- and template-based methods. Our machine learning-based method is a probabilistic combination of a structure-based and a sequence-based predictor, both implemented using support vector machines. The former uses structural features such as solvent accessibility, local geometry, topological features, and relative positions, which effectively quantify the difference between DNA-binding and nonbinding residues. The latter combines evolutionary conservation features with three other sequence attributes. Our template-based method relies on structural alignment and uses template structures from known protein-DNA complexes to infer DNA-binding residues. We show that the template method performs very well when reliable templates are found for the query proteins but tends to be strongly influenced by template quality and by conformational changes upon DNA binding. In contrast, the machine learning approach performs better when high-quality templates are not available (about one-third of the cases in our dataset) or when the query protein undergoes substantial conformational changes upon DNA binding. Our extensive experiments indicate that the hybrid approach distinctly improves on the performance of the individual methods for both bound and unbound structures. DNABind also significantly outperforms state-of-the-art algorithms by around 10% in terms of the Matthews correlation coefficient. The proposed methodology could also have wide application in the annotation of various protein functional sites. DNABind is freely available at http://mleg.cse.sc.edu/DNABind/. Proteins 2013; 81:1885-1899. (c) 2013 Wiley Periodicals, Inc.
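For readers unfamiliar with the evaluation metric named in the abstract, the following is a minimal sketch of how the Matthews correlation coefficient (MCC) is computed from per-residue binary labels (1 = DNA-binding, 0 = nonbinding). The function name `mcc` and the toy labels are illustrative assumptions and are not taken from the paper or from the DNABind software.

```python
import math

def mcc(y_true, y_pred):
    """Matthews correlation coefficient for binary labels (1 = binding, 0 = nonbinding).

    Illustrative helper; the name and interface are assumptions, not DNABind code.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("label sequences must have the same length")
    # Count the four cells of the confusion matrix.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, return 0 when any marginal count is zero.
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom

# Toy example (made-up labels for four residues).
true_labels = [1, 1, 0, 0]
predicted = [1, 0, 0, 0]
score = mcc(true_labels, predicted)
```

MCC ranges from -1 (complete disagreement) through 0 (no better than chance) to 1 (perfect prediction). Because it uses all four confusion-matrix cells, it is often preferred over accuracy when binding residues are much rarer than nonbinding ones.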
Pages: 1885-1899
Page count: 15