Predicting DNA-Binding Proteins and Binding Residues by Complex Structure Prediction and Application to Human Proteome

被引:28
|
作者
Zhao, Huiying [1 ,2 ,3 ]
Wang, Jihua [2 ,4 ]
Zhou, Yaoqi [1 ,2 ,4 ,5 ,6 ]
Yang, Yuedong [1 ,2 ,5 ,6 ]
机构
[1] Indiana Univ Purdue Univ, Sch Informat, Indianapolis, IN 46202 USA
[2] Indiana Univ, Sch Med, Ctr Computat Biol & Bioinformat, Indianapolis, IN USA
[3] QIMR Berghofer Med Res Inst, Brisbane, Qld, Australia
[4] Shandong Prov Key Lab Funct Macromol Biophy, Dezhou, Shandong, Peoples R China
[5] Griffith Univ, Inst Glyc, Southport, Qld 4215, Australia
[6] Griffith Univ, Sch Informat & Commun Tech, Southport, Qld 4215, Australia
来源
PLOS ONE | 2014年 / 9卷 / 05期
基金
美国国家卫生研究院; 中国国家自然科学基金;
关键词
SUPPORT VECTOR MACHINES; RIBOSOMAL-RNA-BINDING; AMINO-ACID-SEQUENCES; ENERGY FUNCTION; WEB SERVER; EFFICIENT PREDICTION; FOLD RECOGNITION; RANDOM FOREST; SITES; IDENTIFICATION;
D O I
10.1371/journal.pone.0096694
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
As more and more protein sequences are uncovered from increasingly inexpensive sequencing techniques, an urgent task is to find their functions. This work presents a highly reliable computational technique for predicting DNA-binding function at the level of protein-DNA complex structures, rather than low-resolution two-state prediction of DNA-binding as most existing techniques do. The method first predicts protein-DNA complex structure by utilizing the template-based structure prediction technique HHblits, followed by binding affinity prediction based on a knowledge-based energy function (Distance-scaled finite ideal-gas reference state for protein-DNA interactions). A leave-one-out cross validation of the method based on 179 DNA-binding and 3797 non-binding protein domains achieves a Matthews correlation coefficient (MCC) of 0.77 with high precision (94%) and high sensitivity (65%). We further found 51% sensitivity for 82 newly determined structures of DNA-binding proteins and 56% sensitivity for the human proteome. In addition, the method provides a reasonably accurate prediction of DNA-binding residues in proteins based on predicted DNA-binding complex structures. Its application to human proteome leads to more than 300 novel DNA-binding proteins; some of these predicted structures were validated by known structures of homologous proteins in APO forms. The method [SPOT-Seq (DNA)] is available as an on-line server at http://sparks-lab.org.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Structure based prediction of binding residues on DNA-binding proteins
    Bhardwaj, Nitin
    Langlois, Robert E.
    Hui, Guijun Zhao
    [J]. 2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 2611 - 2614
  • [2] DP-Bind: a Web server for sequence-based prediction of DNA-binding residues in DNA-binding proteins
    Hwang, Seungwoo
    Gou, Zhenkun
    Kuznetsov, Igor B.
    [J]. BIOINFORMATICS, 2007, 23 (05) : 634 - 636
  • [3] Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information
    Ahmad, S
    Gromiha, MM
    Sarai, A
    [J]. BIOINFORMATICS, 2004, 20 (04) : 477 - 486
  • [4] Sequence-based prediction of DNA-binding sites on DNA-binding proteins
    Gou, Z.
    Hwang, S.
    Kuznetsov, B., I
    [J]. PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE, VOL 1, 2006, : 268 - +
  • [5] Prediction of DNA-binding residues from sequence
    Ofran, Yanay
    Mysore, Venkatesh
    Rost, Burkhard
    [J]. BIOINFORMATICS, 2007, 23 (13) : I347 - I353
  • [6] A Threading-Based Method for the Prediction of DNA-Binding Proteins with Application to the Human Genome
    Gao, Mu
    Skolnick, Jeffrey
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (11)
  • [7] In Silico Prediction and Validation of Novel RNA Binding Proteins and Residues in the Human Proteome
    Chowdhury, Shomeek
    Zhang, Jian
    Kurgan, Lukasz
    [J]. PROTEOMICS, 2018, 18 (21-22)
  • [8] STRUCTURE AND FUNCTION OF DNA-BINDING PROTEINS
    NELSON, HCM
    [J]. CURRENT OPINION IN GENETICS & DEVELOPMENT, 1995, 5 (02) : 180 - 189
  • [9] DNA-binding residues and binding mode prediction with binding-mechanism concerned models
    Huang Y.-F.
    Huang C.-C.
    Liu Y.-C.
    Oyang Y.-J.
    Huang C.-K.
    [J]. BMC Genomics, 10 (Suppl 3)
  • [10] A Review of DNA-binding Proteins Prediction Methods
    Qu, Kaiyang
    Wei, Leyi
    Zou, Quan
    [J]. CURRENT BIOINFORMATICS, 2019, 14 (03) : 246 - 254