Sequence-based prediction of DNA-binding sites on DNA-binding proteins

被引:0
|
作者
Gou, Z. [1 ]
Hwang, S. [1 ]
Kuznetsov, B., I [1 ]
机构
[1] SUNY Albany, Gen NY Sis Ctr Excellence Canc Genom, One Discovery Dr, Rensselaer, NY USA
关键词
protein-DNA interaction; position specific scoring matrix; evolutionary conservation; web-server; DNA binding; prediction; pattern recognition; machine learning;
D O I
暂无
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Identification of DNA-binding sites on DNA-binding proteins is important for functional annotation. Experimental determination of the structure of a protein-DNA complex is an expensive process. Reliable computational methods that utilize the sequence of a DNA-binding protein to predict its DNA-binding interface are needed. Results: We present an application of three machine learning methods: support vector machine, kernel logistic regression, and penalized logistic regression to predict DNA-binding sites on a DNA-binding protein using its amino acid sequence as an input. Prediction is performed using either single sequence or a profile of evolutionary conservation. The performance of our predictors is better than that of other existing sequence-based methods. The outputs of all three individual methods are combined to obtain a consensus prediction. This further improves performance and results in accuracy of 82.4%, sensitivity of 84.9% and specificity of 83.1% for the strict consensus prediction. Availability: http://lcg.rit.albany.edu/dp-bind
引用
收藏
页码:268 / +
页数:2
相关论文
共 50 条
  • [1] DP-Bind: a Web server for sequence-based prediction of DNA-binding residues in DNA-binding proteins
    Hwang, Seungwoo
    Gou, Zhenkun
    Kuznetsov, Igor B.
    [J]. BIOINFORMATICS, 2007, 23 (05) : 634 - 636
  • [2] Sequence-Based Prediction of DNA-Binding Residues in Proteins with Conservation and Correlation Information
    Ma, Xin
    Guo, Jing
    Liu, Hong-De
    Xie, Jian-Ming
    Sun, Xiao
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (06) : 1766 - 1775
  • [3] A sequence-based multiple kernel model for identifying DNA-binding proteins
    Yuqing Qian
    Limin Jiang
    Yijie Ding
    Jijun Tang
    Fei Guo
    [J]. BMC Bioinformatics, 22
  • [4] A sequence-based multiple kernel model for identifying DNA-binding proteins
    Qian, Yuqing
    Jiang, Limin
    Ding, Yijie
    Tang, Jijun
    Guo, Fei
    [J]. BMC BIOINFORMATICS, 2021, 22 (SUPPL 3)
  • [5] Using electrostatic potentials to predict DNA-binding sites on DNA-binding proteins
    Jones, S
    Shanahan, HP
    Berman, HM
    Thornton, JM
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (24) : 7189 - 7198
  • [6] Using evolutionary and structural information to predict DNA-binding sites on DNA-binding proteins
    Kuznetsov, Igor B.
    Gou, Zhenkun
    Li, Run
    Hwang, Seungwoo
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 64 (01) : 19 - 27
  • [7] Structure based prediction of binding residues on DNA-binding proteins
    Bhardwaj, Nitin
    Langlois, Robert E.
    Hui, Guijun Zhao
    [J]. 2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 2611 - 2614
  • [8] DNA-BINDING BY PROTEINS
    SCHLEIF, R
    [J]. SCIENCE, 1988, 241 (4870) : 1182 - 1187
  • [9] DNA-BINDING PROTEINS
    PTASHNE, M
    [J]. NATURE, 1984, 308 (5961) : 753 - 754
  • [10] Moment-based prediction of DNA-binding proteins
    Ahmad, S
    Sarai, A
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2004, 341 (01) : 65 - 71