Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information

被引:288
|
作者
Ahmad, S [1 ]
Gromiha, MM
Sarai, A
机构
[1] Kyushu Inst Technol, Dept Biochem Sci & Engn, Iizuka, Fukuoka 8208502, Japan
[2] Jamia Millia Islamia, Dept Biosci, New Delhi 110025, India
[3] AIST, Computat Biol Res Ctr, CBRC, Koto Ku, Tokyo 1350064, Japan
关键词
D O I
10.1093/bioinformatics/btg432
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Though vitally important to cell function, the mechanism of protein-DNA binding has not yet been completely understood. We therefore analysed the relationship between DNA binding and protein sequence composition, solvent accessibility and secondary structure. Using non-redundant databases of transcription factors and protein-DNA complexes, neural network models were developed to utilize the information present in this relationship to predict DNA-binding proteins and their binding residues. Results: Sequence composition was found to provide sufficient information to predict the probability of its binding to DNA with nearly 69% sensitivity at 64% accuracy for the considered proteins; sequence neighbourhood and solvent accessibility information were sufficient to make binding site predictions with 40% sensitivity at 79% accuracy. Detailed analysis of binding residues shows that some three- and five-residue segments frequently bind to DNA and that solvent accessibility plays a major role in binding. Although, binding behaviour was not associated with any particular secondary structure, there were interesting exceptions at the residue level. Over-representation of some residues in the binding sites was largely lost at the total sequence level, but a different kind of compositional preference was observed in DNA-binding proteins.
引用
收藏
页码:477 / 486
页数:10
相关论文
共 50 条
  • [1] Sequence-Based Prediction of DNA-Binding Residues in Proteins with Conservation and Correlation Information
    Ma, Xin
    Guo, Jing
    Liu, Hong-De
    Xie, Jian-Ming
    Sun, Xiao
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (06) : 1766 - 1775
  • [2] Structure based prediction of binding residues on DNA-binding proteins
    Bhardwaj, Nitin
    Langlois, Robert E.
    Hui, Guijun Zhao
    [J]. 2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 2611 - 2614
  • [3] Sequence-based prediction of DNA-binding sites on DNA-binding proteins
    Gou, Z.
    Hwang, S.
    Kuznetsov, B., I
    [J]. PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE, VOL 1, 2006, : 268 - +
  • [4] DP-Bind: a Web server for sequence-based prediction of DNA-binding residues in DNA-binding proteins
    Hwang, Seungwoo
    Gou, Zhenkun
    Kuznetsov, Igor B.
    [J]. BIOINFORMATICS, 2007, 23 (05) : 634 - 636
  • [5] Prediction of DNA-binding residues from sequence
    Ofran, Yanay
    Mysore, Venkatesh
    Rost, Burkhard
    [J]. BIOINFORMATICS, 2007, 23 (13) : I347 - I353
  • [6] Using evolutionary and structural information to predict DNA-binding sites on DNA-binding proteins
    Kuznetsov, Igor B.
    Gou, Zhenkun
    Li, Run
    Hwang, Seungwoo
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 64 (01) : 19 - 27
  • [7] Prediction of DNA-binding residues from protein sequence information using random forests
    Liangjiang Wang
    Mary Qu Yang
    Jack Y Yang
    [J]. BMC Genomics, 10
  • [8] Prediction of DNA-binding residues from sequence information using convolutional neural network
    Zhou, Jiyun
    Lu, Qin
    Xu, Ruifeng
    Gui, Lin
    Wang, Hongpeng
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2017, 17 (02) : 132 - 152
  • [9] Prediction of DNA-binding residues from protein sequence information using random forests
    Wang, Liangjiang
    Yang, Mary Qu
    Yang, Jack Y.
    [J]. BMC GENOMICS, 2009, 10
  • [10] CNNsite: Prediction of DNA-binding Residues in Proteins Using Convolutional Neural Network with Sequence Features
    Zhou, Jiyun
    Lu, Qin
    Xu, Ruifeng
    Gui, Lin
    Wang, Hongpeng
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 78 - 85