Using evolutionary and structural information to predict DNA-binding sites on DNA-binding proteins

被引:120
|
作者
Kuznetsov, Igor B. [1 ]
Gou, Zhenkun [1 ]
Li, Run [1 ]
Hwang, Seungwoo [1 ]
机构
[1] SUNY Albany, Gen NY sis Ctr Excellence Canc Genom, Dept Epidemiol & Biostat, Rensselaer, NY 12144 USA
关键词
support vector machine; protein-DNA interaction prediction; position-specific scoring matrix; evolutionary conservation; structural information; DNA binding; accuracy; statistical analysis;
D O I
10.1002/prot.20977
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Proteins that interact with DNA are involved in a number of fundamental biological activities such as DNA replication, transcription, and repair. A reliable identification of DNA-binding sites in DNA-binding proteins is important for functional annotation, site-directed mutagenesis, and modeling protein-DNA interactions. We apply Support Vector Machine (SVM), a supervised pattern recognition method, to predict DNA-binding sites in DNA-binding proteins using the following features: amino acid sequence, profile of evolutionary conservation of sequence positions, and low-resolution structural information. We use a rigorous statistical approach to study the performance of predictors that utilize different combinations of features and how this performance is affected by structural and sequence properties of proteins. Our results indicate that an SVM predictor based on a properly scaled profile of evolutionary conservation in the form of a position specific scoring matrix (PSSM) significantly outperforms a PSSM-based neural network predictor. The highest accuracy is achieved by SVM predictor that combines the profile of evolutionary conservation with low-resolution structural information. Our results also show that knowledge-based predictors of DNA-binding sites perform significantly better on proteins from mainly-et structural class and that the performance of these predictors is significantly correlated with certain structural and sequence properties of proteins. These observations suggest that it may be possible to assign a reliability index to the overall accuracy of the prediction of DNA-binding sites in any given protein using its sequence and structural properties. A web-server implementation of the predictors is freely available online at http://lcg.rit.albany.edu/ dp-bind/.
引用
收藏
页码:19 / 27
页数:9
相关论文
共 50 条
  • [1] Using electrostatic potentials to predict DNA-binding sites on DNA-binding proteins
    Jones, S
    Shanahan, HP
    Berman, HM
    Thornton, JM
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (24) : 7189 - 7198
  • [2] Sequence-based prediction of DNA-binding sites on DNA-binding proteins
    Gou, Z.
    Hwang, S.
    Kuznetsov, B., I
    [J]. PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE, VOL 1, 2006, : 268 - +
  • [3] Identification of DNA-binding Proteins Using Structural, Electrostatic and Evolutionary Features
    Nimrod, Guy
    Szilagyi, Andras
    Leslie, Christina
    Ben-Tal, Nir
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2009, 387 (04) : 1040 - 1053
  • [4] DNA-BINDING BY PROTEINS
    SCHLEIF, R
    [J]. SCIENCE, 1988, 241 (4870) : 1182 - 1187
  • [5] Analysis and classification of DNA-binding sites in single-stranded and double-stranded DNA-binding proteins using protein information
    Wang, Wei
    Liu, Juan
    Xiong, Yi
    Zhu, Lida
    Zhou, Xionghui
    [J]. IET SYSTEMS BIOLOGY, 2014, 8 (04) : 176 - 183
  • [6] DNA-BINDING PROTEINS
    PTASHNE, M
    [J]. NATURE, 1984, 308 (5961) : 753 - 754
  • [7] SELECTION OF DNA-BINDING SITES BY REGULATORY PROTEINS
    BERG, OG
    VONHIPPEL, PH
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 1988, 13 (06) : 207 - 211
  • [8] RAPID ISOLATION OF SPECIFIC DNA-BINDING PROTEINS AND THEIR DNA-BINDING DOMAINS
    WICHSER, U
    BRACK, C
    [J]. NUCLEIC ACIDS RESEARCH, 1992, 20 (15) : 4103 - 4104
  • [9] Role of Shape Deformation of DNA-Binding Sites in Regulating the Efficiency and Specificity in Their Recognition by DNA-Binding Proteins
    Sangeeta
    Mishra, Sujeet Kumar
    Bhattacherjee, Arnab
    [J]. JACS AU, 2024, 4 (07): : 2640 - 2655
  • [10] Structural changes in DNA-binding proteins on complexation
    Poddar, Sayan
    Chakravarty, Devlina
    Chakrabarti, Pinak
    [J]. NUCLEIC ACIDS RESEARCH, 2018, 46 (07) : 3298 - 3308