A novel prediction method for protein DNA-binding residues based on neighboring residue correlations

Cited by: 1
Authors
Song, Jiazhi [1, 2, 3]
Liu, Guixia [1, 3]
Jiang, Jingqing [2]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, 2699 Qianjin St, Changchun 130012, Jilin, Peoples R China
[2] Inner Mongolia Minzu Univ, Coll Comp Sci & Technol, Tongliao, Inner Mongolia, Peoples R China
[3] Jilin Univ, Coll Comp Sci & Technol, Dept Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun, Jilin, Peoples R China
Keywords
Bioinformatics; protein; machine learning; binding sites; sequence information; INTEGRATING SEQUENCE; DOMAIN; SITES;
DOI
10.1080/13102818.2022.2122871
Chinese Library Classification number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Accurately identifying protein DNA-binding residues is important for understanding the protein-DNA recognition mechanism and for protein function annotation. Many computational methods have been proposed to predict protein-DNA binding residues from protein sequence information; however, for severely imbalanced data such as protein-DNA binding datasets, the under-sampling technique applied by most previous methods cannot achieve satisfactory performance. In this study, an adjustment algorithm is proposed to offset the biased prediction results from the classifier. The algorithm uses the binding probability between the target residue and its neighboring residues to recover true binding residues that were wrongly predicted as non-binding. The proposed prediction method with the adjustment algorithm achieves an area under the ROC curve (AUC) of 0.926 and 0.866 on two benchmark datasets and 0.882 on the independent testing set, which demonstrates that the method can efficiently predict specific residues for protein-DNA interactions.
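The abstract does not give the details of the adjustment algorithm, so the following is only an illustrative sketch of the general idea (using neighboring residues to recover residues wrongly predicted as non-binding), not the authors' method. The function name `adjust_predictions`, the window size, the decision threshold, the `rescue_margin` parameter, and the use of a simple mean over neighboring scores are all assumptions made for this example.

```python
from typing import List


def adjust_predictions(
    probs: List[float],
    threshold: float = 0.5,      # assumed decision threshold of the classifier
    window: int = 2,             # assumed number of neighbors on each side
    rescue_margin: float = 0.2,  # assumed: how far below threshold a score may be
) -> List[int]:
    """Hypothetical neighbor-based adjustment of per-residue predictions.

    probs holds one predicted binding probability per residue, in sequence
    order. A residue predicted as non-binding is relabeled as binding when
    the mean score of its sequence neighbors reaches the threshold and its
    own score falls short of the threshold by at most rescue_margin.
    """
    n = len(probs)
    initial = [1 if p >= threshold else 0 for p in probs]
    adjusted = initial[:]
    for i, p in enumerate(probs):
        if initial[i] == 1:
            continue  # already predicted as binding, nothing to adjust
        lo, hi = max(0, i - window), min(n, i + window + 1)
        neighbors = [probs[j] for j in range(lo, hi) if j != i]
        if not neighbors:
            continue  # single-residue sequence: no neighbors to consult
        neighbor_mean = sum(neighbors) / len(neighbors)
        if neighbor_mean >= threshold and p >= threshold - rescue_margin:
            adjusted[i] = 1
    return adjusted


# Toy scores (made up for illustration): the residue at index 2 scores
# slightly below the threshold but sits between confidently predicted
# binding residues, so the sketch relabels it as binding.
scores = [0.9, 0.8, 0.4, 0.85, 0.9, 0.1, 0.05]
labels = adjust_predictions(scores)
```

The adjustment here only ever changes non-binding predictions to binding, which matches the abstract's stated goal of offsetting a classifier biased toward the majority (non-binding) class; how the published method actually derives its binding probabilities from neighboring residues is described in the paper itself.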
Pages: 865-877
Number of pages: 13