Accurate prediction of RNA-binding protein residues with two discriminative structural descriptors

被引:24
|
作者
Sun, Meijian [1 ]
Wang, Xia [1 ]
Zou, Chuanxin [1 ]
He, Zenghui [1 ]
Liu, Wei [1 ]
Li, Honglin [1 ]
机构
[1] E China Univ Sci & Technol, Sch Pharm, Shanghai Key Lab New Drug Design, State Key Lab Bioreactor Engn, 130 Mei Long Rd, Shanghai 200237, Peoples R China
来源
BMC BIOINFORMATICS | 2016年 / 17卷
基金
中国国家自然科学基金;
关键词
Protein-RNA interactions; Residue triplet interface propensity; Residue electrostatic surface potential; Random forest classifier; Structural analysis; SECONDARY STRUCTURE; STRUCTURE ALIGNMENT; SITES; SEQUENCE; RECOGNITION; DNA; INFORMATION; INTERFACE; DATABASE; CLASSIFICATION;
D O I
10.1186/s12859-016-1110-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: RNA-binding proteins participate in many important biological processes concerning RNA-mediated gene regulation, and several computational methods have been recently developed to predict the protein-RNA interactions of RNA-binding proteins. Newly developed discriminative descriptors will help to improve the prediction accuracy of these prediction methods and provide further meaningful information for researchers. Results: In this work, we designed two structural features (residue electrostatic surface potential and triplet interface propensity) and according to the statistical and structural analysis of protein-RNA complexes, the two features were powerful for identifying RNA-binding protein residues. Using these two features and other excellent structure-and sequence-based features, a random forest classifier was constructed to predict RNA-binding residues. The area under the receiver operating characteristic curve (AUC) of five-fold cross-validation for our method on training set RBP195 was 0.900, and when applied to the test set RBP68, the prediction accuracy (ACC) was 0.868, and the F-score was 0.631. Conclusions: The good prediction performance of our method revealed that the two newly designed descriptors could be discriminative for inferring protein residues interacting with RNAs. To facilitate the use of our method, a web-server called RNAProSite, which implements the proposed method, was constructed and is freely available at http://lilab.ecust.edu.cn/NABind.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] A boosting approach for prediction of protein-RNA binding residues
    Tang, Yongjun
    Liu, Diwei
    Wang, Zixiang
    Wen, Ting
    Deng, Lei
    [J]. BMC BIOINFORMATICS, 2017, 18
  • [32] A boosting approach for prediction of protein-RNA binding residues
    Yongjun Tang
    Diwei Liu
    Zixiang Wang
    Ting Wen
    Lei Deng
    [J]. BMC Bioinformatics, 18
  • [33] The locations of two RNA binding sites in the E-coli RNA-binding protein HFQ
    Sun, XG
    Wartell, RM
    [J]. BIOPHYSICAL JOURNAL, 2004, 86 (01) : 593A - 593A
  • [34] Refining the pool of RNA-binding domains advances the classification and prediction of RNA-binding proteins
    Wassmer, Elsa
    Koppany, Gergely
    Hermes, Malte
    Diederichs, Sven
    Caudron-Herger, Maiwen
    [J]. NUCLEIC ACIDS RESEARCH, 2024,
  • [35] RBRDetector: Improved prediction of binding residues on RNA-binding protein structures using complementary feature- and template-based strategies
    Yang, Xiao-Xia
    Deng, Zhi-Luo
    Liu, Rong
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2014, 82 (10) : 2455 - 2471
  • [36] Prediction of clustered RNA-binding protein motif sites in the mammalian genome
    Zhang, Chaolin
    Lee, Kuang-Yung
    Swanson, Maurice S.
    Darnell, Robert B.
    [J]. NUCLEIC ACIDS RESEARCH, 2013, 41 (14) : 6793 - 6807
  • [37] Prediction of RNA-binding residues in proteins using the interaction propensities of amino acids and nucleotides
    Shrestha, Rojan
    Kim, Jisu
    Han, Kyungsook
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES, 2008, 5226 : 114 - +
  • [38] Prediction of RNA-binding sites from evolutionary information of protein sequences
    Tong, Jing
    Jiang, Peng
    Lu, Zu-Hong
    [J]. PROGRESS ON POST-GENOME TECHNOLOGIES, 2007, : 205 - 208
  • [39] Prediction and validation of the unexplored RNA-binding protein atlas of the human proteome
    Zhao, Huiying
    Yang, Yuedong
    Janga, Sarath Chandra
    Kao, C. Cheng
    Zhou, Yaoqi
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2014, 82 (04) : 640 - 647
  • [40] TERIUS: accurate prediction of lncRNA via high-throughput sequencing data representing RNA-binding protein association
    Choi, Seo-Won
    Nam, Jin-Wu
    [J]. BMC BIOINFORMATICS, 2018, 19