Accurate prediction of RNA-binding protein residues with two discriminative structural descriptors

被引:24
|
作者
Sun, Meijian [1 ]
Wang, Xia [1 ]
Zou, Chuanxin [1 ]
He, Zenghui [1 ]
Liu, Wei [1 ]
Li, Honglin [1 ]
机构
[1] E China Univ Sci & Technol, Sch Pharm, Shanghai Key Lab New Drug Design, State Key Lab Bioreactor Engn, 130 Mei Long Rd, Shanghai 200237, Peoples R China
来源
BMC BIOINFORMATICS | 2016年 / 17卷
基金
中国国家自然科学基金;
关键词
Protein-RNA interactions; Residue triplet interface propensity; Residue electrostatic surface potential; Random forest classifier; Structural analysis; SECONDARY STRUCTURE; STRUCTURE ALIGNMENT; SITES; SEQUENCE; RECOGNITION; DNA; INFORMATION; INTERFACE; DATABASE; CLASSIFICATION;
D O I
10.1186/s12859-016-1110-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: RNA-binding proteins participate in many important biological processes concerning RNA-mediated gene regulation, and several computational methods have been recently developed to predict the protein-RNA interactions of RNA-binding proteins. Newly developed discriminative descriptors will help to improve the prediction accuracy of these prediction methods and provide further meaningful information for researchers. Results: In this work, we designed two structural features (residue electrostatic surface potential and triplet interface propensity) and according to the statistical and structural analysis of protein-RNA complexes, the two features were powerful for identifying RNA-binding protein residues. Using these two features and other excellent structure-and sequence-based features, a random forest classifier was constructed to predict RNA-binding residues. The area under the receiver operating characteristic curve (AUC) of five-fold cross-validation for our method on training set RBP195 was 0.900, and when applied to the test set RBP68, the prediction accuracy (ACC) was 0.868, and the F-score was 0.631. Conclusions: The good prediction performance of our method revealed that the two newly designed descriptors could be discriminative for inferring protein residues interacting with RNAs. To facilitate the use of our method, a web-server called RNAProSite, which implements the proposed method, was constructed and is freely available at http://lilab.ecust.edu.cn/NABind.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Accurate prediction of RNA-binding protein residues with two discriminative structural descriptors
    Meijian Sun
    Xia Wang
    Chuanxin Zou
    Zenghui He
    Wei Liu
    Honglin Li
    [J]. BMC Bioinformatics, 17
  • [2] New Descriptors of Evolutionary Information for Accurate Prediction of DNA and RNA-Binding Residues in Protein Sequences
    Wang, Liangjiang
    Huang, Caiyan
    [J]. 2009 INTERNATIONAL JOINT CONFERENCE ON BIOINFORMATICS, SYSTEMS BIOLOGY AND INTELLIGENT COMPUTING, PROCEEDINGS, 2009, : 246 - 250
  • [3] RNA-binding residues prediction using structural features
    Huizhu Ren
    Ying Shen
    [J]. BMC Bioinformatics, 16
  • [4] RNA-binding residues prediction using structural features
    Ren, Huizhu
    Shen, Ying
    [J]. BMC BIOINFORMATICS, 2015, 16
  • [5] BindN plus for accurate prediction of DNA and RNA-binding residues from protein sequence features
    Wang, Liangjiang
    Huang, Caiyan
    Yang, Mary Qu
    Yang, Jack Y.
    [J]. BMC SYSTEMS BIOLOGY, 2010, 4
  • [6] Improve the Prediction of RNA-Binding Residues Using Structural Neighbours
    Li, Quan
    Cao, Zanxia
    Liu, Haiyan
    [J]. PROTEIN AND PEPTIDE LETTERS, 2010, 17 (03): : 287 - 296
  • [7] PiRaNhA: a server for the computational prediction of RNA-binding residues in protein sequences
    Murakami, Yoichi
    Spriggs, Ruth V.
    Nakamura, Haruki
    Jones, Susan
    [J]. NUCLEIC ACIDS RESEARCH, 2010, 38 : W412 - W416
  • [8] PredRBR: Accurate Prediction of RNA-binding Residues in Proteins Using Gradient Tree Boosting
    Liu, Diwei
    Tang, Yongjun
    Fan, Chao
    Chen, Zhigang
    Deng, Lei
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 47 - 52
  • [9] Prediction of RNA-Binding residues in protein sequences using support vector machines
    Wang, Liangjiang
    Brown, Susan J.
    [J]. 2006 28TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-15, 2006, : 2382 - +
  • [10] XPredRBR: Accurate and Fast Prediction of RNA-Binding Residues in Proteins Using eXtreme Gradient Boosting
    Deng, Lei
    Dong, Zuojin
    Liu, Hui
    [J]. BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2018, 2018, 10847 : 163 - 173