Efficient mapping of RNA-binding residues in RNA-binding proteins using local sequence features of binding site residues in protein-RNA complexes

被引:2
|
作者
Agarwal, Ankita [1 ,2 ]
Kant, Shri [2 ]
Bahadur, Ranjit Prasad [2 ,3 ]
机构
[1] Indian Inst Technol Kharagpur, Sch Bio Sci, Kharagpur, India
[2] Indian Inst Technol Kharagpur, Dept Biotechnol, Computat Struct Biol Lab, Kharagpur, India
[3] Indian Inst Technol Kharagpur, Dept Biotechnol, Computat Struct Biol Lab, Kharagpur 721302, India
关键词
balanced random forest; machine learning; prediction; protein-RNA interactions; RNA-binding proteins; RNA-binding residues; PREDICTION; RECOGNITION; SVM; DNA; NUCLEOTIDES; SERVER;
D O I
10.1002/prot.26528
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Protein-RNA interactions play vital roles in plethora of biological processes such as regulation of gene expression, protein synthesis, mRNA processing and biogenesis. Identification of RNA-binding residues (RBRs) in proteins is essential to understand RNA-mediated protein functioning, to perform site-directed mutagenesis and to develop novel targeted drug therapies. Moreover, the extensive gap between sequence and structural data restricts the identification of binding sites in unsolved structures. However, efficient use of computational methods demanding only sequence to identify binding residues can bridge this huge sequence-structure gap. In this study, we have extensively studied protein-RNA interface in known RNA-binding proteins (RBPs). We find that the interface is highly enriched in basic and polar residues with Gly being the most common interface neighbor. We investigated several amino acid features and developed a method to predict putative RBRs from amino acid sequence. We have implemented balanced random forest (BRF) classifier with local residue features of protein sequences for prediction. With 5-fold cross-validations, the sequence pattern derived dipeptide composition based BRF model (DCP-BRF) resulted in an accuracy of 87.9%, specificity of 88.8%, sensitivity of 82.2%, Mathew's correlation coefficient of 0.60 and AUC of 0.93, performing better than few existing methods. We further validated our prediction model on known human RBPs through RBR prediction and could map similar to 54% of them. Further, knowledge of binding site preferences obtained from computational predictions combined with experimental validations of potential RNA binding sites can enhance our understanding of protein-RNA interactions. This may serve to accelerate investigations on functional roles of many novel RBPs.
引用
收藏
页码:1361 / 1379
页数:19
相关论文
共 50 条
  • [41] THE MULTIPLE RNA-BINDING DOMAINS OF THE MESSENGER-RNA POLY(A)-BINDING PROTEIN HAVE DIFFERENT RNA-BINDING ACTIVITIES
    BURD, CG
    MATUNIS, EL
    DREYFUSS, G
    MOLECULAR AND CELLULAR BIOLOGY, 1991, 11 (07) : 3419 - 3424
  • [42] RNA-binding protein kinetics
    Singh, Arunima
    NATURE METHODS, 2021, 18 (04) : 335 - 335
  • [43] Understanding DNA- and RNA-binding Proteins Using Sequence and Structural Features
    Carson, Matthew B.
    Langlois, Robert
    Lu, Hui
    BIOPHYSICAL JOURNAL, 2009, 96 (03) : 64A - 64A
  • [44] Survey of the binding preferences of RNA-binding proteins to RNA editing events
    Hu, Xiaolin
    Zou, Qin
    Yao, Li
    Yang, Xuerui
    GENOME BIOLOGY, 2022, 23 (01)
  • [45] Prediction of RNA-binding residues in proteins using the interaction propensities of amino acids and nucleotides
    Shrestha, Rojan
    Kim, Jisu
    Han, Kyungsook
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES, 2008, 5226 : 114 - +
  • [46] PredRBR: Accurate Prediction of RNA-binding Residues in Proteins Using Gradient Tree Boosting
    Liu, Diwei
    Tang, Yongjun
    Fan, Chao
    Chen, Zhigang
    Deng, Lei
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 47 - 52
  • [47] Survey of the binding preferences of RNA-binding proteins to RNA editing events
    Xiaolin Hu
    Qin Zou
    Li Yao
    Xuerui Yang
    Genome Biology, 23
  • [48] RNA-Binding Profiles of CKAP4 as an RNA-Binding Protein in Myocardial Tissues
    Zhu, Hong
    Zhang, Yanfeng
    Zhang, Chengliang
    Xie, Zhongshang
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2021, 8
  • [49] The RNA-binding protein AtGRP7 -: molecular characterisation and RNA-binding properties
    Schoening, J. C.
    Fuhrmann, A.
    Streitner, C.
    Lummer, M.
    Anselmetti, D.
    Ros, R.
    Staiger, D.
    FEBS JOURNAL, 2007, 274 : 87 - 87
  • [50] RNA-binding properties and mapping of the RNA-binding domain from the movement protein of Prunus necrotic ringspot virus
    Herranz, MC
    Pallás, V
    JOURNAL OF GENERAL VIROLOGY, 2004, 85 : 761 - 768