PST-PRNA: prediction of RNA-binding sites using protein surface topography and deep learning

被引:20
|
作者
Li, Pengpai [1 ]
Liu, Zhi-Ping [1 ]
机构
[1] Shandong Univ, Sch Control Sci & Engn, Dept Biomed Engn, Jinan 250061, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
DATABASE; RECOGNITION; GENERATION; FEATURES;
D O I
10.1093/bioinformatics/btac078
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Protein-RNA interactions play essential roles in many biological processes, including pre-mRNA processing, post-transcriptional gene regulation and RNA degradation. Accurate identification of binding sites on RNA-binding proteins (RBPs) is important for functional annotation and site-directed mutagenesis. Experimental assays to sparse RBPs are precise and convincing but also costly and time consuming. Therefore, flexible and reliable computational methods are required to recognize RNA-binding residues. Results: In this work, we propose PST-PRNA, a novel model for predicting RNA-binding sites (PRNA) based on protein surface topography (PST). Taking full advantage of the 3D structural information of protein, PST-PRNA creates representative topography images of the entire protein surface by mapping it onto a unit spherical surface. Four kinds of descriptors are encoded to represent residues on the surface. Then, the potential features are integrated and optimized by using deep learning models. We compile a comprehensive non-redundant RBP dataset to train and test PST-PRNA using 10-fold cross-validation. Numerous experiments demonstrate PST-PRNA learns successfully the latent structural information of protein surface. On the non-redundant dataset with sequence identity of 0.3, PST-PRNA achieves area under the receiver operating characteristic curves (AUC) value of 0.860 and Matthew's correlation coefficient value of 0.420. Furthermore, we construct a completely independent test dataset for justification and comparison. PST-PRNA achieves AUC value of 0.913 on the independent dataset, which is superior to the other state-of-the-art methods.
引用
收藏
页码:2162 / 2168
页数:7
相关论文
共 50 条
  • [1] Prediction of clustered RNA-binding protein motif sites in the mammalian genome
    Zhang, Chaolin
    Lee, Kuang-Yung
    Swanson, Maurice S.
    Darnell, Robert B.
    NUCLEIC ACIDS RESEARCH, 2013, 41 (14) : 6793 - 6807
  • [2] Computational Prediction of RNA-Binding Proteins and Binding Sites
    Si, Jingna
    Cui, Jing
    Cheng, Jin
    Wu, Rongling
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2015, 16 (11): : 26303 - 26317
  • [3] Prediction of RNA-binding sites from evolutionary information of protein sequences
    Tong, Jing
    Jiang, Peng
    Lu, Zu-Hong
    PROGRESS ON POST-GENOME TECHNOLOGIES, 2007, : 205 - 208
  • [4] RBPsuite: RNA-protein binding sites prediction suite based on deep learning
    Pan, Xiaoyong
    Fang, Yi
    Li, Xianfeng
    Yang, Yang
    Shen, Hong-Bin
    BMC GENOMICS, 2020, 21 (01)
  • [5] A Review About RNA-Protein-Binding Sites Prediction Based on Deep Learning
    Yan, Jianrong
    Zhu, Min
    IEEE ACCESS, 2020, 8 : 150929 - 150944
  • [6] RBPsuite: RNA-protein binding sites prediction suite based on deep learning
    Xiaoyong Pan
    Yi Fang
    Xianfeng Li
    Yang Yang
    Hong-Bin Shen
    BMC Genomics, 21
  • [7] rBPDL:Predicting RNA-Binding Proteins Using Deep Learning
    Niu, Mengting
    Wu, Jin
    Zou, Quan
    Liu, Zhendong
    Xu, Lei
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (09) : 3668 - 3676
  • [8] Protein-Specific Prediction of RNA-Binding Sites Based on Information Entropy
    Ji, Yue
    Bai, Lu
    Li, Menglong
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [9] DeepBtoD: Improved RNA-binding proteins prediction via integrated deep learning
    Du, XiuQuan
    Zhao, XiuJuan
    Zhang, YanPing
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2022, 20 (04)
  • [10] Improved discovery of RNA-binding protein binding sites in eCLIP data using DEWSeq
    Schwarzl, Thomas
    Sahadevan, Sudeep
    Lang, Benjamin
    Miladi, Milad
    Backofen, Rolf
    Huber, Wolfgang
    Hentze, Matthias W.
    Tartaglia, Gian Gaetano
    NUCLEIC ACIDS RESEARCH, 2024, 52 (01) : E1