PST-PRNA: prediction of RNA-binding sites using protein surface topography and deep learning

被引:20
|
作者
Li, Pengpai [1 ]
Liu, Zhi-Ping [1 ]
机构
[1] Shandong Univ, Sch Control Sci & Engn, Dept Biomed Engn, Jinan 250061, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
DATABASE; RECOGNITION; GENERATION; FEATURES;
D O I
10.1093/bioinformatics/btac078
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Protein-RNA interactions play essential roles in many biological processes, including pre-mRNA processing, post-transcriptional gene regulation and RNA degradation. Accurate identification of binding sites on RNA-binding proteins (RBPs) is important for functional annotation and site-directed mutagenesis. Experimental assays to sparse RBPs are precise and convincing but also costly and time consuming. Therefore, flexible and reliable computational methods are required to recognize RNA-binding residues. Results: In this work, we propose PST-PRNA, a novel model for predicting RNA-binding sites (PRNA) based on protein surface topography (PST). Taking full advantage of the 3D structural information of protein, PST-PRNA creates representative topography images of the entire protein surface by mapping it onto a unit spherical surface. Four kinds of descriptors are encoded to represent residues on the surface. Then, the potential features are integrated and optimized by using deep learning models. We compile a comprehensive non-redundant RBP dataset to train and test PST-PRNA using 10-fold cross-validation. Numerous experiments demonstrate PST-PRNA learns successfully the latent structural information of protein surface. On the non-redundant dataset with sequence identity of 0.3, PST-PRNA achieves area under the receiver operating characteristic curves (AUC) value of 0.860 and Matthew's correlation coefficient value of 0.420. Furthermore, we construct a completely independent test dataset for justification and comparison. PST-PRNA achieves AUC value of 0.913 on the independent dataset, which is superior to the other state-of-the-art methods.
引用
收藏
页码:2162 / 2168
页数:7
相关论文
共 50 条
  • [31] Deep structural insights into RNA-binding disordered protein regions
    Zeke, Andras
    Schad, Eva
    Horvath, Tamas
    Abukhairan, Rawan
    Szabo, Beata
    Tantos, Agnes
    WILEY INTERDISCIPLINARY REVIEWS-RNA, 2022, 13 (05)
  • [32] RNA-binding residues prediction using structural features
    Ren, Huizhu
    Shen, Ying
    BMC BIOINFORMATICS, 2015, 16
  • [33] Deep neural networks for inferring binding sites of RNA-binding proteins by using distributed representations of RNA primary sequence and secondary structure
    Lei Deng
    Youzhi Liu
    Yechuan Shi
    Wenhao Zhang
    Chun Yang
    Hui Liu
    BMC Genomics, 21
  • [34] Deep neural networks for inferring binding sites of RNA-binding proteins by using distributed representations of RNA primary sequence and secondary structure
    Deng, Lei
    Liu, Youzhi
    Shi, Yechuan
    Zhang, Wenhao
    Yang, Chun
    Liu, Hui
    BMC GENOMICS, 2020, 21 (Suppl 13)
  • [35] Prediction of Protein-DNA Binding Sites Based on Protein Language Model and Deep Learning
    Shan, Kaixuan
    Zhang, Xiankun
    Song, Chen
    ADVANCED INTELLIGENT COMPUTING IN BIOINFORMATICS, PT II, ICIC 2024, 2024, 14882 : 314 - 325
  • [36] PRBP: Prediction of RNA-Binding Proteins Using a Random Forest Algorithm Combined with an RNA-Binding Residue Predictor
    Ma, Xin
    Guo, Jing
    Xiao, Ke
    Sun, Xiao
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (06) : 1385 - 1393
  • [37] Modeling RNA-Binding Protein Specificity In Vivo by Precisely Registering Protein-RNA Crosslink Sites
    Feng, Huijuan
    Bao, Suying
    Rahman, Mohammad Alinoor
    Weyn-Vanhentenryck, Sebastien M.
    Khan, Aziz
    Wong, Justin
    Shah, Ankeeta
    Flynn, Elise D.
    Krainer, Adrian R.
    Zhang, Chaolin
    MOLECULAR CELL, 2019, 74 (06) : 1189 - +
  • [38] RISP: A web-based server for prediction of RNA-binding sites in proteins
    Tong, Jing
    Jiang, Peng
    Lu, Zu-hong
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2008, 90 (02) : 148 - 153
  • [39] Analysis of the binding of ligands to large numbers of sites: The binding of tryptophan to the 11 sites of the trp RNA-binding attenuation protein
    Saroff, HA
    Kiefer, JE
    ANALYTICAL BIOCHEMISTRY, 1997, 247 (01) : 138 - 142
  • [40] Prediction of Transcription Factor Binding Sites Using a Combined Deep Learning Approach
    Cao, Linan
    Liu, Pei
    Chen, Jialong
    Deng, Lei
    FRONTIERS IN ONCOLOGY, 2022, 12