D-ORB: A Web Server to Extract Structural Features of Related But Unaligned RNA Sequences

被引:1
|
作者
Dupont, Mathieu J.
Major, Francois [1 ]
机构
[1] Univ Montreal, Dept Comp Sci & Operat Res, Montreal, PQ H3C 3J7, Canada
基金
加拿大自然科学与工程研究理事会; 加拿大健康研究院;
关键词
RNA structure; RNA family; Artificial intelligence; Motif identification; Structural composition; ALGORITHM; SHAPES; SITE;
D O I
10.1016/j.jmb.2023.168181
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Identifying the common structural elements of functionally related RNA sequences (family) is usually based on an alignment of the sequences, which is often subject to human bias and may not be accurate. The resulting covariance model (CM) provides probabilities for each base to covary with another, which allows to support evolutionarily the formation of double helical regions and possibly pseudoknots. The coexistence of alternative folds in RNA, resulting from its dynamic nature, may lead to the potential omis-sion of motifs by CM. To overcome this limitation, we present D-ORB, a system of algorithms that iden-tifies overrepresented motifs in the secondary conformational landscapes of a family when compared to those of unrelated sequences. The algorithms are bundled into an easy-to-use website allowing users to submit a family, and optionally provide unrelated sequences. D-ORB produces a non-pseudoknotted sec-ondary structure based on the overrepresented motifs, a deep neural network classifier and two decision trees. When used to model an Rfam family, D-ORB fits overrepresented motifs in the corresponding Rfam structure; more than a hundred Rfam families have been modeled. The statistical approach behind D-ORB derives the structural composition of an RNA family, making it a valuable tool for analyzing and mod-eling it. Its easy-to-use interface and advanced algorithms make it an essential resource for researchers studying RNA structure. D-ORB is available at https://d-orb.major.iric.ca/.& COPY; 2023 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://crea-tivecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页数:13
相关论文
共 50 条
  • [21] iFeature: a Python']Python package and web server for features extraction and selection from protein and peptide sequences
    Chen, Zhen
    Zhao, Pei
    Li, Fuyi
    Leier, Andre
    Marquez-Lago, Tatiana T.
    Wang, Yanan
    Webb, Geoffrey I.
    Smith, A. Ian
    Daly, Roger J.
    Chou, Kuo-Chen
    Song, Jiangning
    BIOINFORMATICS, 2018, 34 (14) : 2499 - 2502
  • [22] RNAssess-a web server for quality assessment of RNA 3D structures
    Lukasiak, Piotr
    Antczak, Maciej
    Ratajczak, Tomasz
    Szachniuk, Marta
    Popenda, Mariusz
    Adamiak, Ryszard W.
    Blazewicz, Jacek
    NUCLEIC ACIDS RESEARCH, 2015, 43 (W1) : W502 - W506
  • [23] SimRNAweb: a web server for RNA 3D structure modeling with optional restraints
    Magnus, Marcin
    Boniecki, Michal J.
    Dawson, Wayne
    Bujnicki, Janusz M.
    NUCLEIC ACIDS RESEARCH, 2016, 44 (W1) : W315 - W319
  • [24] Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences
    Liu, Bin
    Liu, Fule
    Wang, Xiaolong
    Chen, Junjie
    Fang, Longyun
    Chou, Kuo-Chen
    NUCLEIC ACIDS RESEARCH, 2015, 43 (W1) : W65 - W71
  • [25] m5UPred: A Web Server for the Prediction of RNA 5-Methyluridine Sites from Sequences
    Jiang, Jie
    Song, Bowen
    Tang, Yujiao
    Chen, Kunqi
    Wei, Zhen
    Meng, Jia
    MOLECULAR THERAPY-NUCLEIC ACIDS, 2020, 22 : 742 - 747
  • [26] Structural signatures: a web server for exploring a database of and generating protein structural features from human cell lines and tissues
    Zatorski, Nicole
    Stein, David
    Rahman, Rayees
    Iyengar, Ravi
    Schlessinger, Avner
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2022, 2022
  • [27] PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence
    Li, Z. R.
    Lin, H. H.
    Han, L. Y.
    Jiang, L.
    Chen, X.
    Chen, Y. Z.
    NUCLEIC ACIDS RESEARCH, 2006, 34 : W32 - W37
  • [28] MODEL - Molecular descriptor lab: A web-based server for computing structural and physicochemical features of compounds
    Li, Z. R.
    Han, L. Y.
    Xue, Y.
    Yap, C. W.
    Li, H.
    Jiang, L.
    Chen, Y. Z.
    BIOTECHNOLOGY AND BIOENGINEERING, 2007, 97 (02) : 389 - 396
  • [29] 3dRPC: a web server for 3D RNA-protein structure prediction
    Huang, Yangyu
    Li, Haotian
    Xiao, Yi
    BIOINFORMATICS, 2018, 34 (07) : 1238 - 1240
  • [30] CORDAX web server: an online platform for the prediction and 3D visualization of aggregation motifs in protein sequences
    Louros, Nikolaos
    Rousseau, Frederic
    Schymkowitz, Joost
    BIOINFORMATICS, 2024, 40 (05)