D-ORB: A Web Server to Extract Structural Features of Related But Unaligned RNA Sequences

被引:1
|
作者
Dupont, Mathieu J.
Major, Francois [1 ]
机构
[1] Univ Montreal, Dept Comp Sci & Operat Res, Montreal, PQ H3C 3J7, Canada
基金
加拿大自然科学与工程研究理事会; 加拿大健康研究院;
关键词
RNA structure; RNA family; Artificial intelligence; Motif identification; Structural composition; ALGORITHM; SHAPES; SITE;
D O I
10.1016/j.jmb.2023.168181
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Identifying the common structural elements of functionally related RNA sequences (family) is usually based on an alignment of the sequences, which is often subject to human bias and may not be accurate. The resulting covariance model (CM) provides probabilities for each base to covary with another, which allows to support evolutionarily the formation of double helical regions and possibly pseudoknots. The coexistence of alternative folds in RNA, resulting from its dynamic nature, may lead to the potential omis-sion of motifs by CM. To overcome this limitation, we present D-ORB, a system of algorithms that iden-tifies overrepresented motifs in the secondary conformational landscapes of a family when compared to those of unrelated sequences. The algorithms are bundled into an easy-to-use website allowing users to submit a family, and optionally provide unrelated sequences. D-ORB produces a non-pseudoknotted sec-ondary structure based on the overrepresented motifs, a deep neural network classifier and two decision trees. When used to model an Rfam family, D-ORB fits overrepresented motifs in the corresponding Rfam structure; more than a hundred Rfam families have been modeled. The statistical approach behind D-ORB derives the structural composition of an RNA family, making it a valuable tool for analyzing and mod-eling it. Its easy-to-use interface and advanced algorithms make it an essential resource for researchers studying RNA structure. D-ORB is available at https://d-orb.major.iric.ca/.& COPY; 2023 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://crea-tivecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Rtools: a web server for various secondary structural analyses on single RNA sequences
    Hamada, Michiaki
    Ono, Yukiteru
    Kiryu, Hisanori
    Sato, Kengo
    Kato, Yuki
    Fukunaga, Tsukasa
    Mori, Ryota
    Asai, Kiyoshi
    NUCLEIC ACIDS RESEARCH, 2016, 44 (W1) : W302 - W307
  • [2] RNAbor: a web server for RNA structural neighbors
    Freyhult, Eva
    Moulton, Vincent
    Clote, Peter
    NUCLEIC ACIDS RESEARCH, 2007, 35 : W305 - W309
  • [3] WebSTAR3D: a web server for RNA 3D structural alignment
    Holzhauser, Erwin
    Ge, Ping
    Zhang, Shaojie
    BIOINFORMATICS, 2016, 32 (23) : 3673 - 3675
  • [4] FRASS: the web-server for RNA structural comparison
    Svetlana Kirillova
    Silvio CE Tosatto
    Oliviero Carugo
    BMC Bioinformatics, 11
  • [5] FRASS: the web-server for RNA structural comparison
    Kirillova, Svetlana
    Tosatto, Silvio C. E.
    Carugo, Oliviero
    BMC BIOINFORMATICS, 2010, 11
  • [6] Vfold-Pipeline: a web server for RNA 3D structure prediction from sequences
    Li, Jun
    Zhang, Sicheng
    Zhang, Dong
    Chen, Shi-Jie
    BIOINFORMATICS, 2022, 38 (16) : 4042 - 4043
  • [7] incaRNAfbinv: a web server for the fragment-based design of RNA sequences
    Retwitzer, Matan Drory
    Reinharz, Vladimir
    Ponty, Yann
    Waldispuhl, Jerome
    Barash, Danny
    NUCLEIC ACIDS RESEARCH, 2016, 44 (W1) : W308 - W314
  • [8] repRNA: a web server for generating various feature vectors of RNA sequences
    Bin Liu
    Fule Liu
    Longyun Fang
    Xiaolong Wang
    Kuo-Chen Chou
    Molecular Genetics and Genomics, 2016, 291 : 473 - 481
  • [9] repRNA: a web server for generating various feature vectors of RNA sequences
    Liu, Bin
    Liu, Fule
    Fang, Longyun
    Wang, Xiaolong
    Chou, Kuo-Chen
    MOLECULAR GENETICS AND GENOMICS, 2016, 291 (01) : 473 - 481
  • [10] RepEx: A web server to extract sequence repeats from protein and DNA sequences
    Michael, Daliah
    Gurusaran, M.
    Santhosh, R.
    Hussain, Md. Khaja
    Satheesh, S. N.
    Suhan, S.
    Sivaranjan, P.
    Jaiswal, Akanksha
    Sekar, K.
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2019, 78 : 424 - 430