Multiple Sequence Alignments Enhance Boundary Definition of RNA Structures

被引:2
|
作者
Sabarinathan, Radhakrishnan [1 ,2 ]
Anthon, Christian [1 ]
Gorodkin, Jan [1 ]
Seemann, Stefan E. [1 ]
机构
[1] Univ Copenhagen, Dept Vet & Anim Sci, Ctr Noncoding RNA Technol & Hlth, Gronnegardsvej 3, DK-1870 Frederiksberg C, Denmark
[2] Tata Inst Fundamental Res, Natl Ctr Biol Sci, Bangalore 560065, Karnataka, India
来源
GENES | 2018年 / 9卷 / 12期
关键词
RNA secondary structure; RNA structure boundary; RNA domain; non-coding RNA gene finder; SECONDARY STRUCTURE; IDENTIFICATION; PROBABILITIES; PREDICTION;
D O I
10.3390/genes9120604
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Self-contained structured domains of RNA sequences have often distinct molecular functions. Determining the boundaries of structured domains of a non-coding RNA (ncRNA) is needed for many ncRNA gene finder programs that predict RNA secondary structures in aligned genomes because these methods do not necessarily provide precise information about the boundaries or the location of the RNA structure inside the predicted ncRNA. Even without having a structure prediction, it is of interest to search for structured domains, such as for finding common RNA motifs in RNA-protein binding assays. The precise definition of the boundaries are essential for downstream analyses such as RNA structure modelling, e.g., through covariance models, and RNA structure clustering for the search of common motifs. Such efforts have so far been focused on single sequences, thus here we present a comparison for boundary definition between single sequence and multiple sequence alignments. We also present a novel approach, named RNAbound, for finding the boundaries that are based on probabilities of evolutionarily conserved base pairings. We tested the performance of two different methods on a limited number of Rfam families using the annotated structured RNA regions in the human genome and their multiple sequence alignments created from 14 species. The results show that multiple sequence alignments improve the boundary prediction for branched structures compared to single sequences independent of the chosen method. The actual performance of the two methods differs on single hairpin structures and branched structures. For the RNA families with branched structures, including transfer RNA (tRNA) and small nucleolar RNAs (snoRNAs), RNAbound improves the boundary predictions using multiple sequence alignments to median differences of -6 and -11.5 nucleotides (nts) for left and right boundary, respectively (window size of 200 nts).
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Hierarchical folding of multiple sequence alignments for the prediction of structures and RNA-RNA interactions
    Stefan E Seemann
    Andreas S Richter
    Jan Gorodkin
    Rolf Backofen
    [J]. Algorithms for Molecular Biology, 5
  • [2] Hierarchical folding of multiple sequence alignments for the prediction of structures and RNA-RNA interactions
    Seemann, Stefan E.
    Richter, Andreas S.
    Gorodkin, Jan
    Backofen, Rolf
    [J]. ALGORITHMS FOR MOLECULAR BIOLOGY, 2010, 5
  • [3] Direct RNA motif definition and identification from multiple sequence alignments using secondary structure profiles
    Gautheret, D
    Lambert, A
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2001, 313 (05) : 1003 - 1011
  • [4] RNA-RNA interaction prediction based on multiple sequence alignments
    Li, Andrew X.
    Marz, Manja
    Qin, Jing
    Reidys, Christian M.
    [J]. BIOINFORMATICS, 2011, 27 (04) : 456 - 463
  • [5] Alignments of RNA Structures
    Blin, Guillaume
    Denise, Alain
    Dulucq, Serge
    Herrbach, Claire
    Touzet, Helene
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2010, 7 (02) : 309 - 322
  • [6] Multiple sequence alignments
    Wallace, IM
    Blackshields, G
    Higgins, DG
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 2005, 15 (03) : 261 - 266
  • [7] MSARI: Multiple sequence alignments for statistical detection of RNA secondary structure
    Coventry, A
    Kleitman, DJ
    Berger, B
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (33) : 12102 - 12107
  • [8] Multithreaded multiple sequence alignments
    Bai, Joanne
    Rezael, Siamak
    [J]. 2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 2863 - 2866
  • [9] Building multiple sequence alignments with a flavor of HSSP alignments
    Higa, Roberto Hiroshi
    Braga da Cruz, Sergio Aparecido
    Kuser, Paula Regina
    Beleza Yamagishi, Michel Eduardo
    Fileto, Renato
    de Medeiros Oliveira, Stanley Robson
    Mazoni, Ivan
    dos Santos, Edgard Henrique
    Mancini, Adauto Luiz
    Neshich, Goran
    [J]. GENETICS AND MOLECULAR RESEARCH, 2006, 5 (01): : 127 - 137
  • [10] Markovian structures in biological sequence alignments
    Liu, JS
    Neuwald, AF
    Lawrence, CE
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (445) : 1 - 15