RNA SEQUENCE-ANALYSIS USING COVARIANCE-MODELS

被引:500
|
作者
EDDY, SR
DURBIN, R
机构
[1] MRC Laboratory of Molecular Biology, Cambridge CB2 2QH, Hills Road
关键词
D O I
10.1093/nar/22.11.2079
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We describe a general approach to several RNA sequence analysis problems using probabilistic models that flexibly describe the secondary structure and primary sequence consensus of an RNA sequence family. We call these models 'covariance models'. A covariance model of tRNA sequences is an extremely sensitive and discriminative tool for searching for additional tRNAs and tRNA-related sequences in sequence databases. A model can be built automatically from an existing sequence alignment. We also describe an algorithm for learning a model and hence a consensus secondary structure from initially unaligned example sequences and no prior structural information. Models trained on unaligned tRNA examples correctly predict tRNA scondary structure and produce high-quality multiple alignments. The approach may be applied to any family of small RNA sequences.
引用
收藏
页码:2079 / 2088
页数:10
相关论文
共 50 条