Improved Covariance Model Parameter Estimation Using RNA Thermodynamic Properties

被引:0
|
作者
Smith, Scott F. [1 ]
Wiese, Kay C. [2 ]
机构
[1] Boise State Univ, ECE Dept, Boise, ID 83725 USA
[2] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
关键词
Bioinformatics; Covariance models; RNA secondary structure; Database search; Non-coding RNA gene search;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Covariance models are a powerful description of non-coding RNA (ncRNA) families that can be used to search nucleotide databases for new members of these ncRNA families. Currently, estimation of the parameters of a covariance model (state transition and emission scores) is based only on the observed frequencies of mutations, insertions, and deletions in known ncRNA sequences. For families with very few known members, this can result in rather uninformative models where the consensus sequence has a good score and most deviations from consensus have a fairly uniform poor score. It is proposed here to combine the traditional observed-frequency information with known information about free energy changes in RNA helix formation and loop length changes. More thermodynamically probable deviations from the consensus sequence will then be favored in database search. The thermodynamic information may be incorporated into the models as informative priors that depend on neighboring consensus nucleotides and on loop lengths.
引用
收藏
页码:176 / +
页数:2
相关论文
共 50 条