STATISTICAL ANALYSIS OF NUCLEOTIDE SEQUENCES

Cited by: 32
Authors
STUCKLE, EE
EMMRICH, C
GROB, U
NIELSEN, PJ
Affiliations
[1] MAX PLANCK INST IMMUNBIOL, STUBEWEG 51, W-7800 FREIBURG, GERMANY
[2] UNIV FREIBURG, FAK PHYS, W-7800 FREIBURG, GERMANY
DOI
10.1093/nar/18.22.6641
Chinese Library Classification (CLC)
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
In order to scan nucleic acid databases for potentially relevant but as yet unknown signals, we have developed an improved statistical model for pattern analysis of nucleic acid sequences by modifying previous methods based on Markov chains. We demonstrate the importance of selecting the appropriate parameters in order for the method to function at all. The model allows the simultaneous analysis of several short sequences with unequal base frequencies and Markov order k≠0 as is usually the case in databases. As a test of these modifications, we show that in E. coli sequences there is a bias against palindromic hexamers which correspond to known restriction enzyme recognition sites. © 1990 Oxford University Press.
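The abstract does not give the model's formulas, so the following is only a minimal sketch of the general idea, assuming the textbook Markov-chain estimator for the expected count of a word (the product of its overlapping (k+1)-mer counts divided by the product of its interior k-mer counts). It is not the authors' modified model. The function names `is_palindromic`, `count_kmers`, and `expected_count` are hypothetical, and counts are pooled over several sequences without spanning sequence boundaries.

```python
from collections import Counter

# Translation table for complementing DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def is_palindromic(word):
    """Return True if the word equals its own reverse complement (e.g. GAATTC)."""
    return word == word.translate(COMPLEMENT)[::-1]


def count_kmers(sequences, n):
    """Count overlapping n-mers pooled over all sequences.

    Windows never span the boundary between two sequences.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - n + 1):
            counts[seq[i:i + n]] += 1
    return counts


def expected_count(word, sequences, k):
    """Expected occurrences of `word` under an order-k Markov chain.

    Assumption: the standard estimator, i.e. the product of the counts of
    the overlapping (k+1)-mers in the word divided by the product of the
    counts of its interior k-mers. Only 1 <= k <= len(word) - 2 is handled.
    """
    m = len(word)
    if not 1 <= k <= m - 2:
        raise ValueError("k must satisfy 1 <= k <= len(word) - 2")
    big = count_kmers(sequences, k + 1)
    small = count_kmers(sequences, k)
    expected = 1.0
    for i in range(m - k):
        expected *= big[word[i:i + k + 1]]
    if expected == 0.0:
        # A missing (k+1)-mer means the word cannot occur under the model.
        return 0.0
    for i in range(1, m - k):
        expected /= small[word[i:i + k]]
    return expected
```

Comparing the observed count of a palindromic hexamer with `expected_count(hexamer, sequences, k)` gives a rough indication of over- or under-representation. Judging whether a deviation is significant would require a variance estimate, which this sketch does not attempt.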
Pages: 6641-6647
Page count: 7