Codon and amino-acid distribution in DNA

被引:13
|
作者
Kim, JK
Yang, SI
Kwon, YH [1 ]
Lee, EI
机构
[1] Hanyang Univ, Dept Phys, Ansan 425791, Kyunggi Do, South Korea
[2] Hanyang Univ, Sch Elect & Comp Engn, Ansan 425791, Kyunggi Do, South Korea
[3] Univ Rochester, Dept Phys & Astron, Rochester, NY 14623 USA
[4] Korea Univ, Sch Med, Dept Prevent Med, Seoul 136713, South Korea
[5] Korea Univ, Sch Med, Inst Environm Hlth, Seoul 136713, South Korea
关键词
D O I
10.1016/j.chaos.2004.07.027
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
According to the Zipf's law, the distribution of rank-ordered frequency of words in the natural language can be modelled on the power law. In this paper, we examine the frequency distribution of 64 codons over the coding and non-coding regions of 88 DNA from EMBL and GenBank database, using exponential fitting. Also, we regard 20 amino-acids as vocabulary, perform the same frequency analysis to the same database and show that amino-acids can be used as biological meaningful words for Zipf's approach. Our analysis suggests that a natural language structure may exist not only in the coding region of DNA but in the non-coding one of DNA. (C) 2004 Published by Elsevier Ltd.
引用
收藏
页码:1795 / 1807
页数:13
相关论文
共 50 条