Scaling properties of coding and non-coding DNA sequences

被引:44
|
作者
Provata, A [1 ]
Almirantis, Y
机构
[1] NRCPS Demokritos, Inst Phys Chem, Mol Modelling Mat Lab, Athens 15310, Greece
[2] NRCPS Demokritos, Inst Biol, Athens 15310, Greece
来源
PHYSICA A | 1997年 / 247卷 / 1-4期
关键词
power law; long-range correlations; exponential decay; coding/non-coding DNA sequences; random walks;
D O I
10.1016/S0378-4371(97)00424-X
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
We study the size distribution of purine and pyrimidine clusters in coding and non-coding DNA sequences. We observe that the cluster-size distribution P(s) follows an exponential decay in coding sequences whereas it follows a power-law decay in non-coding sequences: P(s) similar to s(-1)-(mu), with a power exponent mu = 1.5-1.8. The mean-square displacement sigma(2)(m) is examined via a cluster walk model, with step-size distribution following P(s) and with m denoting the number of clusters covered by the walker. The behaviour of the mean-square displacement is sigma(2)(m) similar to m(2/mu) for non-coding sequences and sigma(2)(m) similar to m for coding sequences. We associate the power-law behaviour in the non-coding with the tendency of large Pu and Py cluster formation which dominate the non-coding. Under this observation the entire DNA sequence may be regarded as a collection of extended non-coding regions interrupted by small coding regions. We recall that this irregular composition of DNA, is of vital importance for the living organisms: Transposable elements and other "parasite" DNA which try to incorporate themselves into the DNA chain most probably intersect the large non-coding regions, thus leaving the organism unaffected, as is well known to biologists.
引用
收藏
页码:482 / 496
页数:15
相关论文
共 50 条