Scaling behaviors of CG clusters in coding and noncoding DNA sequences

Cited by: 5
Authors
Zhang, LX [1]
Chen, J [1]
Affiliations
[1] Wenzhou Normal Coll, Dept Phys, Wenzhou 325027, Peoples R China
DOI
10.1016/j.chaos.2004.07.013
Chinese Library Classification number
O1 [Mathematics];
Discipline classification code
0701 ; 070101 ;
Abstract
This paper investigates the statistical properties of CG clusters in coding and noncoding DNA sequences by calculating the cluster-size distribution of CG clusters, $P(S)$, and the breadth of the distribution of the root-mean-square size of CG clusters, $\sigma(m)$, in consecutive, non-overlapping blocks of $m$ bases. Coding and noncoding sequences differ in several respects. For both, $P(S)$ follows an exponential decay, $P(S) \propto e^{-\alpha S}$. For coding sequences, the value of $\alpha$ depends on the percentage of C-G content and is well fitted by a straight line; for noncoding sequences no such regular linear relation holds. The quantity $\xi(m) = \sigma(m)/\sqrt{m}$ obeys a power-law decay, $\xi(m) \propto m^{-\gamma}$, in both coding and noncoding sequences, with $\gamma = 0.949 \pm 0.014$ for coding and $\gamma = 0.826 \pm 0.011$ for noncoding sequences. Coding and noncoding sequences can therefore be distinguished on the basis of the value of $\gamma$. The same power law is also examined for random sequences, for which $\gamma$ is very close to 1.00. The value of $\gamma$ for coding sequences is thus closer to that of a random sequence, which indicates that coding sequences behave more like random sequences than noncoding sequences do. This investigation can provide some insights into DNA sequences. (C) 2004 Elsevier Ltd. All rights reserved.
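The two quantities named in the abstract can be illustrated with a minimal Python sketch. The abstract does not define a CG cluster or $\sigma(m)$ precisely, so the sketch rests on assumptions that may differ from the paper: a CG cluster is taken to be a maximal run of consecutive C or G bases, its size $S$ is the run length, and $\sigma(m)$ is taken to be the standard deviation, across blocks, of the per-block root-mean-square cluster size. All function names and the `min_count` cutoff are hypothetical.

```python
import math
import statistics
from collections import Counter


def cg_cluster_sizes(seq):
    """Lengths of maximal runs of C/G bases (ASSUMED cluster definition)."""
    sizes, run = [], 0
    for base in seq.upper():
        if base in "CG":
            run += 1
        elif run:
            sizes.append(run)
            run = 0
    if run:
        sizes.append(run)
    return sizes


def fit_line(xs, ys):
    """Ordinary least-squares line through (xs, ys); returns (slope, intercept)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx


def fit_alpha(seq, min_count=10):
    """Estimate alpha in P(S) ~ exp(-alpha * S) from a fit of ln P(S) against S.

    Sizes seen fewer than min_count times are dropped (an ASSUMED cutoff that
    keeps the noisy tail from dominating the unweighted fit).
    """
    sizes = cg_cluster_sizes(seq)
    counts = Counter(sizes)
    kept = sorted(s for s, c in counts.items() if c >= min_count)
    log_p = [math.log(counts[s] / len(sizes)) for s in kept]
    slope, _ = fit_line(kept, log_p)
    return -slope


def xi(seq, m):
    """xi(m) = sigma(m) / sqrt(m) over consecutive, non-overlapping blocks of m bases.

    sigma(m) is ASSUMED to be the standard deviation of the per-block
    root-mean-square cluster size; a block with no cluster contributes 0.
    """
    rms_values = []
    for start in range(0, len(seq) - m + 1, m):
        sizes = cg_cluster_sizes(seq[start:start + m])
        mean_sq = sum(s * s for s in sizes) / len(sizes) if sizes else 0.0
        rms_values.append(math.sqrt(mean_sq))
    return statistics.pstdev(rms_values) / math.sqrt(m)


def fit_gamma(seq, block_sizes):
    """Estimate gamma in xi(m) ~ m^(-gamma) from a log-log fit."""
    log_m = [math.log(m) for m in block_sizes]
    log_xi = [math.log(xi(seq, m)) for m in block_sizes]
    slope, _ = fit_line(log_m, log_xi)
    return -slope
```

As a usage example, calling `fit_alpha(seq)` and `fit_gamma(seq, [64, 128, 256, 512])` on a long string `seq` over the alphabet A, C, G, T returns the two fitted exponents. Under these assumed definitions, a uniformly random sequence should give $\gamma$ near 1, in line with the random-sequence result reported in the abstract.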
Pages: 115 - 123
Number of pages: 9
Related papers
50 in total
  • [41] Finding the coding frame on DNA sequences
    Hatzigeorgiou, A
    Papanikolaou, H
    Reczko, M
    COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION - NEURAL NETWORKS & ADVANCED CONTROL STRATEGIES, 1999, 54 : 148 - 153
  • [42] Toward Identification of Functional Sequences and Variants in Noncoding DNA
    Monti, Remo
    Ohler, Uwe
    ANNUAL REVIEW OF BIOMEDICAL DATA SCIENCE, 2023, 6 : 191 - 210
  • [43] Coding DNA sequences: statistical distributions
    Som, A
    Sahoo, S
    Chakrabarti, J
    MATHEMATICAL BIOSCIENCES, 2003, 183 (01) : 49 - 61
  • [44] Coarsening of granular clusters: Two types of scaling behaviors
    Sapozhnikov, MV
    Aranson, IS
    Olafsen, JS
    PHYSICAL REVIEW E, 2003, 67 (01):
  • [45] Evolution of coding and noncoding genomic sequences shared by humans and great apes
    Saber, Morteza Mahmoudi
    Saitou, Naruya
    GENES & GENETIC SYSTEMS, 2016, 91 (06) : 345 - 345
  • [46] Multifractal characterisation of length sequences of coding and noncoding segments in a complete genome
    Yu, ZG
    Anh, V
    Lau, KS
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2001, 301 (1-4) : 351 - 361
  • [47] Emergence and Evolution of Hominidae-Specific Coding and Noncoding Genomic Sequences
    Saber, Morteza Mahmoudi
    Babarinde, Isaac Adeyemi
    Hettiarachchi, Nilmini
    Saitou, Naruya
    GENOME BIOLOGY AND EVOLUTION, 2016, 8 (07) : 2076 - 2092
  • [48] NONRANDOM PATTERNS OF SIMPLE AND CRYPTIC TRIPLET REPEATS IN CODING AND NONCODING SEQUENCES
    RICKE, DO
    LIU, Q
    GOSTOUT, B
    SOMMER, SS
    GENOMICS, 1995, 26 (03) : 510 - 520
  • [49] Nucleosomal signatures impose nucleosome positioning in coding and noncoding sequences in the genome
    Gonzalez, Sara
    Garcia, Alicia
    Vazquez, Enrique
    Serrano, Rebeca
    Sanchez, Mar
    Quintales, Luis
    Antequera, Francisco
    GENOME RESEARCH, 2016, 26 (11) : 1532 - 1543
  • [50] Distinguish coding and noncoding sequences in a complete genome using Fourier transform
    Zhou, Yu
    Zhou, Li-Qian
    Yu, Zu-Guo
    Anh, Vo
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2007, : 295 - +