Context of deletions and insertions in human coding sequences

被引:62
|
作者
Kondrashov, AS [1 ]
Rogozin, IB [1 ]
机构
[1] NIH, Natl Ctr Biotechnol Informat, Bethesda, MD 20894 USA
关键词
mutation; hot spot; deletion; insertion; mutable motif; nucleotide context; repeat; microsatellite;
D O I
10.1002/humu.10312
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We studied the dependence of the rate of short deletions and insertions on their contexts using the data on mutations within coding exons at 19 human loci that cause mendelian diseases. We confirm that periodic sequences consisting of three to five or more nucleotides are mutagenic. Mutability of sequences with strongly biased nucleotide composition is also elevated, even when mutations within homonucleotide runs longer than three nucleotides are ignored. In contrast, no elevated mutation rates have been detected for imperfect direct or inverted repeats. Among known candidate contexts, the indel context GTAAGT and regions with purine-pyrimidine imbalance between the two DNA strands are mutagenic in our sample, and many others are not mutagenic. Data on mutation hot spots suggest two novel contexts that increase the deletion rate. Comprehensive analysis of mutability of all possible contexts of lengths four, six, and eight indicates a substantially elevated deletion rate within YYYTG and similar sequences, which is one of the two contexts revealed by the hot spots. Possible contexts that increase the insertion rate (AT(A/C)(A/C)GCC and TACCRC) and decrease deletion (TATCGC) or insertion (GCGG) rates have also been identified. Two-thirds of deletions remove a repeat, and over 80% of insertions create a repeat, i.e., they are duplications. Published 2003 Wiley-Liss, Inc.(dagger)
引用
收藏
页码:177 / 185
页数:9
相关论文
共 50 条
  • [1] Optimal Interactive Coding for Insertions, Deletions, and Substitutions
    Sherstov, Alexander A.
    Wu, Pei
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2019, 65 (10) : 5971 - 6000
  • [2] Optimal Interactive Coding for Insertions, Deletions, and Substitutions
    Sherstov, Alexander A.
    Wu, Pei
    2017 IEEE 58TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2017, : 240 - 251
  • [3] Coding for Interactive Communication Correcting Insertions and Deletions
    Braverman, Mark
    Gelles, Ran
    Mao, Jieming
    Ostrovsky, Rafail
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2017, 63 (10) : 6256 - 6270
  • [4] Human-specific insertions and deletions inferred from mammalian genome sequences
    Chen, Feng-Chi
    Chen, Chueng-Jong
    Li, Wen-Hsiung
    Chuang, Trees-Juen
    GENOME RESEARCH, 2007, 17 (01) : 16 - 22
  • [5] Coding for the Permutation Channel with Insertions, Deletions, Substitutions, and Erasures
    Kovacevic, Mladen
    Tan, Vincent Y. F.
    2017 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2017, : 1933 - 1937
  • [6] RECONSTRUCTING LATENT PERIODS IN GENOME SEQUENCES WITH INSERTIONS AND DELETIONS
    Arora, Raman
    Dewey, Colin
    Sethares, William A.
    2009 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS 2009), 2009, : 122 - +
  • [7] Occurrence and consequences of coding sequence insertions and deletions in mammalian genomes
    Taylor, MS
    Ponting, CP
    Copley, RR
    GENOME RESEARCH, 2004, 14 (04) : 555 - 566
  • [8] The Evolution of Small Insertions and Deletions in the Coding Genes of Drosophila melanogaster
    Chong, Zechen
    Zhai, Weiwei
    Li, Chunyan
    Gao, Min
    Gong, Qiang
    Ruan, Jue
    Li, Juan
    Jiang, Lan
    Lv, Xuemei
    Hungate, Eric
    Wu, Chung-I
    MOLECULAR BIOLOGY AND EVOLUTION, 2013, 30 (12) : 2699 - 2708
  • [9] Assessing Autosomal InDel Loci With Multiple Insertions or Deletions of Random DNA Sequences in Human Genome
    Yao, Yining
    Sun, Kuan
    Yang, Qinrui
    Zhou, Zhihan
    Shao, Chengchen
    Qian, Xiaoqin
    Tang, Qiqun
    Xie, Jianhui
    FRONTIERS IN GENETICS, 2022, 12
  • [10] Small insertions and deletions (INDELs) in human genomes
    Mullaney, Julienne M.
    Mills, Ryan E.
    Pittard, W. Stephen
    Devine, Scott E.
    HUMAN MOLECULAR GENETICS, 2010, 19 : R131 - R136