Context of deletions and insertions in human coding sequences

被引:62
|
作者
Kondrashov, AS [1 ]
Rogozin, IB [1 ]
机构
[1] NIH, Natl Ctr Biotechnol Informat, Bethesda, MD 20894 USA
关键词
mutation; hot spot; deletion; insertion; mutable motif; nucleotide context; repeat; microsatellite;
D O I
10.1002/humu.10312
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We studied the dependence of the rate of short deletions and insertions on their contexts using the data on mutations within coding exons at 19 human loci that cause mendelian diseases. We confirm that periodic sequences consisting of three to five or more nucleotides are mutagenic. Mutability of sequences with strongly biased nucleotide composition is also elevated, even when mutations within homonucleotide runs longer than three nucleotides are ignored. In contrast, no elevated mutation rates have been detected for imperfect direct or inverted repeats. Among known candidate contexts, the indel context GTAAGT and regions with purine-pyrimidine imbalance between the two DNA strands are mutagenic in our sample, and many others are not mutagenic. Data on mutation hot spots suggest two novel contexts that increase the deletion rate. Comprehensive analysis of mutability of all possible contexts of lengths four, six, and eight indicates a substantially elevated deletion rate within YYYTG and similar sequences, which is one of the two contexts revealed by the hot spots. Possible contexts that increase the insertion rate (AT(A/C)(A/C)GCC and TACCRC) and decrease deletion (TATCGC) or insertion (GCGG) rates have also been identified. Two-thirds of deletions remove a repeat, and over 80% of insertions create a repeat, i.e., they are duplications. Published 2003 Wiley-Liss, Inc.(dagger)
引用
收藏
页码:177 / 185
页数:9
相关论文
共 50 条
  • [31] Insertions and Deletions Target Lineage-Defining Genes in Human Cancers
    Imielinski, Marcin
    Guo, Guangwu
    Meyerson, Matthew
    CELL, 2017, 168 (03) : 460 - +
  • [32] Functional constraint and small insertions and deletions in the ENCODE regions of the human genome
    Taane G Clark
    Toby Andrew
    Gregory M Cooper
    Elliott H Margulies
    James C Mullikin
    David J Balding
    Genome Biology, 8
  • [33] Small Insertions Are More Deleterious than Small Deletions in Human Genomes
    Huang, Shengfeng
    Li, Jie
    Xu, Anlong
    Huang, Guangrui
    You, Leiming
    HUMAN MUTATION, 2013, 34 (12) : 1642 - 1649
  • [34] Amino acid insertions and deletions contribute to diversify the human Ig repertoire
    Wilson, P
    Liu, YJ
    Banchereau, J
    Capra, JD
    Pascual, V
    IMMUNOLOGICAL REVIEWS, 1998, 162 : 143 - 151
  • [35] Characterization of frequencies and distribution of single nucleotide insertions/deletions in the human genome
    Tan, Ene-Choo
    Li, Haixia
    GENE, 2006, 376 (02) : 268 - 280
  • [36] Functional constraint and small insertions and deletions in the ENCODE regions of the human genome
    Clark, Taane G.
    Andrew, Toby
    Cooper, Gregory M.
    Margulies, Elliott H.
    Mullikin, James C.
    Balding, David J.
    GENOME BIOLOGY, 2007, 8 (09)
  • [37] Insertions/deletions in sequences of highly homologous proteins can infer targetable differences in their spatial structures
    Cherkasov, A
    Nandan, D
    Reiner, NE
    FEBS JOURNAL, 2005, 272 : 79 - 80
  • [38] Identification of motifs with insertions and deletions in protein sequences using self-organizing neural networks
    Liu, DR
    Xiong, XX
    Hou, ZG
    DasGupta, B
    NEURAL NETWORKS, 2005, 18 (5-6) : 835 - 842
  • [39] A self-organizing neural network approach for the identification of motifs with insertions and deletions in protein sequences
    Xiong, XX
    Liu, DR
    Zhang, HG
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 292 - 297
  • [40] Evolution of alternative splicing: deletions, insertions and origin of functional parts of proteins from intron sequences
    Kondrashov, FA
    Koonin, EV
    TRENDS IN GENETICS, 2003, 19 (03) : 115 - 119