Identical sequence patterns in the ends of exons and introns of human protein-coding genes

被引:1
|
作者
Tavares, Raphael [1 ]
Renaud, Gabriel [1 ]
Lopes Oliveira, Paulo Sergio [2 ]
Ferreira, Carlos G.
Dias-Neto, Emmanuel [3 ,4 ]
Passetti, Fabio [1 ]
机构
[1] Inst Nacl Canc INCA, Bioinformat Unit, BR-20231050 Rio De Janeiro, RJ, Brazil
[2] Lab Nacl Biociencias, BR-13083970 Campinas, SP, Brazil
[3] Univ Sao Paulo, Lab Neurociencias LIM 27, Inst Psiquiatria, Fac Med, BR-01060970 Sao Paulo, Brazil
[4] Hosp AC Camargo Fund Antonio Prudente, Lab Med Genom, CIPE, BR-01508010 Sao Paulo, Brazil
关键词
Transcriptomics; Bioinformatics; Genome; Sequence analysis; SPLICEOSOMAL INTRONS;
D O I
10.1016/j.compbiolchem.2012.01.002
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Intron splicing is one of the most important steps involved in the maturation process of a pre-mRNA. Although the sequence profiles around the splice sites have been studied extensively, the levels of sequence identity between the exonic sequences preceding the donor sites and the intronic sequences preceding the acceptor sites has not been examined as thoroughly. In this study we investigated identity patterns between the last 15 nucleotides of the exonic sequence preceding the 5' splice site and the intronic sequence preceding the 3' splice site in a set of human protein-coding genes that do not exhibit intron retention. We found that almost 60% of consecutive exons and introns in human protein-coding genes share at least two identical nucleotides at their 3' ends and, on average, the sequence identity length is 2.47 nucleotides. Based on our findings we conclude that the 3' ends of exons and introns tend to have longer identical sequences within a gene than when being taken from different genes. Our results hold even if the pairs are non-consecutive in the transcription order. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:55 / 61
页数:7
相关论文
共 50 条
  • [1] Introns in protein-coding genes in Archaea
    Watanabe, Y
    Yokobori, S
    Inaba, T
    Yamagishi, A
    Oshima, T
    Kawarabayasi, Y
    Kikuchi, H
    Kita, K
    [J]. FEBS LETTERS, 2002, 510 (1-2) : 27 - 30
  • [2] RELATIONSHIP BETWEEN THE TOTAL SIZE OF EXONS AND INTRONS IN PROTEIN-CODING GENES OF HIGHER EUKARYOTES
    NAORA, H
    DEACON, NJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCES, 1982, 79 (20): : 6196 - 6200
  • [3] INTRONS AND THE ORIGIN OF PROTEIN-CODING GENES - REPLY
    STOLTZFUS, A
    SPENCER, DF
    ZUKER, M
    LOGSDON, JM
    DOOLITTLE, WF
    [J]. SCIENCE, 1995, 268 (5215) : 1367 - 1369
  • [4] INTRONS IN CHLOROPLAST PROTEIN-CODING GENES OF LAND PLANTS
    PLANT, AL
    GRAY, JC
    [J]. PHOTOSYNTHESIS RESEARCH, 1988, 16 (1-2) : 23 - 39
  • [5] Transcriptional Enhancers in Protein-Coding Exons of Vertebrate Developmental Genes
    Ritter, Deborah I.
    Dong, Zhiqiang
    Guo, Su
    Chuang, Jeffrey H.
    [J]. PLOS ONE, 2012, 7 (05):
  • [6] Introns Structure Patterns of Variation in Nucleotide Composition in Arabidopsis thaliana and Rice Protein-Coding Genes
    Ressayre, Adrienne
    Glemin, Sylvain
    Montalent, Pierre
    Serre-Giardi, Laurana
    Dillmann, Christine
    Joets, Johann
    [J]. GENOME BIOLOGY AND EVOLUTION, 2015, 7 (10): : 2913 - 2928
  • [7] Proposal of a new nomenclature for introns in protein-coding genes in fungal mitogenomes
    Shu Zhang
    Yong-Jie Zhang
    [J]. IMA Fungus, 10
  • [8] Proposal of a new nomenclature for introns in protein-coding genes in fungal mitogenomes
    Zhang, Shu
    Zhang, Yong-Jie
    [J]. IMA FUNGUS, 2019, 10 (1)
  • [9] Correction to: Proposal of a new nomenclature for introns in protein-coding genes in fungal mitogenomes
    Shu Zhang
    Yong-Jie Zhang
    [J]. IMA Fungus, 11
  • [10] The GC-content at the 5′ ends of human protein-coding genes is undergoing mutational decay
    Qiu, Yi
    Kang, Yoon Mo
    Korfmann, Christopher
    Pouyet, Fanny
    Eckford, Andrew
    Palazzo, Alexander F.
    [J]. GENOME BIOLOGY, 2024, 25 (01):