Analysis of canonical and non-canonical splice sites in mammalian genomes

Cited by: 449
Authors
Burset, M [1]
Seledtsov, IA [1]
Solovyev, VV [1]
Affiliation
[1] Sanger Ctr, Informat Div, Cambridge CB10 1SA, England
Funding
Wellcome Trust (UK)
DOI
10.1093/nar/28.21.4364
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
A set of 43 337 splice junction pairs was extracted from mammalian GenBank annotated genes. Expressed sequence tag (EST) sequences support 22 489 of them. Of these, 98.71% contain canonical dinucleotides: GT and AG for donor and acceptor sites, respectively; 0.56% hold non-canonical GC-AG splice site pairs; and the remaining 0.73% occur in many small groups (each with a maximum size of 0.05%). Studying these groups, we observe that many of them contain splicing dinucleotides shifted from the annotated splice junction by one position. After close examination of such cases, we present a new classification consisting of only eight observed types of splice site pairs (out of 256 a priori possible combinations). EST alignments allow us to verify the exonic part of the splice sites, but many non-canonical cases may be due to intron sequencing errors. This idea is given substantial support when we compare the sequences of human genes having non-canonical splice sites deposited in GenBank by high throughput genome sequencing projects (HTG). A high proportion (156 out of 171) of the human non-canonical and EST-supported splice site sequences had a clear match in the human HTG. They can be classified after corrections as: 79 GC-AG pairs (of which one was an error that corrected to GC-AG), 61 errors that were corrected to GT-AG canonical pairs, six AT-AC pairs (of which two were errors that corrected to AT-AC), one case produced from a non-existent intron, seven cases found in HTG that were deposited to GenBank, and finally only two remaining cases of supported non-canonical splice sites. If we assume that approximately the same situation holds for the whole set of annotated mammalian non-canonical splice sites, then 99.24% of splice site pairs should be GT-AG, 0.69% GC-AG, 0.05% AT-AC and finally only 0.02% could consist of other types of non-canonical splice sites.
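As an illustration of the dinucleotide-based grouping described above, the following is a minimal Python sketch that labels an intron by its first two and last two bases and tallies the share of each pair. The function names (`classify_intron`, `tally_pairs`) and the example intron sequences are hypothetical and invented for this sketch; it is not the authors' pipeline and does not handle the shifted-junction cases the abstract mentions.

```python
from collections import Counter


def classify_intron(intron: str) -> str:
    """Return the donor-acceptor dinucleotide pair of an intron, e.g. 'GT-AG'.

    Assumes `intron` is the intron sequence only, read 5' to 3' on the
    coding strand (an assumption of this sketch).
    """
    seq = intron.upper()
    if len(seq) < 4:
        raise ValueError("intron too short to hold both dinucleotides")
    return f"{seq[:2]}-{seq[-2:]}"


def tally_pairs(introns):
    """Return the percentage of introns falling into each dinucleotide pair."""
    counts = Counter(classify_intron(s) for s in introns)
    total = sum(counts.values())
    return {pair: 100.0 * n / total for pair, n in counts.items()}


# Made-up toy introns, for illustration only.
introns = [
    "GTAAGTCCTTTCAG",  # GT-AG (canonical)
    "GTGAGTTTTTCTAG",  # GT-AG (canonical)
    "GCAAGTCTCTGCAG",  # GC-AG
    "ATATCCTTTTCCAC",  # AT-AC
]
percentages = tally_pairs(introns)
```

With four bases at each of four positions, this labelling has 4^4 = 256 possible outcomes, which is the "256 a priori possible combinations" figure in the abstract.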
We analyze several characteristics of EST-verified splice sites and build weight matrices for the major groups, which can be incorporated into gene prediction programs. We also present a set of EST-verified canonical splice sites larger by two orders of magnitude than the current one (22 199 entries versus ~600) and, finally, a set of 290 EST-supported non-canonical splice sites. Both sets should be significant for future investigations of the splicing mechanism.
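The abstract does not specify how the weight matrices are constructed, so the following is only a generic sketch of one common form: a position weight matrix of log-odds scores against a uniform background, with a pseudocount. The names `weight_matrix` and `score`, the pseudocount value, the uniform 0.25 background, and the example donor sequences are all assumptions made for illustration.

```python
import math


def weight_matrix(sites, pseudocount=1.0):
    """Build a log2-odds position weight matrix from equal-length sites.

    Assumes sequences contain only A, C, G, T and a uniform background
    of 0.25 per base (both are assumptions of this sketch).
    """
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("all sites must have the same length")
    matrix = []
    for pos in range(length):
        counts = {base: pseudocount for base in "ACGT"}
        for site in sites:
            counts[site[pos].upper()] += 1
        total = sum(counts.values())
        matrix.append(
            {base: math.log2((c / total) / 0.25) for base, c in counts.items()}
        )
    return matrix


def score(matrix, site):
    """Sum the per-position log-odds scores of a candidate site."""
    return sum(column[base] for column, base in zip(matrix, site.upper()))


# Made-up donor-site windows (3 exonic + 6 intronic bases), illustration only.
donors = ["CAGGTAAGT", "AAGGTGAGT", "CAGGTAAGA", "TAGGTAAGT"]
pwm = weight_matrix(donors)
```

A gene prediction program could compare `score(pwm, window)` against a threshold at each candidate position; a window carrying the conserved GT scores higher than one without it.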
Pages: 4364-4375
Page count: 12