Genome-wide analyses supported by RNA-Seq reveal non-canonical splice sites in plant genomes

被引:31
|
作者
Pucker, Boas [1 ,2 ,3 ]
Brockington, Samuel F. [1 ]
机构
[1] Univ Cambridge, Dept Plant Sci, Evolut & Divers, Cambridge, England
[2] Bielefeld Univ, CeBiTec, Genet & Genom Plants, Bielefeld, Germany
[3] Bielefeld Univ, Fac Biol, Bielefeld, Germany
来源
BMC GENOMICS | 2018年 / 19卷
关键词
Gene structure; Splicing; Annotation; Comparative genomics; Transcriptomics; Gene expression; Natural diversity; Evolution; INTRONS; GENES; SEQUENCES; MECHANISM; PROTEIN; CONSERVATION; EVOLUTION; MONOCOT;
D O I
10.1186/s12864-018-5360-z
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
BackgroundMost eukaryotic genes comprise exons and introns thus requiring the precise removal of introns from pre-mRNAs to enable protein biosynthesis. U2 and U12 spliceosomes catalyze this step by recognizing motifs on the transcript in order to remove the introns. A process which is dependent on precise definition of exon-intron borders by splice sites, which are consequently highly conserved across species. Only very few combinations of terminal dinucleotides are frequently observed at intron ends, dominated by the canonical GT-AG splice sites on the DNA level.ResultsHere we investigate the occurrence of diverse combinations of dinucleotides at predicted splice sites. Analyzing 121 plant genome sequences based on their annotation revealed strong splice site conservation across species, annotation errors, and true biological divergence from canonical splice sites. The frequency of non-canonical splice sites clearly correlates with their divergence from canonical ones indicating either an accumulation of probably neutral mutations, or evolution towards canonical splice sites. Strong conservation across multiple species and non-random accumulation of substitutions in splice sites indicate a functional relevance of non-canonical splice sites. The average composition of splice sites across all investigated species is 98.7% for GT-AG, 1.2% for GC-AG, 0.06% for AT-AC, and 0.09% for minor non-canonical splice sites. RNA-Seq data sets of 35 species were incorporated to validate non-canonical splice site predictions through gaps in sequencing reads alignments and to demonstrate the expression of affected genes.ConclusionWe conclude that bona fide non-canonical splice sites are present and appear to be functionally relevant in most plant genomes, although at low abundance.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Genome-Wide Identification of Essential Proteins by Integrating RNA-seq, Subcellular Location and Complexes Information
    Fan, Chunyan
    Lei, Xiujuan
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2017, PT II, 2017, 10362 : 375 - 384
  • [42] Discovery of a Novel Driver of Radioresistance in Pancreatic Cancer (PC) Using Genome-Wide RNA-Seq
    Wolfe, A. R.
    Zuniga, O.
    Byrum, S. D.
    Leung, J.
    Tackett, A.
    Xia, F.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2022, 114 (03): : S12 - S12
  • [43] LaSSO, a strategy for genome-wide mapping of intronic lariats and branch points using RNA-seq
    Bitton, Danny A.
    Rallis, Charalampos
    Jeffares, Daniel C.
    Smith, Graeme C.
    Chen, Yuan Y. C.
    Codlin, Sandra
    Marguerat, Samuel
    Baehler, Juerg
    GENOME RESEARCH, 2014, 24 (07) : 1169 - 1179
  • [44] Genome-wide identification and analysis of the eQTL lncRNAs in multiple sclerosis based on RNA-seq data
    Han, Zhijie
    Xue, Weiwei
    Tao, Lin
    Lou, Yan
    Qiu, Yunqing
    Zhu, Feng
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (03) : 1023 - 1037
  • [45] Genome-wide Transcriptome Study for Myocardial Infarction and Coronary Artery Calcification Using RNA-Seq
    Zhang, Xiaoling
    Wakabayashi, Yoshiyuki
    Hwang, Shih-Jen
    Yang, Yanqin
    Levy, Daniel
    Johnson, Andrew D.
    Zhu, Jun
    O'Donnell, Christopher J.
    CIRCULATION, 2015, 132
  • [46] Genome-Wide ChIP-seq and RNA-seq Analyses of STAT3 Target Genes in TLRs Activated Human Peripheral Blood B Cells
    Wu, Jing
    Jin, Ying-Ying
    Gong, Ruo-Lan
    Yang, Fan
    Su, Xiao-Ya
    Chen, Tong-Xin
    FRONTIERS IN IMMUNOLOGY, 2022, 13
  • [47] Genome-wide QTL mapping and RNA-seq reveal genetic mechanisms behind discrepant growth traits in Pacific whiteleg shrimp, Litopenaeus vannamei
    Ma, Bo
    Liu, Yang
    Zhang, Xin
    Chen, Ting
    Zhang, Lvping
    Hu, Chaoqun
    Yu, Suzhong
    Chen, Guoqiang
    Liu, Liyan
    Zhu, Jingxuan
    Luo, Peng
    AQUACULTURE, 2025, 599
  • [48] Genome-Wide Identification and Characterization of Long Non-Coding RNAs from Mulberry (Morus notabilis) RNA-seq Data
    Song, Xiaobo
    Sun, Liang
    Luo, Haitao
    Ma, Qingguo
    Zhao, Yi
    Pei, Dong
    GENES, 2016, 7 (03)
  • [49] Characterization of genome-wide variations induced by gamma-ray radiation in barley using RNA-Seq
    Cong Tan
    Xiao-Qi Zhang
    Yin Wang
    Dianxin Wu
    Matthew I. Bellgard
    Yanhao Xu
    Xiaoli Shu
    Gaofeng Zhou
    Chengdao Li
    BMC Genomics, 20
  • [50] Characterization of genome-wide variations induced by gamma-ray radiation in barley using RNA-Seq
    Tan, Cong
    Zhang, Xiao-Qi
    Wang, Yin
    Wu, Dianxin
    Bellgard, Matthew I.
    Xu, Yanhao
    Shu, Xiaoli
    Zhou, Gaofeng
    Li, Chengdao
    BMC GENOMICS, 2019, 20 (01)