Support vector machines-based identification of alternative splicing in Arabidopsis thaliana from whole-genome tiling arrays

Cited by: 16
Authors
Eichner, Johannes [1, 3]
Zeller, Georg [1, 2, 4]
Laubinger, Sascha [2, 5]
Raetsch, Gunnar [1]
Affiliations
[1] Max Planck Gesell, Friedrich Miescher Lab, D-72076 Tubingen, Germany
[2] Max Planck Inst Dev Biol, D-72076 Tubingen, Germany
[3] Univ Tubingen, Ctr Bioinformat, D-72076 Tubingen, Germany
[4] European Mol Biol Lab, D-69117 Heidelberg, Germany
[5] Univ Tubingen, Ctr Plant Mol Biol, Auf Der Morgenstelle, Germany
Source
BMC BIOINFORMATICS | 2011 / Vol. 12
Keywords
TRANSCRIPTOME; MICROARRAYS; DATABASE; PLANTS; POLYMORPHISMS; DISEASE; RNAS;
DOI
10.1186/1471-2105-12-55
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Subject classification code
071010; 081704;
Abstract
Background: Alternative splicing (AS) is a process which generates several distinct mRNA isoforms from the same gene by splicing different portions out of the precursor transcript. Due to the (patho-)physiological importance of AS, a complete inventory of AS is of great interest. While this is in reach for human and mammalian model organisms, our knowledge of AS in plants has remained more incomplete. Experimental approaches for monitoring AS are either based on transcript sequencing or rely on hybridization to DNA microarrays. Among the microarray platforms facilitating the discovery of AS events, tiling arrays are well-suited for identifying intron retention, the most prevalent type of AS in plants. However, analyzing tiling array data is challenging, because of high noise levels and limited probe coverage.
Results: In this work, we present a novel method to detect intron retentions (IR) and exon skips (ES) from tiling arrays. While statistical tests have typically been proposed for this purpose, our method instead utilizes support vector machines (SVMs) which are appreciated for their accuracy and robustness to noise. Existing EST and cDNA sequences served for supervised training and evaluation. Analyzing a large collection of publicly available microarray and sequence data for the model plant A. thaliana, we demonstrated that our method is more accurate than existing approaches. The method was applied in a genome-wide screen which resulted in the discovery of 1,355 IR events. A comparison of these IR events to the TAIR annotation and a large set of short-read RNA-seq data showed that 830 of the predicted IR events are novel and that 525 events (39%) overlap with either the TAIR annotation or the IR events inferred from the RNA-seq data.
Conclusions: The method developed in this work expands the scarce repertoire of analysis tools for the identification of alternative mRNA splicing from whole-genome tiling arrays. Our predictions are highly enriched with known AS events and complement the A. thaliana genome annotation with respect to AS. Since all predicted AS events can be precisely attributed to experimental conditions, our work provides a basis for follow-up studies focused on the elucidation of the regulatory mechanisms underlying tissue-specific and stress-dependent AS in plants.
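The event counts quoted in the abstract can be cross-checked with a few lines of arithmetic. This is only an illustrative sketch: the variable names are made up for the example, and the numbers are the ones stated in the abstract, not recomputed from the underlying data.

```python
# Counts as stated in the abstract (variable names are illustrative).
total_ir = 1355      # predicted intron retention (IR) events
novel_ir = 830       # events reported as novel
supported_ir = 525   # events overlapping TAIR or RNA-seq-derived IR events

# The novel and supported groups together account for all predictions.
assert novel_ir + supported_ir == total_ir

# Share of each group, rounded to whole percent.
supported_pct = round(100 * supported_ir / total_ir)
novel_pct = round(100 * novel_ir / total_ir)
print(f"supported: {supported_pct}%, novel: {novel_pct}%")
```

The supported share rounds to the 39% given in the abstract, so roughly three in five predicted IR events are not covered by either the annotation or the RNA-seq comparison set.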
Pages: 17