More than half of Caenorhabditis elegans pre-mRNAs lose their original 5' ends in a process termed "trans-splicing" in which the RNA extending from the transcription start site (TSS) to the site of trans-splicing of the primary transcript, termed the "outron," is replaced with a 22-nt spliced leader. This complicates the mapping of TSSs, leading to a lack of available TSS mapping data for these genes. We used growth at low temperature and nuclear isolation to enrich for transcripts still containing outrons, applying a modified SAGE capture procedure and high-throughput sequencing to characterize 5' termini in this transcript population. We report from this data both a landscape of 5'-end utilization for C. elegans and a representative collection of TSSs for 7351 trans-spliced genes. TSS distributions for individual genes were often dispersed, with a greater average number of TSSs for trans-spliced genes, suggesting that trans-splicing may remove selective pressure for a single TSS. Upstream of newly defined TSSs, we observed well-known motifs (including TATAA-box and SP1) as well as novel motifs. Several of these motifs showed association with tissue-specific expression and/or conservation among six worm species. Comparing TSS features between trans-spliced and non-trans-spliced genes, we found stronger signals among outron TSSs for preferentially positioning of flanking nucleosomes and for downstream Pol II enrichment. Our data provide an enabling resource for both experimental and theoretical analysis of gene structure and function in C. elegans.
机构:
Rice Univ, Dept Biosci, Houston, TX 77005 USA
Rice Univ, Dept Bioengn, Houston, TX USAUniv Warwick, Math Inst, Coventry, W Midlands, England
Warmflash, Aryeh
Rand, David A.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Warwick, Math Inst, Coventry, W Midlands, England
Univ Warwick, Zeeman Inst Syst Biol & Infect Dis Epidemiol Res, Coventry, W Midlands, EnglandUniv Warwick, Math Inst, Coventry, W Midlands, England
机构:
Univ So Calif, Dept Biol Sci, Sect Mol & Computat Biol, Los Angeles, CA 90089 USAUniv So Calif, Dept Biol Sci, Sect Mol & Computat Biol, Los Angeles, CA 90089 USA
Main, Bradley J.
Smith, Andrew D.
论文数: 0引用数: 0
h-index: 0
机构:
Univ So Calif, Dept Biol Sci, Sect Mol & Computat Biol, Los Angeles, CA 90089 USAUniv So Calif, Dept Biol Sci, Sect Mol & Computat Biol, Los Angeles, CA 90089 USA
Smith, Andrew D.
Jang, Hyosik
论文数: 0引用数: 0
h-index: 0
机构:
Univ So Calif, Dept Biol Sci, Sect Mol & Computat Biol, Los Angeles, CA 90089 USAUniv So Calif, Dept Biol Sci, Sect Mol & Computat Biol, Los Angeles, CA 90089 USA
Jang, Hyosik
Nuzhdin, Sergey V.
论文数: 0引用数: 0
h-index: 0
机构:
Univ So Calif, Dept Biol Sci, Sect Mol & Computat Biol, Los Angeles, CA 90089 USAUniv So Calif, Dept Biol Sci, Sect Mol & Computat Biol, Los Angeles, CA 90089 USA