Mining SAGE data allows large-scale, sensitive screening of antisense transcript expression -: art. no. e163

被引:25
|
作者
Quéré, R
Manchon, L
Lejeune, M
Clément, O
Pierrat, F
Bonafoux, B
Commes, T
Piquemal, D
Marti, J
机构
[1] Inst Human Genet, CNRS, UPR 1142, F-34396 Montpellier, France
[2] Univ Montpellier 2, Skuld Tech, F-34095 Montpellier, France
关键词
D O I
10.1093/nar/gnh161
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
As a growing number of complementary transcripts, susceptible to exert various regulatory functions, are being found in eukaryotes, high throughput analytical methods are needed to investigate their expression in multiple biological samples. Serial Analysis of Gene Expression (SAGE), based on the enumeration of directionally reliable short cDNA sequences (tags), is capable of revealing antisense transcripts. We initially detected them by observing tags that mapped on to the reverse complement of known mRNAs. The presence of such tags in individual SAGE libraries suggested that SAGE datasets contain latent information on antisense transcripts. We raised a collection of virtual tags for mining these data. Tag pairs were assembled by searching for complementarities between 24-nt long sequences centered on the potential SAGE-anchoring sites of well-annotated human expressed sequences. An analysis of their presence in a large collection of published SAGE libraries revealed transcripts expressed at high levels from both strands of two adjacent, oppositely oriented, transcription units. In other cases, the respective transcripts of such cis-oriented genes displayed a mutually exclusive expression pattern or were co-expressed in a small number of libraries. Other tag pairs revealed overlapping transcripts of trans-encoded unique genes. Finally, we isolated a group of tags shared by multiple transcripts. Most of them mapped on to retroelements, essentially represented in humans by Alu sequences inserted in opposite orientations in the 3'UTR of otherwise different mRNAs. Registering these tags in separate files makes possible computational searches focused on unique sense-antisense pairs. The method developed in the present work shows that SAGE datasets constitute a major resource of rapidly investigating with high sensitivity the expression of antisense transcripts, so that a single tag may be detected in one library when screening a large number of biological samples.
引用
收藏
页数:7
相关论文
共 5 条
  • [1] A large-scale ASON platform in Tsinghua University - art. no. 60221E
    Zheng, XP
    Zhang, HY
    Network Architectures, Management, and Applications III, Pts 1 and 2, 2005, 6022 : E221 - E221
  • [2] Gene Catchr -: Gene cloning and tagging for Caenorhabditis elegans using yeast homologous recombination:: a novel approach for the analysis of gene expression -: art. no. E163
    Sassi, HE
    Renihan, S
    Spence, AM
    Cooperstock, RL
    NUCLEIC ACIDS RESEARCH, 2005, 33 (18) : 1 - 16
  • [3] Imaging the structure of a large-scale TID using ISR and TEC data -: art. no. L09812
    Nicolls, MJ
    Kelley, MC
    Coster, AJ
    González, SA
    Makela, JJ
    GEOPHYSICAL RESEARCH LETTERS, 2004, 31 (09) : L098121 - 4
  • [4] Strong-association-rule mining for large-scale gene-expression data analysis: a case study on human SAGE data
    Céline Becquet
    Sylvain Blachon
    Baptiste Jeudy
    Jean-Francois Boulicaut
    Olivier Gandrillon
    Genome Biology, 3 (12)
  • [5] Strong-association-rule mining for large-scale gene-expression data analysis: a case study on human SAGE data
    Becquet, Celine
    Blachon, Sylvain
    Jeudy, Baptiste
    Boulicaut, Jean-Francois
    Gandrillon, Olivier
    GENOME BIOLOGY, 2002, 3 (12):