Transcriptome sequencing of the Microarray Quality Control (MAQC) RNA reference samples using next generation sequencing

被引:61
|
作者
Mane, Shrinivasrao P. [2 ]
Evans, Clive [2 ]
Cooper, Kristal L. [2 ]
Crasta, Oswald R. [2 ]
Folkerts, Otto [2 ]
Hutchison, Stephen K. [3 ]
Harkins, Timothy T. [4 ]
Thierry-Mieg, Danielle [5 ]
Thierry-Mieg, Jean [5 ]
Jensen, Roderick V. [1 ]
机构
[1] Virginia Tech, Dept Biol Sci, Blacksburg, VA 24061 USA
[2] Virginia Tech, Virginia Bioinformat Inst, Blacksburg, VA 24061 USA
[3] 454 Life Sci Inc, Branford, CT 06405 USA
[4] Roche Appl Sci, Indianapolis, IN 46250 USA
[5] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20894 USA
来源
BMC GENOMICS | 2009年 / 10卷
关键词
GENE-EXPRESSION; CELL TRANSCRIPTOME; DISCOVERY;
D O I
10.1186/1471-2164-10-264
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Transcriptome sequencing using next-generation sequencing platforms will soon be competing with DNA microarray technologies for global gene expression analysis. As a preliminary evaluation of these promising technologies, we performed deep sequencing of cDNA synthesized from the Microarray Quality Control (MAQC) reference RNA samples using Roche's 454 Genome Sequencer FLX. Results: We generated more that 3.6 million sequence reads of average length 250 bp for the MAQC A and B samples and introduced a data analysis pipeline for translating cDNA read counts into gene expression levels. Using BLAST, 90% of the reads mapped to the human genome and 64% of the reads mapped to the RefSeq database of well annotated genes with e-values <= 10(-20). We measured gene expression levels in the A and B samples by counting the numbers of reads that mapped to individual RefSeq genes in multiple sequencing runs to evaluate the MAQC quality metrics for reproducibility, sensitivity, specificity, and accuracy and compared the results with DNA microarrays and Quantitative RT-PCR (QRTPCR) from the MAQC studies. In addition, 88% of the reads were successfully aligned directly to the human genome using the AceView alignment programs with an average 90% sequence similarity to identify 137,899 unique exon junctions, including 22,193 new exon junctions not yet contained in the RefSeq database. Conclusion: Using the MAQC metrics for evaluating the performance of gene expression platforms, the ExpressSeq results for gene expression levels showed excellent reproducibility, sensitivity, and specificity that improved systematically with increasing shotgun sequencing depth, and quantitative accuracy that was comparable to DNA microarrays and QRTPCR. In addition, a careful mapping of the reads to the genome using the AceView alignment programs shed new light on the complexity of the human transcriptome including the discovery of thousands of new splice variants.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Comparison of next generation sequencing technologies for transcriptome characterization
    Wall, P. Kerr
    Leebens-Mack, Jim
    Chanderbali, Andre S.
    Barakat, Abdelali
    Wolcott, Erik
    Liang, Haiying
    Landherr, Lena
    Tomsho, Lynn P.
    Hu, Yi
    Carlson, John E.
    Ma, Hong
    Schuster, Stephan C.
    Soltis, Douglas E.
    Soltis, Pamela S.
    Altman, Naomi
    dePamphilis, Claude W.
    BMC GENOMICS, 2009, 10
  • [32] Next generation sequencing of viral RNA genomes
    Denise A Marston
    Lorraine M McElhinney
    Richard J Ellis
    Daniel L Horton
    Emma L Wise
    Stacey L Leech
    Dan David
    Xavier de Lamballerie
    Anthony R Fooks
    BMC Genomics, 14
  • [33] Next generation sequencing of viral RNA genomes
    Marston, Denise A.
    McElhinney, Lorraine M.
    Ellis, Richard J.
    Horton, Daniel L.
    Wise, Emma L.
    Leech, Stacey L.
    David, Dan
    de Lamballerie, Xavier
    Fooks, Anthony R.
    BMC GENOMICS, 2013, 14
  • [34] Improving Product Quality and Safety using Next Generation Sequencing
    Clayton, Reginald
    McMutrie, Donna
    Armstrong, Alison
    Cote, Colette
    Chang, Audrey
    HUMAN GENE THERAPY, 2017, 28 (08) : A34 - A35
  • [35] Transcriptome profiling of mouse samples using nanopore sequencing of cDNA and RNA molecules
    Camille Sessegolo
    Corinne Cruaud
    Corinne Da Silva
    Audric Cologne
    Marion Dubarry
    Thomas Derrien
    Vincent Lacroix
    Jean-Marc Aury
    Scientific Reports, 9
  • [36] Decrypting the transcriptome profile of Pak4 using next generation sequencing
    Rane, Chetan
    Patel, Misaal
    Cai, Li
    Minden, Audrey
    CANCER RESEARCH, 2017, 77
  • [37] Transcriptome profiling of mouse samples using nanopore sequencing of cDNA and RNA molecules
    Sessegolo, Camille
    Cruaud, Corinne
    Da Silva, Corinne
    Cologne, Audric
    Dubarry, Marion
    Derrien, Thomas
    Lacroix, Vincent
    Aury, Jean-Marc
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [38] Using next-generation RNA sequencing to identify imprinted genes
    Wang, X.
    Clark, A. G.
    HEREDITY, 2014, 113 (02) : 156 - 166
  • [39] Using next-generation RNA sequencing to identify imprinted genes
    X Wang
    A G Clark
    Heredity, 2014, 113 : 156 - 166
  • [40] Cataloguing the small RNA content of honey using next generation sequencing
    Smith, Christopher
    Cokcetin, Nural
    Truong, Thuyen
    Harry, Elizabeth
    Hutvagner, Gyorgy
    Bajan, Sarah
    FOOD CHEMISTRY: MOLECULAR SCIENCES, 2021, 2