Transcriptome sequencing of the Microarray Quality Control (MAQC) RNA reference samples using next generation sequencing

Cited by: 61
Authors
Mane, Shrinivasrao P. [2 ]
Evans, Clive [2 ]
Cooper, Kristal L. [2 ]
Crasta, Oswald R. [2 ]
Folkerts, Otto [2 ]
Hutchison, Stephen K. [3 ]
Harkins, Timothy T. [4 ]
Thierry-Mieg, Danielle [5 ]
Thierry-Mieg, Jean [5 ]
Jensen, Roderick V. [1 ]
Affiliations
[1] Virginia Tech, Dept Biol Sci, Blacksburg, VA 24061 USA
[2] Virginia Tech, Virginia Bioinformat Inst, Blacksburg, VA 24061 USA
[3] 454 Life Sci Inc, Branford, CT 06405 USA
[4] Roche Appl Sci, Indianapolis, IN 46250 USA
[5] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20894 USA
Source
BMC GENOMICS | 2009 / Volume 10
Keywords
GENE-EXPRESSION; CELL TRANSCRIPTOME; DISCOVERY;
DOI
10.1186/1471-2164-10-264
Chinese Library Classification number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification code
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Background: Transcriptome sequencing using next-generation sequencing platforms will soon be competing with DNA microarray technologies for global gene expression analysis. As a preliminary evaluation of these promising technologies, we performed deep sequencing of cDNA synthesized from the Microarray Quality Control (MAQC) reference RNA samples using Roche's 454 Genome Sequencer FLX.
Results: We generated more than 3.6 million sequence reads of average length 250 bp for the MAQC A and B samples and introduced a data analysis pipeline for translating cDNA read counts into gene expression levels. Using BLAST, 90% of the reads mapped to the human genome and 64% of the reads mapped to the RefSeq database of well-annotated genes with e-values <= 10^-20. We measured gene expression levels in the A and B samples by counting the numbers of reads that mapped to individual RefSeq genes in multiple sequencing runs. We used these counts to evaluate the MAQC quality metrics for reproducibility, sensitivity, specificity, and accuracy, and compared the results with DNA microarrays and quantitative RT-PCR (QRTPCR) from the MAQC studies. In addition, 88% of the reads were successfully aligned directly to the human genome using the AceView alignment programs with an average 90% sequence similarity to identify 137,899 unique exon junctions, including 22,193 new exon junctions not yet contained in the RefSeq database.
Conclusion: Using the MAQC metrics for evaluating the performance of gene expression platforms, the ExpressSeq results for gene expression levels showed excellent reproducibility, sensitivity, and specificity that improved systematically with increasing shotgun sequencing depth, and quantitative accuracy that was comparable to DNA microarrays and QRTPCR. In addition, a careful mapping of the reads to the genome using the AceView alignment programs shed new light on the complexity of the human transcriptome, including the discovery of thousands of new splice variants.
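The read-counting step described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes BLAST hits in the standard 12-column tabular layout (read ID in column 1, subject ID in column 2, e-value in column 11), keeps only each read's best hit at or below the 10^-20 cutoff quoted above, and counts reads per RefSeq entry. The function names, the accession IDs in the sample data, and the reads-per-million scaling are hypothetical choices made for illustration.

```python
from collections import Counter

E_VALUE_CUTOFF = 1e-20  # threshold quoted in the abstract


def count_reads_per_gene(blast_lines, cutoff=E_VALUE_CUTOFF):
    """Count reads per subject, using each read's best hit at or below cutoff.

    Assumes tab-separated BLAST output with the e-value in column 11.
    """
    best = {}  # read id -> (e-value, subject id)
    for line in blast_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment headers
        fields = line.split("\t")
        read_id, subject_id = fields[0], fields[1]
        evalue = float(fields[10])
        if evalue > cutoff:
            continue  # hit is too weak to count
        if read_id not in best or evalue < best[read_id][0]:
            best[read_id] = (evalue, subject_id)
    return Counter(subject for _, subject in best.values())


def reads_per_million(counts):
    """Scale raw counts by the total number of counted reads (an assumed normalization)."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {gene: 1e6 * n / total for gene, n in counts.items()}


# Hypothetical hits: read1 has two hits, read3 fails the e-value cutoff.
example = [
    "read1\tNM_000001\t98.0\t250\t5\t0\t1\t250\t100\t349\t1e-120\t450",
    "read1\tNM_000002\t90.0\t200\t20\t0\t1\t200\t10\t209\t1e-40\t150",
    "read2\tNM_000001\t99.0\t240\t2\t0\t1\t240\t50\t289\t1e-110\t430",
    "read3\tNM_000002\t85.0\t60\t9\t0\t1\t60\t5\t64\t1e-5\t50",
    "read4\tNM_000002\t97.0\t230\t7\t0\t1\t230\t20\t249\t1e-30\t400",
]
counts = count_reads_per_gene(example)
rpm = reads_per_million(counts)
```

Assigning each read to a single best hit avoids counting a read twice when it matches several transcripts; a real analysis would also need a rule for ties and for reads that match multiple genes equally well.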
Pages: 12