Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks

被引:9379
|
作者
Trapnell, Cole [1 ,2 ]
Roberts, Adam [3 ]
Goff, Loyal [1 ,2 ,4 ]
Pertea, Geo [5 ,6 ]
Kim, Daehwan [5 ,7 ]
Kelley, David R. [1 ,2 ]
Pimentel, Harold [3 ]
Salzberg, Steven L. [5 ,6 ]
Rinn, John L. [1 ,2 ]
Pachter, Lior [3 ,8 ,9 ]
机构
[1] Broad Inst MIT & Harvard, Cambridge, MA USA
[2] Harvard Univ, Dept Stem Cell & Regenerat Biol, Cambridge, MA 02138 USA
[3] Univ Calif Berkeley, Dept Comp Sci, Berkeley, CA 94720 USA
[4] MIT, Dept Elect Engn & Comp Sci, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
[5] Johns Hopkins Univ, Sch Med, Dept Med, McKusick Nathans Inst Genet Med, Baltimore, MD 21205 USA
[6] Johns Hopkins Univ, Dept Biostat, Baltimore, MD 21205 USA
[7] Univ Maryland, Ctr Bioinformat & Computat Biol, College Pk, MD USA
[8] Univ Calif Berkeley, Dept Math, Berkeley, CA 94720 USA
[9] Univ Calif Berkeley, Dept Mol & Cell Biol, Berkeley, CA 94720 USA
基金
美国国家卫生研究院;
关键词
SPLICE JUNCTIONS; MESSENGER-RNA; IN-VIVO; IDENTIFICATION; REVEALS; QUANTIFICATION; ANNOTATION;
D O I
10.1038/nprot.2012.016
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Recent advances in high-throughput cDNA sequencing (RNA-seq) can reveal new genes and splice variants and quantify expression genome-wide in a single assay. The volume and complexity of data from RNA-seq experiments necessitate scalable, fast and mathematically principled analysis software. TopHat and Cufflinks are free, open-source software tools for gene discovery and comprehensive expression analysis of high-throughput mRNA sequencing (RNA-seq) data. Together, they allow biologists to identify new genes and new splice variants of known ones, as well as compare gene and transcript expression under two or more conditions. This protocol describes in detail how to use TopHat and Cufflinks to perform such analyses. It also covers several accessory tools and utilities that aid in managing data, including CummeRbund, a tool for visualizing RNA-seq analysis results. Although the procedure assumes basic informatics skills, these tools assume little to no background with RNA-seq analysis and are meant for novices and experts alike. The protocol begins with raw sequencing reads and produces a transcriptome assembly, lists of differentially expressed and regulated genes and transcripts, and publication-quality visualizations of analysis results. The protocol's execution time depends on the volume of transcriptome sequencing data and available computing resources but takes less than 1 d of computer time for typical experiments and similar to 1 h of hands-on time.
引用
收藏
页码:562 / 578
页数:17
相关论文
共 50 条
  • [21] Differential Gene Expression by RNA-Seq Analysis of the Primo Vessel in the Rabbit Lymph
    Shin, Jun-Young
    Choi, Sang-Heon
    Choi, Da-Woon
    An, Ye-Jin
    Seo, Jae-Hyuk
    Choi, Jong-Gu
    Rho, Min-Suk
    Lee, Sang-Suk
    JOURNAL OF ACUPUNCTURE AND MERIDIAN STUDIES, 2019, 12 (01) : 11 - 19
  • [22] Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data
    Rapaport, Franck
    Khanin, Raya
    Liang, Yupu
    Pirun, Mono
    Krek, Azra
    Zumbo, Paul
    Mason, Christopher E.
    Socci, Nicholas D.
    Betel, Doron
    GENOME BIOLOGY, 2013, 14 (09):
  • [23] Power analysis for RNA-Seq differential expression studies
    Yu, Lianbo
    Fernandez, Soledad
    Brock, Guy
    BMC BIOINFORMATICS, 2017, 18
  • [24] Differential expression analysis for paired RNA-seq data
    Chung, Lisa M.
    Ferguson, John P.
    Zheng, Wei
    Qian, Feng
    Bruno, Vincent
    Montgomery, Ruth R.
    Zhao, Hongyu
    BMC BIOINFORMATICS, 2013, 14 : 110
  • [25] Power analysis for RNA-Seq differential expression studies
    Lianbo Yu
    Soledad Fernandez
    Guy Brock
    BMC Bioinformatics, 18
  • [26] Differential expression analysis for paired RNA-seq data
    Lisa M Chung
    John P Ferguson
    Wei Zheng
    Feng Qian
    Vincent Bruno
    Ruth R Montgomery
    Hongyu Zhao
    BMC Bioinformatics, 14
  • [27] Differential Expression Analysis of Gene and Transcript Abundance for Single Cell RNA-Seq Data using STAR and HISAT Aligners.
    Ngwa, Julius
    Wojciechowski, Robert
    Zack, Donald J.
    Beaty, Terri
    Ruczinski, Ingo
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2017, 58 (08)
  • [28] RNA-Seq Analysis of Differential Gene Expression in Electroporated Chick Embryonic Spinal Cord
    Vieceli, Felipe M.
    Yan, C. Y. Irene
    JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2014, (93):
  • [29] Experimental validation of methods for differential gene expression analysis and sample pooling in RNA-seq
    Anto P. Rajkumar
    Per Qvist
    Ross Lazarus
    Francesco Lescai
    Jia Ju
    Mette Nyegaard
    Ole Mors
    Anders D. Børglum
    Qibin Li
    Jane H. Christensen
    BMC Genomics, 16
  • [30] Experimental validation of methods for differential gene expression analysis and sample pooling in RNA-seq
    Rajkumar, Anto P.
    Qvist, Per
    Lazarus, Ross
    Lescai, Francesco
    Ju, Jia
    Nyegaard, Mette
    Mors, Ole
    Borglum, Anders D.
    Li, Qibin
    Christensen, Jane H.
    BMC GENOMICS, 2015, 16