Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks

被引:9379
|
作者
Trapnell, Cole [1 ,2 ]
Roberts, Adam [3 ]
Goff, Loyal [1 ,2 ,4 ]
Pertea, Geo [5 ,6 ]
Kim, Daehwan [5 ,7 ]
Kelley, David R. [1 ,2 ]
Pimentel, Harold [3 ]
Salzberg, Steven L. [5 ,6 ]
Rinn, John L. [1 ,2 ]
Pachter, Lior [3 ,8 ,9 ]
机构
[1] Broad Inst MIT & Harvard, Cambridge, MA USA
[2] Harvard Univ, Dept Stem Cell & Regenerat Biol, Cambridge, MA 02138 USA
[3] Univ Calif Berkeley, Dept Comp Sci, Berkeley, CA 94720 USA
[4] MIT, Dept Elect Engn & Comp Sci, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
[5] Johns Hopkins Univ, Sch Med, Dept Med, McKusick Nathans Inst Genet Med, Baltimore, MD 21205 USA
[6] Johns Hopkins Univ, Dept Biostat, Baltimore, MD 21205 USA
[7] Univ Maryland, Ctr Bioinformat & Computat Biol, College Pk, MD USA
[8] Univ Calif Berkeley, Dept Math, Berkeley, CA 94720 USA
[9] Univ Calif Berkeley, Dept Mol & Cell Biol, Berkeley, CA 94720 USA
基金
美国国家卫生研究院;
关键词
SPLICE JUNCTIONS; MESSENGER-RNA; IN-VIVO; IDENTIFICATION; REVEALS; QUANTIFICATION; ANNOTATION;
D O I
10.1038/nprot.2012.016
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Recent advances in high-throughput cDNA sequencing (RNA-seq) can reveal new genes and splice variants and quantify expression genome-wide in a single assay. The volume and complexity of data from RNA-seq experiments necessitate scalable, fast and mathematically principled analysis software. TopHat and Cufflinks are free, open-source software tools for gene discovery and comprehensive expression analysis of high-throughput mRNA sequencing (RNA-seq) data. Together, they allow biologists to identify new genes and new splice variants of known ones, as well as compare gene and transcript expression under two or more conditions. This protocol describes in detail how to use TopHat and Cufflinks to perform such analyses. It also covers several accessory tools and utilities that aid in managing data, including CummeRbund, a tool for visualizing RNA-seq analysis results. Although the procedure assumes basic informatics skills, these tools assume little to no background with RNA-seq analysis and are meant for novices and experts alike. The protocol begins with raw sequencing reads and produces a transcriptome assembly, lists of differentially expressed and regulated genes and transcripts, and publication-quality visualizations of analysis results. The protocol's execution time depends on the volume of transcriptome sequencing data and available computing resources but takes less than 1 d of computer time for typical experiments and similar to 1 h of hands-on time.
引用
收藏
页码:562 / 578
页数:17
相关论文
共 50 条
  • [31] Analysis and Differential Expression of Primo Genes Using RNA-Seq and qRT-PCR Experiments
    Shin, Jun-Young
    Ji, Jong-Ok
    Choi, Sang-Heon
    Choi, Da-Woon
    An, Ye-Jin
    Seo, Jae-Hyeok
    Choi, Jong-Gu
    Rho, Min-Suk
    Lee, Ji Yoon
    Yeo, Sujung
    Lee, Sang-Suk
    OXYGEN TRANSPORT TO TISSUE XLI, 2020, 1232 : 393 - 399
  • [32] Analysis of differential gene expression by RNA-seq data in brain areas of laboratory animals
    Babenko, Vladimir N.
    Bragin, Anatoly O.
    Spitsina, Anastasia M.
    Chadaeva, Irina V.
    Galieva, Elvira R.
    Orlova, Galina V.
    Medvedeva, Irina V.
    Orlov, Yuriy L.
    JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2016, 13 (04) : 292
  • [33] Differential gene expression analysis of palbociclib-resistant TNBC via RNA-seq
    Lanceta, Lilibeth
    Lypova, Nadiia
    O'Neill, Conor
    Li, Xiaohong
    Rouchka, Eric
    Chesney, Jason
    Imbert-Fernandez, Yoannis
    BREAST CANCER RESEARCH AND TREATMENT, 2021, 186 (03) : 677 - 686
  • [34] Development of a quantitative targeted RNA-Seq methodology for use in differential gene expression analysis
    Lader, Eric
    Hussong, Melanie
    Fosbrink, Matthew
    CANCER RESEARCH, 2016, 76
  • [35] Differential gene expression analysis of palbociclib-resistant TNBC via RNA-seq
    Lilibeth Lanceta
    Nadiia Lypova
    Conor O’Neill
    Xiaohong Li
    Eric Rouchka
    Jason Chesney
    Yoannis Imbert-Fernandez
    Breast Cancer Research and Treatment, 2021, 186 : 677 - 686
  • [36] Erratum to: Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data
    Franck Rapaport
    Raya Khanin
    Yupu Liang
    Mono Pirun
    Azra Krek
    Paul Zumbo
    Christopher E. Mason
    Nicholas D. Socci
    Doron Betel
    Genome Biology, 16
  • [37] Gene set enrichment analysis of RNA-Seq data: integrating differential expression and splicing
    Wang, Xi
    Cairns, Murray J.
    BMC BIOINFORMATICS, 2013, 14
  • [38] Gene set enrichment analysis of RNA-Seq data: integrating differential expression and splicing
    Xi Wang
    Murray J Cairns
    BMC Bioinformatics, 14
  • [39] Stability of methods for differential expression analysis of RNA-seq data
    Bingqing Lin
    Zhen Pang
    BMC Genomics, 20
  • [40] RNA-Seq for Enrichment and Analysis of IRF5 Transcript Expression in SLE
    Stone, Rivka C.
    Du, Peicheng
    Feng, Di
    Dhawan, Kopal
    Ronnblom, Lars
    Eloranta, Maija-Leena
    Donnelly, Robert
    Barnes, Betsy J.
    PLOS ONE, 2013, 8 (01):