How tool combinations in different pipeline versions affect the outcome in RNA-seq analysis

Cited by: 0
Authors
Perelo, Louisa Wessels [1]
Gabernet, Gisela [1,5]
Straub, Daniel [1]
Nahnsen, Sven [1,2,3,4]
Affiliations
[1] Univ Tubingen, Quant Biol Ctr QBiC, Otfried Muller Str 37, D-72076 Tubingen, Germany
[2] Univ Tubingen, Fac Med, M3 Res Ctr, Otfried Muller Str 37, D-72076 Tubingen, Germany
[3] Univ Tubingen, Inst Bioinformat & Med Informat IBMI, Dept Comp Sci, Otfried Muller Str 37, D-72076 Tubingen, Germany
[4] Univ Tubingen, Image Guided & Functionally Instruct Tumor Therapi, Cluster Excellence iFIT EXC 2180, Otfried Muller Str 37, D-72076 Tubingen, Germany
[5] Yale Sch Med, Computat Immunol, New Haven, CT 06511 USA
Keywords
ALIGNMENT
DOI
10.1093/nargab/lqae020
Chinese Library Classification
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Data analysis tools are continuously changed and improved. To test how these changes influence the comparability between analyses, the outputs of different workflow options of the nf-core/rnaseq pipeline were compared. Five pipeline settings (STAR+Salmon, STAR+RSEM, STAR+featureCounts, HISAT2+featureCounts, and the pseudoaligner Salmon) were run on three datasets (human, Arabidopsis, zebrafish) containing spike-ins from the External RNA Control Consortium (ERCC). Fold change ratios and differential expression of genes and spike-ins were used to compare the different tool and version settings of the pipeline. Differential gene classification overlapped by 85% between pipelines. Genes interpreted with a bias were mostly those present at lower concentrations. The number of isoforms and the number of exons per gene were also determinants. Previous pipeline versions using featureCounts showed a higher sensitivity for detecting one-isoform genes such as the ERCC spike-ins. To ensure data comparability in long-term analysis series, it is advisable either to stay with the pipeline version the series was started with or to run both versions during a transition period, so that the target genes are treated the same way.
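The abstract does not state how the overlap in differential gene classification was computed. As a minimal sketch only, one plausible definition is the share of genes called differentially expressed by every pipeline setting among those called by at least one (a Jaccard-style index). The function name `classification_overlap` and the gene identifiers below are hypothetical toy values, not data or methods from the paper.

```python
from typing import Dict, Set


def classification_overlap(calls: Dict[str, Set[str]]) -> float:
    """Share of genes called DE by all pipelines among genes called DE by any.

    This Jaccard-style definition is an assumption for illustration; the
    source abstract does not specify how its overlap figure was derived.
    """
    sets = list(calls.values())
    union = set().union(*sets)
    if not union:
        return 0.0  # no DE calls at all (or no pipelines supplied)
    shared = set.intersection(*sets)
    return len(shared) / len(union)


# Hypothetical DE gene calls per pipeline setting (toy identifiers).
de_calls = {
    "STAR+Salmon": {"geneA", "geneB", "geneC", "geneD"},
    "STAR+featureCounts": {"geneA", "geneB", "geneC", "geneE"},
    "Salmon": {"geneA", "geneB", "geneC"},
}

overlap = classification_overlap(de_calls)
print(f"Overlap across {len(de_calls)} settings: {overlap:.2f}")
```

In this toy input, three of the five genes called by any setting are called by all of them. Other definitions (for example, pairwise overlap averaged over pipeline pairs) would give different numbers, so the definition used should be stated alongside any reported figure.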
Pages: 9