Flexible comparison of batch correction methods for single-cell RNA-seq using BatchBench

被引:41
|
作者
Chazarra-Gil, Ruben [1 ]
van Dongen, Stijn [1 ]
Kiselev, Vladimir Yu [1 ]
Hemberg, Martin [1 ]
机构
[1] Wellcome Sanger Inst, Wellcome Genome Campus, Hinxton CB10 1SA, England
基金
英国惠康基金;
关键词
GENE-EXPRESSION; TECHNOLOGIES; POPULATION; ATLAS;
D O I
10.1093/nar/gkab004
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
As the cost of single-cell RNA-seq experiments has decreased, an increasing number of datasets are now available. Combining newly generated and publicly accessible datasets is challenging due to non-biological signals, commonly known as batch effects. Although there are several computational methods available that can remove batch effects, evaluating which method performs best is not straightforward. Here, we present BatchBench (https://github.com/cellgeni/batchbench), a modular and flexible pipeline for comparing batch correction methods for single-cell RNA-seq data. We apply BatchBench to eight methods, highlighting their methodological differences and assess their performance and computational requirements through a compendium of well-studied datasets. This systematic comparison guides users in the choice of batch correction tool, and the pipeline makes it easy to evaluate other datasets.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] SCRABBLE: single-cell RNA-seq imputation constrained by bulk RNA-seq data
    Tao Peng
    Qin Zhu
    Penghang Yin
    Kai Tan
    Genome Biology, 20
  • [32] Comparison of methods and resources for cell-cell communication inference from single-cell RNA-Seq data
    Daniel Dimitrov
    Dénes Türei
    Martin Garrido-Rodriguez
    Paul L. Burmedi
    James S. Nagai
    Charlotte Boys
    Ricardo O. Ramirez Flores
    Hyojin Kim
    Bence Szalai
    Ivan G. Costa
    Alberto Valdeolivas
    Aurélien Dugourd
    Julio Saez-Rodriguez
    Nature Communications, 13
  • [33] A Survey on Methods for Predicting Polyadenylation Sites from DNA Sequences, Bulk RNA-seq, and Single-cell RNA-seq
    Ye, Wenbin
    Lian, Qiwei
    Ye, Congting
    Wu, Xiaohui
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2023, 21 (01) : 67 - 83
  • [34] Comparison of methods and resources for cell-cell communication inference from single-cell RNA-Seq data
    Dimitrov, Daniel
    Tuerei, Denes
    Garrido-Rodriguez, Martin
    Burmedi, Paul L.
    Nagai, James S.
    Boys, Charlotte
    Flores, Ricardo O. Ramirez
    Kim, Hyojin
    Szalai, Bence
    Costa, Ivan G.
    Valdeolivas, Alberto
    Dugourd, Aurelien
    Saez-Rodriguez, Julio
    NATURE COMMUNICATIONS, 2022, 13 (01)
  • [35] Publisher Correction: Challenges in unsupervised clustering of single-cell RNA-seq data
    Vladimir Yu Kiselev
    Tallulah S. Andrews
    Martin Hemberg
    Nature Reviews Genetics, 2019, 20 : 310 - 310
  • [36] Correction: Corrigendum: Accounting for technical noise in single-cell RNA-seq experiments
    Philip Brennecke
    Simon Anders
    Jong Kyoung Kim
    Aleksandra A Kołodziejczyk
    Xiuwei Zhang
    Valentina Proserpio
    Bianka Baying
    Vladimir Benes
    Sarah A Teichmann
    John C Marioni
    Marcus G Heisler
    Nature Methods, 2014, 11 : 210 - 210
  • [37] Bubble: a fast single-cell RNA-seq imputation using an autoencoder constrained by bulk RNA-seq data
    Chen, Siqi
    Yan, Xuhua
    Zheng, Ruiqing
    Li, Min
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (01)
  • [38] Comparison of Scanpy-based algorithms to remove the batch effect from single-cell RNA-seq data
    Li J.
    Yu C.
    Ma L.
    Wang J.
    Guo G.
    Cell Regeneration, 9 (1)
  • [39] SMNN: batch effect correction for single-cell RNA-seq data via supervised mutual nearest neighbor detection
    Yang, Yuchen
    Li, Gang
    Qian, Huijun
    Wilhelmsen, Kirk C.
    Shen, Yin
    Li, Yun
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (03)
  • [40] SINGLE-CELL ANALYSIS From single-cell RNA-seq to transcriptional regulation
    La Manno, Gioele
    NATURE BIOTECHNOLOGY, 2019, 37 (12) : 1421 - 1422