Assessing the reliability of spike-in normalization for analyses of single-cell RNA sequencing data

被引:40
|
作者
Lun, Aaron T. L. [1 ]
Calero-Nieto, Fernando J. [2 ]
Haim-Vilmovsky, Liora [3 ,4 ]
Gottgens, Berthold [2 ]
Marioni, John C. [1 ,3 ,4 ]
机构
[1] Univ Cambridge, Li Ka Shing Ctr, Canc Res UK Cambridge Inst, Cambridge CB2 0RE, England
[2] Univ Cambridge, Wellcome Trust & MRC Cambridge Stem Cell Inst, Cambridge CB2 0XY, England
[3] EMBL European Bioinformat Inst, Wellcome Genome Campus, Cambridge CB10 1SD, England
[4] Wellcome Trust Sanger Inst, Wellcome Genome Campus, Cambridge CB10 1SA, England
基金
英国惠康基金; 英国医学研究理事会;
关键词
DIFFERENTIAL EXPRESSION ANALYSIS; COMPUTATIONAL ANALYSIS; HETEROGENEITY; SEQ; TRANSCRIPTOME; DESIGN; NOISE;
D O I
10.1101/gr.222877.117
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
By profiling the transcriptomes of individual cells, single-cell RNA sequencing provides unparalleled resolution to study cellular heterogeneity. However, this comes at the cost of high technical noise, including cell-specific biases in capture efficiency and library generation. One strategy for removing these biases is to add a constant amount of spike-in RNA to each cell and to scale the observed expression values so that the coverage of spike-in transcripts is constant across cells. This approach has previously been criticized as its accuracy depends on the precise addition of spike-in RNA to each sample. Here, we perform mixture experiments using two different sets of spike-in RNA to quantify the variance in the amount of spike-in RNA added to each well in a plate-based protocol. We also obtain an upper bound on the variance due to differences in behavior between the two spike-in sets. We demonstrate that both factors are small contributors to the total technical variance and have only minor effects on downstream analyses, such as detection of highly variable genes and clustering. Our results suggest that scaling normalization using spike-in transcripts is reliable enough for routine use in single-cell RNA sequencing data analyses.
引用
收藏
页码:1795 / 1806
页数:12
相关论文
共 50 条
  • [1] SAMstrt: statistical test for differential expression in single-cell transcriptome with spike-in normalization
    Katayama, Shintaro
    Tohonen, Virpi
    Linnarsson, Sten
    Kere, Juha
    BIOINFORMATICS, 2013, 29 (22) : 2943 - 2945
  • [2] Spike-in normalization for single-cell RNA-seq reveals dynamic global transcriptional activity mediating anticancer drug response
    Wang, Xin
    Frederick, Jane
    Wang, Hongbin
    Hui, Sheng
    Backman, Vadim
    Ji, Zhe
    NAR GENOMICS AND BIOINFORMATICS, 2021, 3 (02)
  • [3] Single-cell RNA sequencing of nc886, a non-coding RNA transcribed by RNA polymerase III, with a primer spike-in strategy
    Shin, Gyeong-Jin
    Choi, Byung-Han
    Eum, Hye Hyeon
    Jo, Areum
    Kim, Nayoung
    Kang, Huiram
    Hong, Dongwan
    Jang, Jiyoung Joan
    Lee, Hwi-Ho
    Lee, Yeon-Su
    Lee, Yong Sun
    Lee, Hae-Ock
    PLOS ONE, 2024, 19 (08):
  • [4] Normalization by distributional resampling of high throughput single-cell RNA-sequencing data
    Brown, Jared
    Ni, Zijian
    Mohanty, Chitrasen
    Bacher, Rhonda
    Kendziorski, Christina
    BIOINFORMATICS, 2021, 37 (22) : 4123 - 4128
  • [5] Normalizing single-cell RNA sequencing data with internal spike-in-like genes
    Lin, Li
    Song, Minfang
    Jiang, Yong
    Zhao, Xiaojing
    Wang, Haopeng
    Zhang, Liye
    NAR GENOMICS AND BIOINFORMATICS, 2020, 2 (03)
  • [6] Evaluation of single-cell classifiers for single-cell RNA sequencing data sets
    Zhao, Xinlei
    Wu, Shuang
    Fang, Nan
    Sun, Xiao
    Fan, Jue
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (05) : 1581 - 1595
  • [7] Novel small RNA spike-in oligonucleotides enable absolute normalization of small RNA-Seq data
    Lutzmayer, Stefan
    Enugutti, Balaji
    Nodine, Michael D.
    SCIENTIFIC REPORTS, 2017, 7
  • [8] Complex Analysis of Single-Cell RNA Sequencing Data
    Khozyainova, Anna A. A.
    Valyaeva, Anna A. A.
    Arbatsky, Mikhail S. S.
    Isaev, Sergey V. V.
    Iamshchikov, Pavel S. S.
    Volchkov, Egor V. V.
    Sabirov, Marat S. S.
    Zainullina, Viktoria R. R.
    Chechekhin, Vadim I. I.
    Vorobev, Rostislav S. S.
    Menyailo, Maxim E. E.
    Tyurin-Kuzmin, Pyotr A. A.
    Denisov, Evgeny V. V.
    BIOCHEMISTRY-MOSCOW, 2023, 88 (02) : 231 - 252
  • [9] Splatter: simulation of single-cell RNA sequencing data
    Zappia, Luke
    Phipson, Belinda
    Oshlack, Alicia
    GENOME BIOLOGY, 2017, 18
  • [10] Complex Analysis of Single-Cell RNA Sequencing Data
    Anna A. Khozyainova
    Anna A. Valyaeva
    Mikhail S. Arbatsky
    Sergey V. Isaev
    Pavel S. Iamshchikov
    Egor V. Volchkov
    Marat S. Sabirov
    Viktoria R. Zainullina
    Vadim I. Chechekhin
    Rostislav S. Vorobev
    Maxim E. Menyailo
    Pyotr A. Tyurin-Kuzmin
    Evgeny V. Denisov
    Biochemistry (Moscow), 2023, 88 : 231 - 252