Statistics or biology: the zero-inflation controversy about scRNA-seq data

Cited by: 107
Authors
Jiang, Ruochen [1]
Sun, Tianyi [1]
Song, Dongyuan [2]
Li, Jingyi Jessica [1, 3, 4, 5]
Affiliations
[1] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Bioinformat Interdept PhD Program, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
[4] Univ Calif Los Angeles, Dept Computat Med, Los Angeles, CA 90095 USA
[5] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
Funding
National Science Foundation (USA);
Keywords
CELL GENE-EXPRESSION; SINGLE-CELL; RNA-SEQ; FATE DECISIONS; DNA; RECONSTRUCTION; AMPLIFICATION; IMPUTATION; BINDING; MODEL;
DOI
10.1186/s13059-022-02601-5
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification code
071005; 0836; 090102; 100705;
Abstract
Researchers view vast zeros in single-cell RNA-seq data differently: some regard zeros as biological signals representing no or low gene expression, while others regard zeros as missing data to be corrected. To help address the controversy, here we discuss the sources of biological and non-biological zeros; introduce five mechanisms of adding non-biological zeros in computational benchmarking; evaluate the impacts of non-biological zeros on data analysis; benchmark three input data types: observed counts, imputed counts, and binarized counts; discuss the open questions regarding non-biological zeros; and advocate the importance of transparent analysis.
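As a rough illustration of two ideas named in the abstract (the proportion of zeros in a gene's counts, and binarized counts as an input type), the sketch below uses made-up toy data and hypothetical function names. It is not the authors' benchmarking procedure. The Poisson comparison is only one simple reference point, and an excess of zeros over it does not by itself indicate whether those zeros are biological or non-biological.

```python
import math

def zero_fraction(counts):
    """Fraction of entries equal to zero in one gene's counts across cells."""
    return sum(1 for c in counts if c == 0) / len(counts)

def poisson_expected_zero_fraction(counts):
    """Zero probability of a Poisson distribution whose rate is the sample mean."""
    mean = sum(counts) / len(counts)
    return math.exp(-mean)

def binarize(counts):
    """Map each count to 1 if it is nonzero, else 0."""
    return [1 if c > 0 else 0 for c in counts]

# Hypothetical counts for one gene across ten cells (toy data, not from the paper).
gene_counts = [0, 0, 0, 0, 0, 0, 5, 7, 6, 2]

observed = zero_fraction(gene_counts)                   # share of zeros in the data
expected = poisson_expected_zero_fraction(gene_counts)  # share a Poisson fit would predict
excess = observed - expected                            # positive means more zeros than Poisson
binary = binarize(gene_counts)                          # binarized-count representation
```

In this toy case the gene is zero in six of ten cells while its mean count is 2, so a Poisson model with that mean would predict far fewer zeros. A pattern like this could arise from a mixture of cell types just as easily as from technical dropout, which is the kind of ambiguity the abstract describes.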
Pages: 24
Related papers
50 in total
  • [1] Statistics or biology: the zero-inflation controversy about scRNA-seq data
    Ruochen Jiang
    Tianyi Sun
    Dongyuan Song
    Jingyi Jessica Li
    Genome Biology, 23
  • [2] UMI or not UMI, that is the question for scRNA-seq zero-inflation
    Yingying Cao
    Simo Kitanovski
    Ralf Küppers
    Daniel Hoffmann
    Nature Biotechnology, 2021, 39 : 158 - 159
  • [3] UMI or not UMI, that is the question for scRNA-seq zero-inflation
    Cao, Yingying
    Kitanovski, Simo
    Kueppers, Ralf
    Hoffmann, Daniel
    NATURE BIOTECHNOLOGY, 2021, 39 (02) : 158 - 159
  • [4] Reply to: UMI or not UMI, that is the question for scRNA-seq zero-inflation
    Valentine Svensson
    Nature Biotechnology, 2021, 39 : 160 - 160
  • [5] Reply to: UMI or not UMI, that is the question for scRNA-seq zero-inflation
    Svensson, Valentine
    NATURE BIOTECHNOLOGY, 2021, 39 (02) : 160 - 160
  • [6] Droplet scRNA-seq is not zero-inflated
    Svensson, Valentine
    NATURE BIOTECHNOLOGY, 2020, 38 (02) : 147 - 150
  • [7] Droplet scRNA-seq is not zero-inflated
    Valentine Svensson
    Nature Biotechnology, 2020, 38 : 147 - 150
  • [8] Machine learning and system biology application to scRNA-seq data analysis
    Arbatskiy, Mikhail
    Sysoeva, Veronika
    Rubina, Kseniya
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 293 - 294
  • [9] SCBOOLSEQ: Linking scRNA-seq statistics and Boolean dynamics
    Magana-Lopez, Gustavo
    Calzone, Laurence
    Zinovyev, Andrei
    Pauleve, Loic
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (07)
  • [10] Computational approaches for interpreting scRNA-seq data
    Rostom, Raghd
    Svensson, Valentine
    Teichmann, Sarah A.
    Kar, Gozde
    FEBS LETTERS, 2017, 591 (15) : 2213 - 2225