PsiNorm: a scalable normalization for single-cell RNA-seq data

被引:13
|
作者
Borella, Matteo [1 ]
Martello, Graziano [1 ]
Risso, Davide [2 ]
Romualdi, Chiara [1 ]
机构
[1] Univ Padua, Dept Biol, I-35121 Padua, Italy
[2] Univ Padua, Dept Stat Sci, I-35121 Padua, Italy
基金
美国国家卫生研究院;
关键词
GENE-EXPRESSION;
D O I
10.1093/bioinformatics/btab641
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Single-cell RNA sequencing (scRNA-seq) enables transcriptome-wide gene expression measurements at single-cell resolution providing a comprehensive view of the compositions and dynamics of tissue and organism development. The evolution of scRNA-seq protocols has led to a dramatic increase of cells throughput, exacerbating many of the computational and statistical issues that previously arose for bulk sequencing. In particular, with scRNA-seq data all the analyses steps, including normalization, have become computationally intensive, both in terms of memory usage and computational time. In this perspective, new accurate methods able to scale efficiently are desirable. Results: Here, we propose PsiNorm, a between-sample normalization method based on the power-law Pareto distribution parameter estimate. Here, we show that the Pareto distribution well resembles scRNA-seq data, especially those coming from platforms that use unique molecular identifiers. Motivated by this result, we implement PsiNorm, a simple and highly scalable normalization method. We benchmark PsiNorm against seven other methods in terms of cluster identification, concordance and computational resources required. We demonstrate that PsiNorm is among the top performing methods showing a good trade-off between accuracy and scalability. Moreover, PsiNorm does not need a reference, a characteristic that makes it useful in supervised classification settings, in which new out-of-sample data need to be normalized.
引用
收藏
页码:164 / 172
页数:9
相关论文
共 50 条
  • [21] Evaluating imputation methods for single-cell RNA-seq data
    Cheng, Yi
    Ma, Xiuli
    Yuan, Lang
    Sun, Zhaoguo
    Wang, Pingzhang
    [J]. BMC BIOINFORMATICS, 2023, 24 (01)
  • [22] Testing for Phylogenetic Signal in Single-Cell RNA-Seq Data
    Moravec, Jiri C.
    Lanfear, Robert
    Spector, David L.
    Diermeier, Sarah D.
    Gavryushkin, Alex
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2023, 30 (04) : 518 - 537
  • [23] Locality Sensitive Imputation for Single-Cell RNA-Seq Data
    Moussa, Marmar
    Mandoiu, Ion I.
    [J]. BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2018, 2018, 10847 : 347 - 360
  • [24] Supervised Adversarial Alignment of Single-Cell RNA-seq Data
    Ge, Songwei
    Wang, Haohan
    Alavi, Amir
    Xing, Eric
    Bar-Joseph, Ziv
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2021, 28 (05) : 501 - 513
  • [25] Phylogenetic inference from single-cell RNA-seq data
    Xuan Liu
    Jason I. Griffiths
    Isaac Bishara
    Jiayi Liu
    Andrea H. Bild
    Jeffrey T. Chang
    [J]. Scientific Reports, 13
  • [26] Phylogenetic inference from single-cell RNA-seq data
    Liu, Xuan
    Griffiths, Jason I.
    Bishara, Isaac
    Liu, Jiayi
    Bild, Andrea H.
    Chang, Jeffrey T.
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [27] Deep Learning for Clustering Single-cell RNA-seq Data
    Zhu, Yuan
    Bai, Litai
    Ning, Zilin
    Fu, Wenfei
    Liu, Jie
    Jiang, Linfeng
    Fei, Shihuang
    Gong, Shiyun
    Lu, Lulu
    Deng, Minghua
    Yi, Ming
    [J]. CURRENT BIOINFORMATICS, 2024, 19 (03) : 193 - 210
  • [28] SCell: integrated analysis of single-cell RNA-seq data
    Diaz, Aaron
    Liu, Siyuan J.
    Sandoval, Carmen
    Pollen, Alex
    Nowakowski, Tom J.
    Lim, Daniel A.
    Kriegstein, Arnold
    [J]. BIOINFORMATICS, 2016, 32 (14) : 2219 - 2220
  • [29] The contribution of cell cycle to heterogeneity in single-cell RNA-seq data
    McDavid, Andrew
    Finak, Greg
    Gottardo, Raphael
    [J]. NATURE BIOTECHNOLOGY, 2016, 34 (06) : 591 - 593
  • [30] The contribution of cell cycle to heterogeneity in single-cell RNA-seq data
    Andrew McDavid
    Greg Finak
    Raphael Gottardo
    [J]. Nature Biotechnology, 2016, 34 : 591 - 593