scQCEA: a framework for annotation and quality control report of single-cell RNA-sequencing data

被引:2
|
作者
Nassiri, Isar [1 ]
Fairfax, Benjamin [2 ,3 ,4 ]
Lee, Angela [1 ]
Wu, Yanxia [1 ]
Buck, David [1 ]
Piazza, Paolo [1 ]
机构
[1] Univ Oxford, Oxford Genom Ctr, Wellcome Ctr Human Genet, Nuffield Dept Med, Oxford, England
[2] Univ Oxford, MRC Weatherall Inst Mol Med, Oxford, England
[3] Univ Oxford, Dept Oncol, Oxford, England
[4] Oxford Univ Hosp NHS Fdn Trust, Churchill Hosp, Oxford Canc Ctr, Oxford, England
基金
英国惠康基金;
关键词
Single cell RNA sequencing; Transcriptomics; Genomics; Cell type annotation;
D O I
10.1186/s12864-023-09447-6
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background Systematic description of library quality and sequencing performance of single-cell RNA sequencing (scRNA-seq) data is imperative for subsequent downstream modules, including re-pooling libraries. While several packages have been developed to visualise quality control (QC) metrics for scRNA-seq data, they do not include expression-based QC to discriminate between true variation and background noise.Results We present scQCEA (acronym of the single-cell RNA sequencing Quality Control and Enrichment Analysis), an R package to generate reports of process optimisation metrics for comparing sets of samples and visual evaluation of quality scores. scQCEA can import data from 10X or other single-cell platforms and includes functions for generating an interactive report of QC metrics for multi-omics data. In addition, scQCEA provides automated cell type annotation on scRNA-seq data using differential gene expression patterns for expression-based quality control. We provide a repository of reference gene sets, including 2348 marker genes, which are exclusively expressed in 95 human and mouse cell types.Using scRNA-seq data from 56 gene expressions and V(D)J T cell replicates, we show how scQCEA can be applied for the visual evaluation of quality scores for sets of samples. In addition, we use the summary of QC measures from 342 human and mouse shallow-sequenced gene expression profiles to specify optimal sequencing requirements to run a cell-type enrichment analysis function.Conclusions The open-source R tool will allow examining biases and outliers over biological and technical measures, and objective selection of optimal cluster numbers before downstream analysis. scQCEA is available at as an R package. Full documentation, including an example, is provided on the package website.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] scAnno: a deconvolution strategy-based automatic cell type annotation tool for single-cell RNA-sequencing data sets
    Liu, Hongjia
    Li, Huamei
    Sharma, Amit
    Huang, Wenjuan
    Pan, Duo
    Gu, Yu
    Lin, Lu
    Sun, Xiao
    Liu, Hongde
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (03)
  • [42] Quantitative assessment of single-cell RNA-sequencing methods
    Angela R Wu
    Norma F Neff
    Tomer Kalisky
    Piero Dalerba
    Barbara Treutlein
    Michael E Rothenberg
    Francis M Mburu
    Gary L Mantalas
    Sopheak Sim
    Michael F Clarke
    Stephen R Quake
    Nature Methods, 2014, 11 : 41 - 46
  • [43] Power analysis of single-cell RNA-sequencing experiments
    Svensson, Valentine
    Natarajan, Kedar Nath
    Ly, Lam-Ha
    Miragaia, Ricardo J.
    Labalette, Charlotte
    Macaulay, Iain C.
    Cvejic, Ana
    Teichmann, Sarah A.
    NATURE METHODS, 2017, 14 (04) : 381 - +
  • [44] A Data-Driven Clustering Recommendation Method for Single-Cell RNA-Sequencing Data
    Tian, Yu
    Zheng, Ruiqing
    Liang, Zhenlan
    Li, Suning
    Wu, Fang-Xiang
    Li, Min
    TSINGHUA SCIENCE AND TECHNOLOGY, 2021, 26 (05) : 772 - 789
  • [45] Power analysis of single-cell RNA-sequencing experiments
    Valentine Svensson
    Kedar Nath Natarajan
    Lam-Ha Ly
    Ricardo J Miragaia
    Charlotte Labalette
    Iain C Macaulay
    Ana Cvejic
    Sarah A Teichmann
    Nature Methods, 2017, 14 : 381 - 387
  • [46] A Data-Driven Clustering Recommendation Method for Single-Cell RNA-Sequencing Data
    Yu Tian
    Ruiqing Zheng
    Zhenlan Liang
    Suning Li
    Fang-Xiang Wu
    Min Li
    TsinghuaScienceandTechnology, 2021, 26 (05) : 772 - 789
  • [47] Clustering single-cell rna-sequencing data based on matching clusters structures
    Wang, Yizhang
    Zhou, You
    Pang, Wie
    Liang, Yanchun
    Wang, Shu
    Tehnicki Vjesnik, 2020, 27 (01): : 89 - 95
  • [48] Recovery of missing single-cell RNA-sequencing data with optimized transcriptomic references
    Pool, Allan-Hermann
    Poldsam, Helen
    Chen, Sisi
    Thomson, Matt
    Oka, Yuki
    NATURE METHODS, 2023, 20 (10) : 1506 - +
  • [49] Identifying and removing the cellcycle effect from single-cell RNA-Sequencing data
    Barron, Martin
    Li, Jun
    SCIENTIFIC REPORTS, 2016, 6
  • [50] Expression variation analysis for tumor heterogeneity in single-cell RNA-sequencing data
    Davis-Marcisak, Emily F.
    Orugunta, Pranay
    Stein-O'Brien, Genevieve
    Puram, Sidharth V.
    Torres, Evanthia Roussos
    Hopkins, Alexander
    Jaffee, Elizabeth M.
    Favorov, Alexander V.
    Afsari, Bahman
    Goff, Loyal A.
    Fertig, Elana J.
    CANCER RESEARCH, 2019, 79 (13)