scQCEA: a framework for annotation and quality control report of single-cell RNA-sequencing data

被引:2
|
作者
Nassiri, Isar [1 ]
Fairfax, Benjamin [2 ,3 ,4 ]
Lee, Angela [1 ]
Wu, Yanxia [1 ]
Buck, David [1 ]
Piazza, Paolo [1 ]
机构
[1] Univ Oxford, Oxford Genom Ctr, Wellcome Ctr Human Genet, Nuffield Dept Med, Oxford, England
[2] Univ Oxford, MRC Weatherall Inst Mol Med, Oxford, England
[3] Univ Oxford, Dept Oncol, Oxford, England
[4] Oxford Univ Hosp NHS Fdn Trust, Churchill Hosp, Oxford Canc Ctr, Oxford, England
基金
英国惠康基金;
关键词
Single cell RNA sequencing; Transcriptomics; Genomics; Cell type annotation;
D O I
10.1186/s12864-023-09447-6
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background Systematic description of library quality and sequencing performance of single-cell RNA sequencing (scRNA-seq) data is imperative for subsequent downstream modules, including re-pooling libraries. While several packages have been developed to visualise quality control (QC) metrics for scRNA-seq data, they do not include expression-based QC to discriminate between true variation and background noise.Results We present scQCEA (acronym of the single-cell RNA sequencing Quality Control and Enrichment Analysis), an R package to generate reports of process optimisation metrics for comparing sets of samples and visual evaluation of quality scores. scQCEA can import data from 10X or other single-cell platforms and includes functions for generating an interactive report of QC metrics for multi-omics data. In addition, scQCEA provides automated cell type annotation on scRNA-seq data using differential gene expression patterns for expression-based quality control. We provide a repository of reference gene sets, including 2348 marker genes, which are exclusively expressed in 95 human and mouse cell types.Using scRNA-seq data from 56 gene expressions and V(D)J T cell replicates, we show how scQCEA can be applied for the visual evaluation of quality scores for sets of samples. In addition, we use the summary of QC measures from 342 human and mouse shallow-sequenced gene expression profiles to specify optimal sequencing requirements to run a cell-type enrichment analysis function.Conclusions The open-source R tool will allow examining biases and outliers over biological and technical measures, and objective selection of optimal cluster numbers before downstream analysis. scQCEA is available at as an R package. Full documentation, including an example, is provided on the package website.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Single-cell Mayo Map (scMayoMap): an easy-to-use tool for cell type annotation in single-cell RNA-sequencing data analysis
    Yang, Lu
    Ng, Yan Er
    Sun, Haipeng
    Li, Ying
    Chini, Lucas C. S.
    Lebrasseur, Nathan K.
    Chen, Jun
    Zhang, Xu
    BMC BIOLOGY, 2023, 21 (01)
  • [22] Systematic determination of the mitochondrial proportion in human and mice tissues for single-cell RNA-sequencing data quality control
    Osorio, Daniel
    Cai, James J.
    BIOINFORMATICS, 2021, 37 (07) : 963 - 967
  • [23] Single-cell RNA-sequencing in asthma research
    Tang, Weifeng
    Li, Mihui
    Teng, Fangzhou
    Cui, Jie
    Dong, Jingcheng
    Wang, Wenqian
    FRONTIERS IN IMMUNOLOGY, 2022, 13
  • [24] Single-cell isolation by a modular single-cell pipette for RNA-sequencing
    Zhang, Kai
    Gao, Min
    Chong, Zechen
    Li, Ying
    Han, Xin
    Chen, Rui
    Qin, Lidong
    LAB ON A CHIP, 2016, 16 (24) : 4742 - 4748
  • [25] Automated annotation of rare-cell types from single-cell RNA-sequencing data through synthetic oversampling
    Bej, Saptarshi
    Galow, Anne-Marie
    David, Robert
    Wolfien, Markus
    Wolkenhauer, Olaf
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [26] Automated annotation of rare-cell types from single-cell RNA-sequencing data through synthetic oversampling
    Saptarshi Bej
    Anne-Marie Galow
    Robert David
    Markus Wolfien
    Olaf Wolkenhauer
    BMC Bioinformatics, 22
  • [27] Improved deconvolution of combined bulk and single-cell RNA-sequencing data
    Lei, Haoyun
    Guo, Xiaoyan A.
    Tao, Yifeng
    Ding, Kai
    Fu, Xuecong
    Oesterreich, Steffi
    Lee, Adrian V.
    Schwartz, Russell
    CANCER RESEARCH, 2022, 82 (12)
  • [28] Comparison of Computational Methods for Imputing Single-Cell RNA-Sequencing Data
    Zhang, Lihua
    Zhang, Shihua
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (02) : 376 - 389
  • [29] Consensus Nature Inspired Clustering of Single-Cell RNA-Sequencing Data
    Abou El-Naga, Amany H.
    Sayed, Sabah
    Salah, Akram
    Mohsen, Heba
    IEEE ACCESS, 2022, 10 : 98079 - 98094
  • [30] Missing data and technical variability in single-cell RNA-sequencing experiments
    Hicks, Stephanie C.
    Townes, F. William
    Teng, Mingxiang
    Irizarry, Rafael A.
    BIOSTATISTICS, 2018, 19 (04) : 562 - 578