scQCEA: a framework for annotation and quality control report of single-cell RNA-sequencing data

被引:2
|
作者
Nassiri, Isar [1 ]
Fairfax, Benjamin [2 ,3 ,4 ]
Lee, Angela [1 ]
Wu, Yanxia [1 ]
Buck, David [1 ]
Piazza, Paolo [1 ]
机构
[1] Univ Oxford, Oxford Genom Ctr, Wellcome Ctr Human Genet, Nuffield Dept Med, Oxford, England
[2] Univ Oxford, MRC Weatherall Inst Mol Med, Oxford, England
[3] Univ Oxford, Dept Oncol, Oxford, England
[4] Oxford Univ Hosp NHS Fdn Trust, Churchill Hosp, Oxford Canc Ctr, Oxford, England
基金
英国惠康基金;
关键词
Single cell RNA sequencing; Transcriptomics; Genomics; Cell type annotation;
D O I
10.1186/s12864-023-09447-6
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background Systematic description of library quality and sequencing performance of single-cell RNA sequencing (scRNA-seq) data is imperative for subsequent downstream modules, including re-pooling libraries. While several packages have been developed to visualise quality control (QC) metrics for scRNA-seq data, they do not include expression-based QC to discriminate between true variation and background noise.Results We present scQCEA (acronym of the single-cell RNA sequencing Quality Control and Enrichment Analysis), an R package to generate reports of process optimisation metrics for comparing sets of samples and visual evaluation of quality scores. scQCEA can import data from 10X or other single-cell platforms and includes functions for generating an interactive report of QC metrics for multi-omics data. In addition, scQCEA provides automated cell type annotation on scRNA-seq data using differential gene expression patterns for expression-based quality control. We provide a repository of reference gene sets, including 2348 marker genes, which are exclusively expressed in 95 human and mouse cell types.Using scRNA-seq data from 56 gene expressions and V(D)J T cell replicates, we show how scQCEA can be applied for the visual evaluation of quality scores for sets of samples. In addition, we use the summary of QC measures from 342 human and mouse shallow-sequenced gene expression profiles to specify optimal sequencing requirements to run a cell-type enrichment analysis function.Conclusions The open-source R tool will allow examining biases and outliers over biological and technical measures, and objective selection of optimal cluster numbers before downstream analysis. scQCEA is available at as an R package. Full documentation, including an example, is provided on the package website.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] RESCUE: imputing dropout events in single-cell RNA-sequencing data
    Tracy, Sam
    Yuan, Guo-Cheng
    Dries, Ruben
    BMC BIOINFORMATICS, 2019, 20 (1)
  • [32] RESCUE: imputing dropout events in single-cell RNA-sequencing data
    Sam Tracy
    Guo-Cheng Yuan
    Ruben Dries
    BMC Bioinformatics, 20
  • [33] DoubletDecon: Deconvoluting Doublets from Single-Cell RNA-Sequencing Data
    DePasquale, Erica A. K.
    Schnell, Daniel J.
    Van Camp, Pieter-Jan
    Valiente-Alandi, Inigo
    Blaxall, Burns C.
    Grimes, H. Leighton
    Singh, Harinder
    Salomonis, Nathan
    CELL REPORTS, 2019, 29 (06): : 1718 - +
  • [34] Reusability report: Learning the transcriptional grammar in single-cell RNA-sequencing data using transformers
    Sumeer Ahmad Khan
    Alberto Maillo
    Vincenzo Lagani
    Robert Lehmann
    Narsis A. Kiani
    David Gomez-Cabrero
    Jesper Tegner
    Nature Machine Intelligence, 2023, 5 : 1437 - 1446
  • [35] Reusability report: Learning the transcriptional grammar in single-cell RNA-sequencing data using transformers
    Khan, Sumeer Ahmad
    Maillo, Alberto
    Lagani, Vincenzo
    Lehmann, Robert
    Kiani, Narsis A.
    Gomez-Cabrero, David
    Tegner, Jesper
    NATURE MACHINE INTELLIGENCE, 2023, 5 (12) : 1437 - 1446
  • [36] A Galaxy-based training resource for single-cell RNA-sequencing quality control and analyses
    Etherington, Graham J.
    Soranzo, Nicola
    Mohammed, Suhaib
    Haerty, Wilfried
    Davey, Robert P.
    Di Palma, Federica
    GIGASCIENCE, 2019, 8 (12):
  • [37] scds: computational annotation of doublets in single-cell RNA sequencing data
    Bais, Abha S.
    Kostka, Dennis
    BIOINFORMATICS, 2020, 36 (04) : 1150 - 1158
  • [38] A single-cell RNA-sequencing training and analysis suite using the Galaxy framework
    Tekman, Mehmet
    Batut, Berenice
    Ostrovsky, Alexander
    Antoniewski, Christophe
    Clements, Dave
    Ramirez, Fidel
    Etherington, Graham J.
    Hotz, Hans-Rudolf
    Scholtalbers, Jelle
    Manning, Jonathan R.
    Bellenger, Lea
    Doyle, Maria A.
    Heydarian, Mohammad
    Huang, Ni
    Soranzo, Nicola
    Moreno, Pablo
    Mautner, Stefan
    Papatheodorou, Irene
    Nekrutenko, Anton
    Taylor, James
    Blankenberg, Daniel
    Backofen, Rolf
    Gruening, Bjoern
    GIGASCIENCE, 2020, 9 (10):
  • [39] Combining bulk RNA-sequencing and single-cell RNA-sequencing data to reveal the immune microenvironment and metabolic pattern of osteosarcoma
    Huang, Ruichao
    Wang, Xiaohu
    Yin, Xiangyun
    Zhou, Yaqi
    Sun, Jiansheng
    Yin, Zhongxiu
    Zhu, Zhi
    FRONTIERS IN GENETICS, 2022, 13
  • [40] X Software Benchmark-Classification Tree Algorithms for Cell Atlases Annotation Using Single-Cell RNA-Sequencing Data
    Alaqeeli, Omar
    Xing, Li
    Zhang, Xuekui
    MICROBIOLOGY RESEARCH, 2021, 12 (02) : 317 - 334