SkewC: Identifying cells with skewed gene body coverage in single-cell RNA sequencing data

被引:4
|
作者
Abugessaisa, Imad [1 ]
Hasegawa, Akira [1 ]
Noguchi, Shuhei [1 ]
Cardon, Melissa [1 ]
Watanabe, Kazuhide [2 ]
Takahashi, Masataka [2 ]
Suzuki, Harukazu [2 ]
Katayama, Shintaro [3 ,4 ,5 ]
Kere, Juha [3 ,4 ,5 ]
Kasukawa, Takeya [1 ,6 ]
机构
[1] RIKEN Ctr Integrat Med Sci, Lab Large Scale Biomed Data Technol, Tsurumi Ku, 1-7-22 Suehiro Cho, Yokohama, Kanagawa 2300045, Japan
[2] RIKEN Ctr Integrat Med Sci, Lab Cellular Funct Convers Technol, Tsurumi Ku, 1-7-22 Suehiro Cho, Yokohama, Kanagawa 2300045, Japan
[3] Folkhalsan Res Ctr, Topeliuksenkatu 20, Helsinki 00250, Finland
[4] Karolinska Inst, Dept Biosci & Nutr, S-14183 Huddinge, Sweden
[5] Univ Helsinki, Stem Cells & Metab Res Program, POB 4,Yliopistonkatu 3, Helsinki, Finland
[6] Osaka Univ, Inst Prot Res, Suita, Osaka 5650871, Japan
基金
瑞典研究理事会;
关键词
EXPRESSION; HETEROGENEITY; PROGRAMS; NOISE; FATE;
D O I
10.1016/j.isci.2022.103777
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The analysis and interpretation of single-cell RNA sequencing (scRNA-seq) experiments are compromised by the presence of poor-quality cells. For meaningful analyses, such poor-quality cells should be excluded as they introduce noise in the data. We introduce SkewC, a quality-assessment tool, to identify skewed cells in scRNA-seq experiments. The tool's methodology is based on the assessment of gene coverage for each cell, and its skewness as a quality measure; the gene body coverage is a unique characteristic for each protocol, and different protocols yield highly different coverage profiles. This tool is designed to avoid misclustering or false clusters by identifying, isolating, and removing cells with skewed gene body coverage profiles. SkewC is capable of processing any type of scRNA-seq dataset, regardless of the protocol. We envision SkewC as a distinctive QC method to be incorporated into scRNA-seq QC processing to preclude the possibility of scRNA-seq data misinterpretation.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] SIEVE: identifying robust single cell variable genes for single-cell RNA sequencing data
    Zhang, Yinan
    Xie, Xiaowei
    Wu, Peng
    Zhu, Ping
    [J]. BLOOD SCIENCE, 2021, 3 (02): : 35 - 39
  • [2] Differential gene expression analysis in single-cell RNA sequencing data
    Wang, Tianyu
    Nabavi, Sheida
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 202 - 207
  • [3] Joint gene network construction by single-cell RNA sequencing data
    Dong, Meichen
    He, Yiping
    Jiang, Yuchao
    Zou, Fei
    [J]. BIOMETRICS, 2023, 79 (02) : 915 - 925
  • [4] GENE EXPRESSION Single-cell RNA sequencing of collecting duct cells
    Allison, Susan J.
    [J]. NATURE REVIEWS NEPHROLOGY, 2018, 14 (01) : 3 - 3
  • [5] iDESC: identifying differential expression in single-cell RNA sequencing data with multiple subjects
    Liu, Yunqing
    Zhao, Jiayi
    Adams, Taylor S.
    Wang, Ningya
    Schupp, Jonas C.
    Wu, Weimiao
    McDonough, John E.
    Chupp, Geoffrey L.
    Kaminski, Naftali
    Wang, Zuoheng
    Yan, Xiting
    [J]. BMC BIOINFORMATICS, 2023, 24 (01)
  • [6] Identifying and removing the cellcycle effect from single-cell RNA-Sequencing data
    Barron, Martin
    Li, Jun
    [J]. SCIENTIFIC REPORTS, 2016, 6
  • [7] Prospects of Identifying Alternative Splicing Events from Single-Cell RNA Sequencing Data
    Wang, Jiacheng
    Yuan, Lei
    [J]. CURRENT BIOINFORMATICS, 2024, 19 (09) : 845 - 850
  • [8] iDESC: identifying differential expression in single-cell RNA sequencing data with multiple subjects
    Yunqing Liu
    Jiayi Zhao
    Taylor S. Adams
    Ningya Wang
    Jonas C. Schupp
    Weimiao Wu
    John E. McDonough
    Geoffrey L. Chupp
    Naftali Kaminski
    Zuoheng Wang
    Xiting Yan
    [J]. BMC Bioinformatics, 24
  • [9] Evaluation of single-cell classifiers for single-cell RNA sequencing data sets
    Zhao, Xinlei
    Wu, Shuang
    Fang, Nan
    Sun, Xiao
    Fan, Jue
    [J]. BRIEFINGS IN BIOINFORMATICS, 2020, 21 (05) : 1581 - 1595
  • [10] A comparison of marker gene selection methods for single-cell RNA sequencing data
    Jeffrey M. Pullin
    Davis J. McCarthy
    [J]. Genome Biology, 25