doubletD: detecting doublets in single-cell DNA sequencing data

Cited by: 11
Authors
Weber, Leah L. [1]
Sashittal, Palash [1, 2]
El-Kebir, Mohammed [1]
Affiliations
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[2] Univ Illinois, Dept Aerosp Engn, Urbana, IL 61801 USA
Funding
US National Science Foundation
Keywords
INFERENCE;
DOI
10.1093/bioinformatics/btab266
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: While single-cell DNA sequencing (scDNA-seq) has enabled the study of intratumor heterogeneity at an unprecedented resolution, current technologies are error-prone and often result in doublets, where two or more cells are mistaken for a single cell. Not only do doublets confound downstream analyses, but the increase in doublet rate is also a major bottleneck preventing higher throughput with current single-cell technologies. Although doublet detection and removal are standard practice in scRNA-seq data analysis, options for scDNA-seq data are limited. Current methods attempt to detect doublets while also performing complex downstream analysis tasks, leading to decreased efficiency and/or performance.
Results: We present doubletD, the first standalone method for detecting doublets in scDNA-seq data. Underlying our method is a simple maximum likelihood approach with a closed-form solution. We demonstrate the performance of doubletD on simulated data as well as real datasets, where it outperforms both current methods for downstream analysis of scDNA-seq data that jointly infer doublets and standalone approaches for doublet detection in scRNA-seq data. Incorporating doubletD in scDNA-seq analysis pipelines will reduce complexity and lead to more accurate results.
Availability and implementation: https://github.com/elkebir-group/doubletD.
Contact: melkebir@illinois.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
Pages: i214-i221
Page count: 8