doubletD: detecting doublets in single-cell DNA sequencing data

被引:11
|
作者
Weber, Leah L. [1 ]
Sashittal, Palash [1 ,2 ]
El-Kebir, Mohammed [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Urbama, IL 61801 USA
[2] Univ Illinois, Dept Aerosp Engn, Urbana, IL 61801 USA
基金
美国国家科学基金会;
关键词
INFERENCE;
D O I
10.1093/bioinformatics/btab266
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: While single-cell DNA sequencing (scDNA-seq) has enabled the study of intratumor heterogeneity at an unprecedented resolution, current technologies are error-prone and often result in doublets where two or more cells are mistaken for a single cell. Not only do doublets confound downstream analyses, but the increase in doublet rate is also a major bottleneck preventing higher throughput with current single-cell technologies. Although doublet detection and removal are standard practice in scRNA-seq data analysis, options for scDNA-seq data are limited. Current methods attempt to detect doublets while also performing complex downstream analyses tasks, leading to decreased efficiency and/or performance. Results: We present doubletD, the first standalone method for detecting doublets in scDNA-seq data. Underlying our method is a simple maximum likelihood approach with a closed-form solution. We demonstrate the performance of doubletD on simulated data as well as real datasets, outperforming current methods for downstream analysis of scDNA-seq data that jointly infer doublets as well as standalone approaches for doublet detection in scRNA-seq data. Incorporating doubletD in scDNA-seq analysis pipelines will reduce complexity and lead to more accurate results. Availability and implementation: https://github.com/elkebir-group/doubletD. Contact: melkebir@illinois.edu Supplementary information: Supplementary data are available at Bioinformatics online.
引用
收藏
页码:I214 / I221
页数:8
相关论文
共 50 条