Inferring single-cell copy number profiles through cross-cell segmentation of read counts

被引:2
|
作者
Liu, Furui [1 ]
Shi, Fangyuan [1 ,2 ]
Yu, Zhenhua [1 ,2 ]
机构
[1] Ningxia Univ, Sch Informat Engn, Yinchuan 750021, Peoples R China
[2] Ningxia Univ, Collaborat Innovat Ctr Ningxia Big Data & Artifici, Cofounded Ningxia Municipal & Minist Educ, Yinchuan 750021, Peoples R China
关键词
Single-cell DNA sequencing; Copy number alteration; Autoencoder; Mixture model; TUMOR EVOLUTION;
D O I
10.1186/s12864-023-09901-5
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
BackgroundCopy number alteration (CNA) is one of the major genomic variations that frequently occur in cancers, and accurate inference of CNAs is essential for unmasking intra-tumor heterogeneity (ITH) and tumor evolutionary history. Single-cell DNA sequencing (scDNA-seq) makes it convenient to profile CNAs at single-cell resolution, and thus aids in better characterization of ITH. Despite that several computational methods have been proposed to decipher single-cell CNAs, their performance is limited in either breakpoint detection or copy number estimation due to the high dimensionality and noisy nature of read counts data.ResultsBy treating breakpoint detection as a process to segment high dimensional read count sequence, we develop a novel method called DeepCNA for cross-cell segmentation of read count sequence and per-cell inference of CNAs. To cope with the difficulty of segmentation, an autoencoder (AE) network is employed in DeepCNA to project the original data into a low-dimensional space, where the breakpoints can be efficiently detected along each latent dimension and further merged to obtain the final breakpoints. Unlike the existing methods that manually calculate certain statistics of read counts to find breakpoints, the AE model makes it convenient to automatically learn the representations. Based on the inferred breakpoints, we employ a mixture model to predict copy numbers of segments for each cell, and leverage expectation-maximization algorithm to efficiently estimate cell ploidy by exploring the most abundant copy number state. Benchmarking results on simulated and real data demonstrate our method is able to accurately infer breakpoints as well as absolute copy numbers and surpasses the existing methods under different test conditions. DeepCNA can be accessed at: https://github.com/zhyu-lab/deepcna.ConclusionsProfiling single-cell CNAs based on deep learning is becoming a new paradigm of scDNA-seq data analysis, and DeepCNA is an enhancement to the current arsenal of computational methods for investigating cancer genomics.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Inferring single-cell copy number profiles through cross-cell segmentation of read counts
    Furui Liu
    Fangyuan Shi
    Zhenhua Yu
    BMC Genomics, 25
  • [2] Single-cell copy number variation detection
    Cheng, Jiqiu
    Vanneste, Evelyne
    Konings, Peter
    Voet, Thierry
    Vermeesch, Joris R.
    Moreau, Yves
    GENOME BIOLOGY, 2011, 12 (08):
  • [3] Single-cell copy number variation detection
    Jiqiu Cheng
    Evelyne Vanneste
    Peter Konings
    Thierry Voet
    Joris R Vermeesch
    Yves Moreau
    Genome Biology, 12
  • [4] Inferring copy number substructure from single-cell transcriptomics in human tumors with CopyKat.
    Gao, Ruli
    Bai, Shanshan
    Ying, Henderson
    Lin, Yiyun
    Seth, Tapsi
    Hu, Min
    Sei, Emi
    Davis, Alexander
    Wang, Fang
    Wang, Jennifer Rui
    Chen, Ken
    Moulder, Stacey
    Lai, Stephen
    Navin, Nicholas
    CANCER RESEARCH, 2020, 80 (21)
  • [5] SCONCE2: jointly inferring single cell copy number profiles and tumor evolutionary distances
    Hui, Sandra
    Nielsen, Rasmus
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [6] SCONCE2: jointly inferring single cell copy number profiles and tumor evolutionary distances
    Sandra Hui
    Rasmus Nielsen
    BMC Bioinformatics, 23
  • [7] Inferring cell–cell communication at single-cell resolution
    Nature Biotechnology, 2024, 42 : 390 - 391
  • [8] DNA copy number profiling using single-cell sequencing
    Wang, Xuefeng
    Chen, Hao
    Zhang, Nancy R.
    BRIEFINGS IN BIOINFORMATICS, 2018, 19 (05) : 731 - 736
  • [9] Single-cell measurement of plasmid copy number and promoter activity
    Shao, Bin
    Rammohan, Jayan
    Anderson, Daniel A.
    Alperovich, Nina
    Ross, David
    Voigt, Christopher A.
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [10] Single-cell measurement of plasmid copy number and promoter activity
    Bin Shao
    Jayan Rammohan
    Daniel A. Anderson
    Nina Alperovich
    David Ross
    Christopher A. Voigt
    Nature Communications, 12