Inferring single-cell copy number profiles through cross-cell segmentation of read counts

被引:2
|
作者
Liu, Furui [1 ]
Shi, Fangyuan [1 ,2 ]
Yu, Zhenhua [1 ,2 ]
机构
[1] Ningxia Univ, Sch Informat Engn, Yinchuan 750021, Peoples R China
[2] Ningxia Univ, Collaborat Innovat Ctr Ningxia Big Data & Artifici, Cofounded Ningxia Municipal & Minist Educ, Yinchuan 750021, Peoples R China
关键词
Single-cell DNA sequencing; Copy number alteration; Autoencoder; Mixture model; TUMOR EVOLUTION;
D O I
10.1186/s12864-023-09901-5
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
BackgroundCopy number alteration (CNA) is one of the major genomic variations that frequently occur in cancers, and accurate inference of CNAs is essential for unmasking intra-tumor heterogeneity (ITH) and tumor evolutionary history. Single-cell DNA sequencing (scDNA-seq) makes it convenient to profile CNAs at single-cell resolution, and thus aids in better characterization of ITH. Despite that several computational methods have been proposed to decipher single-cell CNAs, their performance is limited in either breakpoint detection or copy number estimation due to the high dimensionality and noisy nature of read counts data.ResultsBy treating breakpoint detection as a process to segment high dimensional read count sequence, we develop a novel method called DeepCNA for cross-cell segmentation of read count sequence and per-cell inference of CNAs. To cope with the difficulty of segmentation, an autoencoder (AE) network is employed in DeepCNA to project the original data into a low-dimensional space, where the breakpoints can be efficiently detected along each latent dimension and further merged to obtain the final breakpoints. Unlike the existing methods that manually calculate certain statistics of read counts to find breakpoints, the AE model makes it convenient to automatically learn the representations. Based on the inferred breakpoints, we employ a mixture model to predict copy numbers of segments for each cell, and leverage expectation-maximization algorithm to efficiently estimate cell ploidy by exploring the most abundant copy number state. Benchmarking results on simulated and real data demonstrate our method is able to accurately infer breakpoints as well as absolute copy numbers and surpasses the existing methods under different test conditions. DeepCNA can be accessed at: https://github.com/zhyu-lab/deepcna.ConclusionsProfiling single-cell CNAs based on deep learning is becoming a new paradigm of scDNA-seq data analysis, and DeepCNA is an enhancement to the current arsenal of computational methods for investigating cancer genomics.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Comprehensive Analysis of Copy Number Variations in Kidney Cancer by Single-Cell Exome Sequencing
    Zhou, Wenyang
    Yang, Fan
    Xu, Zhaochun
    Luo, Meng
    Wang, Pingping
    Guo, Yu
    Nie, Huan
    Yao, Lifen
    Jiang, Qinghua
    FRONTIERS IN GENETICS, 2020, 10
  • [42] A pan-cancer single-cell analysis of intratumoral copy number diversity and evolution
    Ye, Hanghui
    McDonald, Thomas
    Sei, Emi
    Minussi, Darlan Conterno
    Hu, Min
    Tang, Chenling
    Wang, Junke
    Wang, Kaile
    Casasent, Anna
    Chen, Hui
    Michor, Franziska
    Navin, Nicholas
    CANCER RESEARCH, 2024, 84 (06)
  • [43] Delineating copy number and clonal substructure in human tumors from single-cell transcriptomes
    Ruli Gao
    Shanshan Bai
    Ying C. Henderson
    Yiyun Lin
    Aislyn Schalck
    Yun Yan
    Tapsi Kumar
    Min Hu
    Emi Sei
    Alexander Davis
    Fang Wang
    Simona F. Shaitelman
    Jennifer Rui Wang
    Ken Chen
    Stacy Moulder
    Stephen Y. Lai
    Nicholas E. Navin
    Nature Biotechnology, 2021, 39 : 599 - 608
  • [44] SCOPE: A Normalization and Copy-Number Estimation Method for Single-Cell DNA Sequencing
    Wang, Rujin
    Lin, Dan-Yu
    Jiang, Yuchao
    CELL SYSTEMS, 2020, 10 (05) : 445 - +
  • [45] Tumor Copy Number Data Deconvolution Integrating Bulk and Single-cell Sequencing Data
    Lei, Haoyun
    Lyu, Bochuan
    Gertz, E. Michael
    Schaffer, Alejandro A.
    Schwartz, Russell
    2018 IEEE 8TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL ADVANCES IN BIO AND MEDICAL SCIENCES (ICCABS), 2018,
  • [46] Cross-species imputation and comparison of single-cell transcriptomic profiles
    Zhang, Ran
    Yang, Mu
    Schreiber, Jacob
    O'Day, Diana R.
    Turner, James M. A.
    Shendure, Jay
    Noble, William Stafford
    Disteche, Christine M.
    Deng, Xinxian
    GENOME BIOLOGY, 2025, 26 (01):
  • [47] Single-cell somatic copy number alteration profiling of vitreous humor seeds in retinoblastoma
    Sirivolu, Shreya
    Schmidt, Michael J.
    Prabakar, Rishvanth K.
    Kuhn, Peter
    Hicks, James
    Berry, Jesse L.
    Xu, Liya
    OPHTHALMIC GENETICS, 2024, 45 (06) : 646 - 649
  • [48] Single-cell genome-wide concurrent haplotyping and copy-number profiling through genotyping-by-sequencing
    Masset, Heleen
    Ding, Jia
    Dimitriadou, Eftychia
    Debrock, Sophie
    Tsuiko, Olga
    Smits, Katrien
    Peeraer, Karen
    Voet, Thierry
    Esteki, Masoud Zamani
    Vermeesch, Joris R.
    NUCLEIC ACIDS RESEARCH, 2022, 50 (11) : E63
  • [49] Precise measurement of vector copy number and transduction efficiency at single-cell resolution for cell and gene therapy development
    Li, A.
    Parikh, S.
    Patel, K.
    Yang, Y.
    Mendoza, D.
    Mohiuddin, M.
    He, H.
    Elliott, J.
    Enzmann, B.
    Pho, V.
    Cato, M.
    Wang, S.
    Schroeder, B.
    HUMAN GENE THERAPY, 2024, 35 (3-4) : A126 - A126
  • [50] Precise Measurement of Vector Copy Number and Transduction Efficiency at Single-Cell Resolution for Cell and Gene Therapy Development
    Li, Alex
    Parikh, Saurabh
    Patel, Khushali
    Yang, Yilong
    Mendoza, Daniel
    Mohiuddin, Mahir
    He, Hua-Jun
    Elliott, John
    Enzmann, Brittany
    Pho, Vanee
    Wang, Shu
    Schroeder, Benjamin
    MOLECULAR THERAPY, 2023, 31 (04) : 450 - 450