Accurate and efficient cell lineage tree inference from noisy single cell data: the maximum likelihood perfect phylogeny approach

Cited by: 17
Author
Wu, Yufeng [1]
Affiliation
[1] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA
Funding
National Science Foundation (USA);
Keywords
HETEROGENEITY; NUCLEOTIDE; EVOLUTION;
DOI
10.1093/bioinformatics/btz676
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification codes
071010; 081704;
Abstract
Motivation: Cells in an organism share a common evolutionary history, called the cell lineage tree. The cell lineage tree can be inferred from single cell genotypes at genomic variation sites, but inference from noisy single cell data is a challenging computational problem. Most existing methods for cell lineage tree inference assume uniform uncertainty in genotypes, whereas real single cell data usually have non-uniform uncertainty in individual genotypes. Moreover, existing methods are often sampling based and can be very slow on large data.
Results: In this article, we propose a new method called ScisTree, which infers the cell lineage tree and calls genotypes from noisy single cell genotype data. Unlike most existing approaches, ScisTree works with the probabilities of individual genotypes (which can be computed by existing single cell genotype callers). ScisTree assumes the infinite sites model. Given uncertain genotypes with individualized probabilities, ScisTree implements a fast heuristic for inferring the cell lineage tree and calling the genotypes that admit a so-called perfect phylogeny and maximize the likelihood of the genotypes. Through simulation, we show that ScisTree infers accurate trees and is much more efficient than existing methods. The efficiency of ScisTree enables new applications, including imputation of so-called doublets.
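The objective described in the abstract can be illustrated with a minimal Python sketch. It is not ScisTree's implementation or its search heuristic; it only shows the two ingredients the abstract names: the likelihood of a set of called binary genotypes under per-genotype probabilities, and a test for whether those genotypes admit a rooted perfect phylogeny under the infinite sites model. The function names, the toy matrix, and the convention that each entry is the probability of the mutated genotype are assumptions made for illustration. The compatibility test is the standard pairwise check: with an unmutated root, a binary matrix admits a perfect phylogeny exactly when no two sites show all three of the row patterns (0,1), (1,0) and (1,1).

```python
import math
from itertools import combinations


def log_likelihood(genotypes, prob_mutated):
    """Log-likelihood of called binary genotypes (rows = cells, columns = sites).

    prob_mutated[c][s] is assumed to be the probability that cell c carries
    the mutation at site s; genotypes are treated as independent.
    """
    total = 0.0
    for g_row, p_row in zip(genotypes, prob_mutated):
        for g, p in zip(g_row, p_row):
            total += math.log(p if g == 1 else 1.0 - p)
    return total


def admits_perfect_phylogeny(genotypes):
    """Pairwise test for a rooted perfect phylogeny with an all-zero root.

    Two sites conflict if some cells show (0,1), some (1,0) and some (1,1).
    """
    n_sites = len(genotypes[0]) if genotypes else 0
    for i, j in combinations(range(n_sites), 2):
        patterns = {(row[i], row[j]) for row in genotypes}
        if {(0, 1), (1, 0), (1, 1)} <= patterns:
            return False
    return True


# Hypothetical toy input: 3 cells x 3 sites of mutation probabilities.
prob_mutated = [
    [0.9, 0.8, 0.1],
    [0.9, 0.2, 0.1],
    [0.6, 0.1, 0.7],
]

# Naive calling: pick the more probable genotype at every entry.
called = [[1 if p > 0.5 else 0 for p in row] for row in prob_mutated]

print(called, admits_perfect_phylogeny(called), log_likelihood(called, prob_mutated))
```

In this toy case the naive per-entry calls happen to be compatible with a perfect phylogeny. When they are not, the problem the abstract describes is to find the compatible genotype matrix with the highest likelihood, which is what requires a search over trees rather than entry-by-entry calling.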
Pages: 742-750
Page count: 9