Fidelity of hyperbolic space for Bayesian phylogenetic inference

被引:4
|
作者
Macaulay, Matthew O. [1 ]
Darling, Aaron [2 ]
Fourment, Mathieu O. [1 ]
机构
[1] Univ Technol Sydney, Australian Inst Microbiol & Infect, Sydney, Australia
[2] Illumina Australia Pty Ltd, Sydney, Australia
基金
澳大利亚研究理事会;
关键词
MAXIMUM-LIKELIHOOD; TREE; PERFORMANCE; PROPOSALS;
D O I
10.1371/journal.pcbi.1011084
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Bayesian inference for phylogenetics is a gold standard for computing distributions of phylogenies. However, Bayesian phylogenetics faces the challenging computational problem of moving throughout the high-dimensional space of trees. Fortunately, hyperbolic space offers a low dimensional representation of tree-like data. In this paper, we embed genomic sequences as points in hyperbolic space and perform hyperbolic Markov Chain Monte Carlo for Bayesian inference in this space. The posterior probability of an embedding is computed by decoding a neighbour-joining tree from the embedding locations of the sequences. We empirically demonstrate the fidelity of this method on eight data sets. We systematically investigated the effect of embedding dimension and hyperbolic curvature on the performance in these data sets. The sampled posterior distribution recovers the splits and branch lengths to a high degree over a range of curvatures and dimensions. We systematically investigated the effects of the embedding space's curvature and dimension on the Markov Chain's performance, demonstrating the suitability of hyperbolic space for phylogenetic inference. Author summary Why was this study done? Tree structures are widely used in fields such as phylogenetics, however modifying the layout and branch lengths of these structures simultaniously is a high-dimensional problem. Recent work in machine learning has demonstrated the usefulness of representing tree-like data as points in low dimensional hyperbolic space. We aimed to explore new ways of representing phylogenetic trees so they can be modified in a continuous manner. What did the researchers do and find? We represented trees by the locations of their embedded genomic sequences in hyperbolic space. We perturbed these continuous encoding locations and decoded an altered discrete tree structure. Using this technique, we performed Bayesian inference and computed the posterior distribution of standard eight datasets, to demonstrate the feasibility of phylogenetic inference with this representation. We found that hyperbolic space is suitable for Bayasian phylogenetics and is most efficient across a broad range of hyperbolic curvatures with low dimensionality. What do these findings mean? This method diversifies the way numerical methods can navigate the space of trees both in phylogenetics and more broadly. With hyperbolic embeddings, scaleable online inference is possible by quickly adding taxa to a tree or a distribution of trees. This method could open a wealth of powerful continuum-based methods to navigate the space of trees.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Sequential Bayesian Phylogenetic Inference
    Hoehna, Sebastian
    Hsiang, Allison Y.
    [J]. SYSTEMATIC BIOLOGY, 2024,
  • [2] Polytomies and Bayesian phylogenetic inference
    Lewis, PO
    Holder, MT
    Holsinger, KE
    [J]. SYSTEMATIC BIOLOGY, 2005, 54 (02) : 241 - 253
  • [3] A Variational Approach to Bayesian Phylogenetic Inference
    Zhang, Cheng
    Matsen IV, Frederick A.
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25 : 1 - 56
  • [4] MRBAYES: Bayesian inference of phylogenetic trees
    Huelsenbeck, JP
    Ronquist, F
    [J]. BIOINFORMATICS, 2001, 17 (08) : 754 - 755
  • [5] Parallel algorithms for Bayesian phylogenetic inference
    Feng, XZ
    Buell, DA
    Rose, JR
    Waddell, PJ
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2003, 63 (7-8) : 707 - 718
  • [6] Particle Gibbs sampling for Bayesian phylogenetic inference
    Wang, Shijia
    Wang, Liangliang
    [J]. BIOINFORMATICS, 2021, 37 (05) : 642 - 649
  • [7] Consistency of Bayesian inference of resolved phylogenetic trees
    Steel, Mike
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2013, 336 : 246 - 249
  • [8] Adaptive Tree Proposals for Bayesian Phylogenetic Inference
    Meyer, X.
    [J]. SYSTEMATIC BIOLOGY, 2021, 70 (05) : 1015 - 1032
  • [9] Empirical evaluation of a prior for Bayesian phylogenetic inference
    Yang, Ziheng
    [J]. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2008, 363 (1512) : 4031 - 4039
  • [10] MrBayes 3.2: Efficient Bayesian Phylogenetic Inference and Model Choice Across a Large Model Space
    Ronquist, Fredrik
    Teslenko, Maxim
    van der Mark, Paul
    Ayres, Daniel L.
    Darling, Aaron
    Hohna, Sebastian
    Larget, Bret
    Liu, Liang
    Suchard, Marc A.
    Huelsenbeck, John P.
    [J]. SYSTEMATIC BIOLOGY, 2012, 61 (03) : 539 - 542