A Simulation Approach for Change-Points on Phylogenetic Trees

被引:2
|
作者
Persing, Adam [1 ]
Jasra, Ajay [2 ]
Beskos, Alexandros [1 ]
Balding, David [3 ]
De Iorio, Maria [1 ]
机构
[1] UCL, Dept Stat Sci, London WC1E 7HB, England
[2] Natl Univ Singapore, Dept Stat & Appl Probabil, Singapore 117548, Singapore
[3] UCL, Genet Inst, London WC1E 7HB, England
基金
英国工程与自然科学研究理事会;
关键词
sequential Monte Carlo samplers; approximate Bayesian computation; binary trees; particle marginal Metropolis-Hastings; change-point models; time machine; EVOLUTIONARY TREES; DNA-SEQUENCES; MODEL; INFERENCE; COMPUTATION; SELECTION; SITES;
D O I
10.1089/cmb.2014.0218
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We observe n sequences at each of m sites and assume that they have evolved from an ancestral sequence that forms the root of a binary tree of known topology and branch lengths, but the sequence states at internal nodes are unknown. The topology of the tree and branch lengths are the same for all sites, but the parameters of the evolutionary model can vary over sites. We assume a piecewise constant model for these parameters, with an unknown number of change-points and hence a transdimensional parameter space over which we seek to perform Bayesian inference. We propose two novel ideas to deal with the computational challenges of such inference. Firstly, we approximate the model based on the time machine principle: the top nodes of the binary tree (near the root) are replaced by an approximation of the true distribution; as more nodes are removed from the top of the tree, the cost of computing the likelihood is reduced linearly in n. The approach introduces a bias, which we investigate empirically. Secondly, we develop a particle marginal Metropolis-Hastings (PMMH) algorithm, that employs a sequential Monte Carlo (SMC) sampler and can use the first idea. Our time-machine PMMH algorithm copes well with one of the bottle-necks of standard computational algorithms: the transdimensional nature of the posterior distribution. The algorithm is implemented on simulated and real data examples, and we empirically demonstrate its potential to outperform competing methods based on approximate Bayesian computation (ABC) techniques.
引用
收藏
页码:10 / 24
页数:15
相关论文
共 50 条
  • [1] A SIMPLE BAYESIAN APPROACH TO MULTIPLE CHANGE-POINTS
    Lai, Tze Leung
    Xing, Haipeng
    STATISTICA SINICA, 2011, 21 (02) : 539 - 569
  • [2] Change-points and bootstrap
    Gombay, E
    Horváth, L
    ENVIRONMETRICS, 1999, 10 (06) : 725 - 736
  • [3] A Bayesian fusion approach to change-points analysis of processes
    Reboul, S
    Benjelloun, M
    2005 7th International Conference on Information Fusion (FUSION), Vols 1 and 2, 2005, : 699 - 704
  • [4] A BAYESIAN-APPROACH TO RETROSPECTIVE IDENTIFICATION OF CHANGE-POINTS
    BOOTH, NB
    SMITH, AFM
    JOURNAL OF ECONOMETRICS, 1982, 19 (01) : 7 - 22
  • [5] On the Adaptive Partition Approach to the Detection of Multiple Change-Points
    Lai, Yinglei
    PLOS ONE, 2011, 6 (05):
  • [6] Testing for bubbles and change-points
    Kirman, A
    Teyssière, G
    JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2005, 29 (04): : 765 - 799
  • [7] Detecting change-points in extremes
    Dupuis, D. J.
    Sun, Y.
    Wang, Huixia Judy
    STATISTICS AND ITS INTERFACE, 2015, 8 (01) : 19 - 31
  • [8] A semiparametric approach to the detection of change-points in volatility dynamics of financial data
    Hu, Huaiyu
    Gangopadhyay, Ashis
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024, 53 (12) : 5973 - 5993
  • [9] DETERMINING ONE OR MORE CHANGE-POINTS
    JONES, RH
    DEY, I
    CHEMISTRY AND PHYSICS OF LIPIDS, 1995, 76 (01) : 1 - 6
  • [10] Change-points in Bernoulli trials with dependence
    Krauth, J
    BETWEEN DATA SCIENCE AND APPLIED DATA ANALYSIS, 2003, : 261 - 269