Choosing among Partition Models in Bayesian Phylogenetics

被引:144
|
作者
Fan, Yu [1 ]
Wu, Rui [2 ]
Chen, Ming-Hui [2 ]
Kuo, Lynn [2 ]
Lewis, Paul O. [1 ]
机构
[1] Univ Connecticut, Dept Ecol & Evolutionary Biol, Storrs, CT 06269 USA
[2] Univ Connecticut, Dept Stat, Storrs, CT 06269 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
phylogenetics; Bayes factor; marginal likelihood; harmonic mean method; stepping-stone method; partitioning; MONTE-CARLO; NORMALIZING CONSTANTS; EVOLUTION; INFERENCE; SELECTION; IDENTITY; RATIO;
D O I
10.1093/molbev/msq224
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Bayesian phylogenetic analyses often depend on Bayes factors (BFs) to determine the optimal way to partition the data. The marginal likelihoods used to compute BFs, in turn, are most commonly estimated using the harmonic mean (HM) method, which has been shown to be inaccurate. We describe a new more accurate method for estimating the marginal likelihood of a model and compare it with the HM method on both simulated and empirical data. The new method generalizes our previously described stepping-stone (SS) approach by making use of a reference distribution parameterized using samples from the posterior distribution. This avoids one challenging aspect of the original SS method, namely the need to sample from distributions that are close (in the Kullback-Leibler sense) to the prior. We specifically address the choice of partition models and find that using the HM method can lead to a strong preference for an overpartitioned model. In contrast to the HM method and the original SS method, we show using simulated data that the generalized SS method is strikingly more precise (repeatable BF values of the same data and partition model) and yields BF values that are much more reasonable than those produced by the HM method. Comparisons of HM and generalized SS methods on an empirical data set demonstrate that the generalized SS method tends to choose simpler partition schemes that are more in line with expectation based on inferred patterns of molecular evolution. The generalized SS method shares with thermodynamic integration the need to sample from a series of distributions in addition to the posterior. Such dedicated path-based Markov chain Monte Carlo analyses appear to be a cost of estimating marginal likelihoods accurately.
引用
收藏
页码:523 / 532
页数:10
相关论文
共 50 条
  • [1] BAYESIAN DISCRIMINATION - METHOD FOR CHOOSING AMONG COMPETING DIGITAL-SIMULATION MODELS
    GARMAN, MB
    [J]. SIMULATION, 1975, 25 (04) : 109 - 113
  • [2] Bayesian clustering and product partition models
    Quintana, FA
    Iglesias, PL
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2003, 65 : 557 - 574
  • [3] Cross-validation to select Bayesian hierarchical models in phylogenetics
    Duchene, Sebastian
    Duchene, David A.
    Di Giallonardo, Francesca
    Eden, John-Sebastian
    Geoghegan, Jemma L.
    Holt, Kathryn E.
    Ho, Simon Y. W.
    Holmes, Edward C.
    [J]. BMC EVOLUTIONARY BIOLOGY, 2016, 16
  • [4] Cross-validation to select Bayesian hierarchical models in phylogenetics
    Sebastián Duchêne
    David A. Duchêne
    Francesca Di Giallonardo
    John-Sebastian Eden
    Jemma L. Geoghegan
    Kathryn E. Holt
    Simon Y. W. Ho
    Edward C. Holmes
    [J]. BMC Evolutionary Biology, 16
  • [5] Similarity analysis in Bayesian random partition models
    Navarrete, Carlos A.
    Quintana, Fernando A.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (01) : 97 - 109
  • [6] Scalable Bayesian phylogenetics
    Fisher, Alexander A.
    Hassler, Gabriel W.
    Ji, Xiang
    Baele, Guy
    Suchard, Marc A.
    Lemey, Philippe
    [J]. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2022, 377 (1861)
  • [7] Bayesian phylogenetics of Bryozoa
    Tsyganov-Bodounov, Anton
    Hayward, Peter J.
    Porter, Joanne S.
    Skibinski, David O. F.
    [J]. MOLECULAR PHYLOGENETICS AND EVOLUTION, 2009, 52 (03) : 904 - 910
  • [8] Choosing and Using Introns in Molecular Phylogenetics
    Creer, Simon
    [J]. EVOLUTIONARY BIOINFORMATICS, 2007, 3 : 99 - 108
  • [9] Bayesian Value-at-Risk with product partition models
    Bormetti, Giacomo
    De Giuli, Maria Elena
    Delpini, Danilo
    Tarantola, Claudia
    [J]. QUANTITATIVE FINANCE, 2012, 12 (05) : 769 - 780
  • [10] An MCMC algorithm for bayesian analysis of Hierarchical Partition Models
    Stefano Sampietro
    Piero Veronese
    [J]. Journal of the Italian Statistical Society, 1998, 7 (2) : 209 - 220