Evaluating the Adequacy of Molecular Clock Models Using Posterior Predictive Simulations

被引:27
|
作者
Duchene, David A. [1 ]
Duchene, Sebastian [2 ,3 ]
Holmes, Edward C. [2 ,3 ]
Ho, Simon Y. W. [2 ]
机构
[1] Australian Natl Univ, Res Sch Biol, Canberra, ACT, Australia
[2] Univ Sydney, Sch Biol Sci, Sydney, NSW 2006, Australia
[3] Univ Sydney, Sydney Med Sch, Marie Bashir Inst Infect Dis & Biosecur, Charles Perkins Ctr, Sydney, NSW 2006, Australia
基金
澳大利亚研究理事会; 英国医学研究理事会;
关键词
model adequacy; posterior predictive simulations; Bayesian phylogenetics; molecular clock; evolutionary rates; model selection; PHYLOGENETIC ANALYSIS; SUBSTITUTION; EVOLUTION; INFERENCE; CALIBRATION; CHOICE; UNCERTAINTY; FREQUENTIST; COALESCENT; SELECTION;
D O I
10.1093/molbev/msv154
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Molecular clock models are commonly used to estimate evolutionary rates and timescales from nucleotide sequences. The goal of these models is to account for rate variation among lineages, such that they are assumed to be adequate descriptions of the processes that generated the data. A common approach for selecting a clock model for a data set of interest is to examine a set of candidates and to select the model that provides the best statistical fit. However, this can lead to unreliable estimates if all the candidate models are actually inadequate. For this reason, a method of evaluating absolute model performance is critical. We describe a method that uses posterior predictive simulations to assess the adequacy of clock models. We test the power of this approach using simulated data and find that the method is sensitive to bias in the estimates of branch lengths, which tends to occur when using underparameterized clock models. We also compare the performance of the multinomial test statistic, originally developed to assess the adequacy of substitution models, but find that it has low power in identifying the adequacy of clock models. We illustrate the performance of our method using empirical data sets from coronaviruses, simian immunodeficiency virus, killer whales, and marine turtles. Our results indicate that methods of investigating model adequacy, including the one proposed here, should be routinely used in combination with traditional model selection in evolutionary studies. This will reveal whether a broader range of clock models to be considered in phylogenetic analysis.
引用
收藏
页码:2986 / 2995
页数:10
相关论文
共 50 条
  • [31] Evaluating Bayesian Models with Posterior Dispersion Indices
    Kucukelbir, Alp
    Wang, Yixin
    Blei, David M.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [32] Validating clustering of molecular dynamics simulations using polymer models
    Joshua L Phillips
    Michael E Colvin
    Shawn Newsam
    BMC Bioinformatics, 12
  • [33] Validating clustering of molecular dynamics simulations using polymer models
    Phillips, Joshua L.
    Colvin, Michael E.
    Newsam, Shawn
    BMC BIOINFORMATICS, 2011, 12
  • [34] Evaluating the adequacy of variance function using pairwise distances
    Zhao, Yan-Yong
    Zhao, Jian-Qiang
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2022, 51 (03) : 927 - 945
  • [35] Identifying the Infectious Period Distribution for Stochastic Epidemic Models Using the Posterior Predictive Check
    Alharthi, Muteb
    O'Neill, Philip
    Kypraios, Theodore
    BAYESIAN STATISTICS FROM METHODS TO MODELS AND APPLICATIONS: RESEARCH FROM BAYSM 2014, 2015, 126 : 109 - 114
  • [36] Assessing Quality of Spatial Models Using the Structural Similarity Index and Posterior Predictive Checks
    Robertson, Colin
    Long, Jed A.
    Nathoo, Farouk S.
    Nelson, Trisalyn A.
    Plouffe, Cameron C. F.
    GEOGRAPHICAL ANALYSIS, 2014, 46 (01) : 53 - 74
  • [37] Evaluating the adequacy of variance function using pairwise distances
    Yan-Yong Zhao
    Jian-Qiang Zhao
    Journal of the Korean Statistical Society, 2022, 51 : 927 - 945
  • [38] Fast predictive models based on multi-fidelity sampling of properties in molecular dynamics simulations
    Razi, M.
    Narayan, A.
    Kirby, R. M.
    Bedrov, D.
    COMPUTATIONAL MATERIALS SCIENCE, 2018, 152 : 125 - 133
  • [39] Evaluating the Theoretic Adequacy and Applied Potential of Computational Models of the Spacing Effect
    Walsh, Matthew M.
    Gluck, Kevin A.
    Gunzelmann, Glenn
    Jastrzembski, Tiffany
    Krusmark, Michael
    COGNITIVE SCIENCE, 2018, 42 : 644 - 691
  • [40] Evaluating the Adequacy of Gravity Models as a Description of Human Mobility for Epidemic Modelling
    Truscott, James
    Ferguson, Neil M.
    PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (10)