Generalized Mixture Models for Molecular Phylogenetic Estimation

被引:9
|
作者
Evans, Jason [1 ]
Sullivan, Jack [1 ]
机构
[1] Univ Idaho, Program Bioinformat & Computat Biol, Dept Biol Sci, Moscow, ID 83844 USA
基金
美国国家卫生研究院;
关键词
Bayesian phylogenetic inference; mixture models; model selection; polytomous trees; reversible jump Markov chain Monte-Carlo; CHAIN MONTE-CARLO; SEQUENCE; PATTERN;
D O I
10.1093/sysbio/syr093
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The rapidly growing availability of multigene sequence data during the past decade has enabled phylogeny estimation at phylogenomic scales. However, dealing with evolutionary process heterogeneity across the genome becomes increasingly challenging. Here we develop a mixture model approach that uses reversible jump Markov chain Monte Carlo (MCMC) estimation to permit as many distinct models as the data require. Each additional model considered may be a fully parametrized general time-reversible model or any of its special cases. Furthermore, we expand the usual proposal mechanisms for topology changes to permit hard polytomies (i.e., zero-length internal branches). This new approach is implemented in the Crux software toolkit. We demonstrate the feasibility of using reversible jump MCMC on mixture models by reexamining a well-known 44-taxon mammalian data set comprising 22 concatenated genes. We are able to reproduce the results of the original analysis (with respect to bipartition support) when we make identical assumptions, but when we allow for polytomies and/or use data-driven mixture model estimation, we infer much lower bipartition support values for several key bipartitions.
引用
收藏
页码:12 / 21
页数:10
相关论文
共 50 条
  • [1] Phylogenetic mixture models for proteins
    Le, Si Quang
    Lartillot, Nicolas
    Gascuel, Olivier
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2008, 363 (1512) : 3965 - 3976
  • [2] Classification of molecular sequence data using Bayesian phylogenetic mixture models
    Loza-Reyes, E.
    Hum, M. A.
    Robinson, A.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 75 : 81 - 95
  • [3] When Do Phylogenetic Mixture Models Mimic Other Phylogenetic Models?
    Allman, Elizabeth S.
    Rhodes, John A.
    Sullivant, Seth
    SYSTEMATIC BIOLOGY, 2012, 61 (06) : 1049 - 1059
  • [4] Identifiability of Large Phylogenetic Mixture Models
    John A. Rhodes
    Seth Sullivant
    Bulletin of Mathematical Biology, 2012, 74 : 212 - 231
  • [5] Identifiability of Large Phylogenetic Mixture Models
    Rhodes, John A.
    Sullivant, Seth
    BULLETIN OF MATHEMATICAL BIOLOGY, 2012, 74 (01) : 212 - 231
  • [6] Finite Mixture of Generalized Semiparametric Models: Variable Selection via Penalized Estimation
    Eskandari, Farzad
    Ormoz, Ehsan
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2016, 45 (10) : 3744 - 3759
  • [7] Statistical Generalized Derivative Applied to the Profile Likelihood Estimation in a Mixture of Semiparametric Models
    Hirose, Yuichi
    Liu, Ivy
    ENTROPY, 2020, 22 (03)
  • [8] Empirical profile mixture models for phylogenetic reconstruction
    Quang, Le Si
    Gascuel, Olivier
    Lartillot, Nicolas
    BIOINFORMATICS, 2008, 24 (20) : 2317 - 2323
  • [9] Distributed Estimation of Mixture Models
    Dedecius, Kamil
    Reichl, Jan
    BAYESIAN STATISTICS FROM METHODS TO MODELS AND APPLICATIONS: RESEARCH FROM BAYSM 2014, 2015, 126 : 27 - 36
  • [10] Mixture models for baseline estimation
    de Rooi, Johan J.
    Eilers, Paul H. C.
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2012, 117 : 56 - 60