AnaCoDa: analyzing codon data with Bayesian mixture models

被引:7
|
作者
Landerer, Cedric [1 ,2 ]
Cope, Alexander [3 ,4 ]
Zaretzki, Russell [2 ,5 ]
Gilchrist, Michael A. [1 ,2 ]
机构
[1] Univ Tennessee, Dept Ecol & Evolutionary Biol, Knoxville, TN 37996 USA
[2] Univ Tennessee, Natl Inst Math & Biol Synth, Knoxville, TN 37996 USA
[3] Univ Tennessee, Genome Sci & Technol, Knoxville, TN USA
[4] Oak Ridge Natl Lab, Oak Ridge, TN USA
[5] Univ Tennessee, Dept Stat Operat & Management Sci, Knoxville, TN USA
基金
美国国家科学基金会;
关键词
SELECTION; USAGE; BIAS;
D O I
10.1093/bioinformatics/bty138
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
AnaCoDa is an R package for estimating biologically relevant parameters of mixture models, such as selection against translation inefficiency, non-sense errors and ribosome pausing time, from genomic and high throughput datasets. AnaCoDa provides an adaptive Bayesian MCMC algorithm, fully implemented in C++ for high performance with an ergonomic R interface to improve usability. AnaCoDa employs a generic object-oriented design to allow users to extend the framework and implement their own models. Current models implemented in AnaCoDa can accurately estimate biologically relevant parameters given either protein coding sequences or ribosome foot-printing data. Optionally, AnaCoDa can utilize additional data sources, such as gene expression measurements, to aid model fitting and parameter estimation. By utilizing a hierarchical object structure, some parameters can vary between sets of genes while others can be shared. Genes may be assigned to clusters or membership may be estimated by AnaCoDa. This flexibility allows users to estimate the same model parameter under different biological conditions and categorize genes into different sets based on shared model properties embedded within the data. AnaCoDa also allows users to generate simulated data which can be used to aid model development and model analysis as well as evaluate model adequacy. Finally, AnaCoDa contains a set of visualization routines and the ability to revisit or re-initiate previous model fitting, providing researchers with a well rounded easy to use framework to analyze genome scale data.
引用
收藏
页码:2496 / 2498
页数:3
相关论文
共 50 条
  • [41] Bayesian analysis of doubly semiparametric mixture cure models with interval-censored data
    Liu, Xiaoyu
    Xiang, Liming
    STATISTICS AND COMPUTING, 2025, 35 (03)
  • [42] Bayesian analysis for mixture of latent variable hidden Markov models with multivariate longitudinal data
    Xia, Ye-Mao
    Tang, Nian-Sheng
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2019, 132 : 190 - 211
  • [43] Bayesian sensitivity analysis of incomplete data: bridging pattern-mixture and selection models
    Kaciroti, Niko A.
    Raghunathan, Trivellore
    STATISTICS IN MEDICINE, 2014, 33 (27) : 4841 - 4857
  • [44] IMPROVING BAYESIAN MIXTURE MODELS FOR MULTIPLE IMPUTATION OF MISSING DATA USING FOCUSED CLUSTERING
    Wei, Lan
    Reiter, Jerome P.
    REVSTAT-STATISTICAL JOURNAL, 2018, 16 (02) : 213 - 230
  • [45] Multiple imputation of longitudinal categorical data through bayesian mixture latent Markov models
    Vidotto, Davide
    Vermunt, Jeroen K.
    Van Deun, Katrijn
    JOURNAL OF APPLIED STATISTICS, 2020, 47 (10) : 1720 - 1738
  • [46] BClass: A Bayesian approach based on mixture models for clustering and classification of heterogeneous biological data
    Medrano-Soto, A
    Christen, JA
    Collado-Vides, J
    JOURNAL OF STATISTICAL SOFTWARE, 2005, 13 (02): : 1 - 18
  • [47] Bayesian mixture models for complex high dimensional count data in phage display experiments
    Ji, Yuan
    Yin, Guosheng
    Tsui, Kam-Wah
    Kolonin, Mikhail G.
    Sun, Jessica
    Arap, Wadih
    Pasqualini, Renata
    Do, Kim-Anh
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2007, 56 : 139 - 152
  • [48] Bayesian meta-analysis for longitudinal data models using multivariate mixture priors
    Lopes, HF
    Müller, P
    Rosner, GL
    BIOMETRICS, 2003, 59 (01) : 66 - 75
  • [49] Approximate Bayesian inference for mixture cure models
    E. Lázaro
    C. Armero
    V. Gómez-Rubio
    TEST, 2020, 29 : 750 - 767
  • [50] Approximate Bayesian computation for finite mixture models
    Simola, Umberto
    Cisewski-Kehe, Jessi
    Wolpert, Robert L.
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2021, 91 (06) : 1155 - 1174