Bayesian mixture models for complex high dimensional count data in phage display experiments

被引:5
|
作者
Ji, Yuan
Yin, Guosheng
Tsui, Kam-Wah
Kolonin, Mikhail G.
Sun, Jessica
Arap, Wadih
Pasqualini, Renata
Do, Kim-Anh
机构
[1] Univ Texas, MD Anderson Canc Ctr, Dept Bioinformat & Computat Biol, Houston, TX 77030 USA
[2] Univ Wisconsin, Madison, WI USA
关键词
Bayesian inference; Gibbs sampler; Markov chain Monte Carlo simulation; Metropolis-Hastings algorithm; peptide;
D O I
10.1111/j.1467-9876.2007.00570.x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Phage display is a biological process that is used to screen random peptide libraries for ligands that bind to a target of interest with high affinity. On the basis of a count data set from an innovative multistage phage display experiment, we propose a class of Bayesian mixture models to cluster peptide counts into three groups that exhibit different display patterns across stages. Among the three groups, the investigators are particularly interested in that with an ascending display pattern in the counts, which implies that the peptides are likely to bind to the target with strong affinity. We apply a Bayesian false discovery rate approach to identify the peptides with the strongest affinity within the group. A list of peptides is obtained, among which important ones with meaningful functions are further validated by biologists. To examine the performance of the Bayesian model, we conduct a simulation study and obtain desirable results.
引用
收藏
页码:139 / 152
页数:14
相关论文
共 50 条
  • [1] Semiparametric Bayesian Inference for Phage Display Data
    Leon-Novelo, Luis G.
    Mueller, Peter
    Arap, Wadih
    Kolonin, Mikhail
    Sun, Jessica
    Pasqualini, Renata
    Do, Kim-Anh
    BIOMETRICS, 2013, 69 (01) : 174 - 183
  • [2] Model selection and application to high-dimensional count data clusteringvia finite EDCM mixture models
    Nuha Zamzami
    Nizar Bouguila
    Applied Intelligence, 2019, 49 : 1467 - 1488
  • [3] Bayesian negative binomial mixture regression models for the analysis of sequence count and methylation data
    Li, Qiwei
    Cassese, Alberto
    Guindani, Michele
    Vannucci, Marina
    BIOMETRICS, 2019, 75 (01) : 183 - 192
  • [4] Model selection and application to high-dimensional count data clustering: via finite EDCM mixture models
    Zamzami, Nuha
    Bouguila, Nizar
    APPLIED INTELLIGENCE, 2019, 49 (04) : 1467 - 1488
  • [5] Bayesian approach for mixture models with grouped data
    Gau, Shiow-Lan
    Tapsoba, Jean de Dieu
    Lee, Shen-Ming
    COMPUTATIONAL STATISTICS, 2014, 29 (05) : 1025 - 1043
  • [6] Bayesian approach for mixture models with grouped data
    Shiow-Lan Gau
    Jean de Dieu Tapsoba
    Shen-Ming Lee
    Computational Statistics, 2014, 29 : 1025 - 1043
  • [7] Bayesian mixture models for cytometry data analysis
    Lin, Lin
    Hejblum, Boris P.
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2021, 13 (04)
  • [8] The Bayesian Analysis of Complex, High-Dimensional Models: Can It Be CODA?
    Ritov, Y.
    Bickel, P. J.
    Gamst, A. C.
    Kleijn, B. J. K.
    STATISTICAL SCIENCE, 2014, 29 (04) : 619 - 639
  • [9] Mixture models for capture-recapture count data
    Böhning D.
    Dietz E.
    Kuhnert R.
    Schön D.
    Statistical Methods and Applications, 2005, 14 (1) : 29 - 43
  • [10] Comparison of Poisson mixture models for count data clusterization
    Susinskas, J
    Radavicius, M
    INFORMATICA, 2002, 13 (02) : 209 - 226