Partitioning gene expression data by data-driven Markov chain Monte Carlo

被引:4
|
作者
Saraiva, E. F. [1 ]
Suzuki, A. K. [2 ]
Louzada, F. [2 ]
Milan, L. A. [3 ]
机构
[1] Univ Fed Mato Grosso do Sul, Inst Matemat, Campo Grande, MS, Brazil
[2] Univ Sao Paulo, Dept Matemat Aplicada & Estat, BR-05508 Sao Paulo, SP, Brazil
[3] Univ Fed Sao Carlos, Dept Estat, BR-13560 Sao Carlos, SP, Brazil
关键词
Metropolis-Hastings; gene expression; Bayesian mixture model; split-and-merge update; Gibbs sampling; UNKNOWN NUMBER; MIXTURES;
D O I
10.1080/02664763.2015.1092113
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper we introduce a Bayesian mixture model with an unknown number of components for partitioning gene expression data. Inferences about all the unknown parameters involved are made by using the proposed data-driven Markov chain Monte Carlo. This algorithm is essentially a Metropolis-Hastings within Gibbs sampling. The Metropolis-Hastings is performed to change the number of partitions k in the neighborhood [GRAPHICS] and [GRAPHICS] using a pair of split-merge moves. Our strategy for splitting is based on data in which allocation probabilities are calculated based on marginal likelihood function from the previously allocated observations. Conditional on k, the partitions labels are updated via Gibbs sampling. The two main advantages of the proposed algorithm is that it is easy to be implemented and the acceptance probability for split-merge movements depends only on the observed data. We examine the performance of the proposed algorithm on simulated data and then analyze two publicly available gene expression data sets.
引用
收藏
页码:1155 / 1173
页数:19
相关论文
共 50 条
  • [1] Image segmentation by data-driven Markov Chain Monte Carlo
    Tu, ZW
    Zhu, SC
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) : 657 - 673
  • [2] Image segmentation by Data Driven Markov Chain Monte Carlo
    Tu, ZW
    Zhu, SC
    Shum, HY
    [J]. EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL II, PROCEEDINGS, 2001, : 131 - 138
  • [3] Estimating the Constant Elasticity of Variance Model with Data-Driven Markov Chain Monte Carlo Methods
    Xiao, Shuang
    Li, Guo
    Jia, Yunjing
    [J]. ASIA-PACIFIC JOURNAL OF OPERATIONAL RESEARCH, 2017, 34 (01)
  • [4] Effective Bayesian inference by Data-Driven Markov Chain Monte Carlo for object recognition and image segmentation
    Zhu, SC
    Tu, ZW
    Zhang, R
    [J]. AUTOMATIC TARGET RECOGNITION X, 2000, 4050 : 320 - 332
  • [5] Laser-based Detection and Tracking Moving Objects using Data-Driven Markov Chain Monte Carlo
    Vu, Trung-Dung
    Aycard, Olivier
    [J]. ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7, 2009, : 3952 - 3958
  • [6] A MARKOV CHAIN MONTE CARLO SAMPLER FOR GENE GENEALOGIES CONDITIONAL ON HAPLOTYPE DATA
    Burkett, K. M.
    McNeney, B.
    Graham, J.
    [J]. SOME RECENT ADVANCES IN MATHEMATICS & STATISTICS, 2013, : 29 - 44
  • [7] On Markov chain Monte Carlo methods for tall data
    Bardenet, Remi
    Doucet, Arnaud
    Holmes, Chris
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18 : 1 - 43
  • [8] Minimum-entropy data partitioning using reversible jump Markov chain Monte Carlo
    Roberts, SJ
    Holmes, C
    Denison, D
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (08) : 909 - 914
  • [9] Monte Carlo data-driven tight frame for seismic data recovery
    Yu, Siwei
    Ma, Jianwei
    Osher, Stanley
    [J]. GEOPHYSICS, 2016, 81 (04) : V327 - V340
  • [10] A framework for human pose estimation by integrating data-driven Markov chain Monte Carlo with multi-objective evolutionary algorithm
    Huang, Shih-Shinh
    Fu, Li-Chen
    Hsiao, Pei-Yung
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-10, 2006, : 3748 - +