An optimal data ordering scheme for Dirichlet process mixture models

被引:1
|
作者
Wang, Xue [1 ]
Walker, Stephen G. [2 ]
机构
[1] Univ Kent, Sch Math Stat & Actuarial Sci, Canterbury, Kent, England
[2] Univ Texas Austin, Dept Math, Austin, TX 78712 USA
关键词
Allocation variables; Predictive density; Optimal ordering; INFERENCE;
D O I
10.1016/j.csda.2017.02.010
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In recent years, there has been increasing interest in Bayesian nonparametric methods due to their flexibility, and the availability of Markov chain Monte Carlo (MCMC) methods for sampling from the posterior distribution. As MCMC methods are generally time consuming for computation, there is a need for faster methods, which can be executed within a matter of seconds. A fast alternative to MCMC for sampling the well known and widely used Dirichlet process mixture (DPM) model is investigated to draw approximate independent and identically distributed samples from the posterior distribution of the latent allocations, and then to draw samples from the weights and locations conditional on the allocations. To address the order depend issue of the proposed algorithm, an optimal ordering scheme based on a sequence of optimizations is proposed to first obtain an optimal order of the data, and then run the algorithm on this ordering. The fast sampling algorithm is assisted by parallel computing using commands within MATLAB. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:42 / 52
页数:11
相关论文
共 50 条
  • [1] Dirichlet process mixture models for insurance loss data
    Hong, Liang
    Martin, Ryan
    [J]. SCANDINAVIAN ACTUARIAL JOURNAL, 2018, (06) : 545 - 554
  • [2] Unsupervised learning of Dirichlet process mixture models with missing data
    Xunan ZHANG
    Shiji SONG
    Lei ZHU
    Keyou YOU
    Cheng WU
    [J]. Science China(Information Sciences), 2016, 59 (01) : 161 - 174
  • [3] Sampling in Dirichlet Process Mixture Models for Clustering Streaming Data
    Dinari, Or
    Freifeld, Oren
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151 : 818 - 835
  • [4] CLASSIFICATION OF MULTIVARIATE DATA USING DIRICHLET PROCESS MIXTURE MODELS
    Djuric, Petar M.
    Ferrari, Andre
    [J]. 2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 441 - 445
  • [5] Scalable Estimation of Dirichlet Process Mixture Models on Distributed Data
    Wang, Ruohui
    Lin, Dahua
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4632 - 4639
  • [6] Unsupervised learning of Dirichlet process mixture models with missing data
    Zhang, Xunan
    Song, Shiji
    Zhu, Lei
    You, Keyou
    Wu, Cheng
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2016, 59 (01) : 1 - 14
  • [7] Dirichlet Process Mixture Models with Pairwise Constraints for Data Clustering
    Li C.
    Rana S.
    Phung D.
    Venkatesh S.
    [J]. Annals of Data Science, 2016, 3 (2) : 205 - 223
  • [8] Estimating mixture of Dirichlet process models
    MacEachern, SN
    Muller, P
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1998, 7 (02) : 223 - 238
  • [9] Dirichlet process mixture models for non-stationary data streams
    Casado, Ioar
    Perez, Aritz
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 873 - 878
  • [10] DIRICHLET PROCESS MIXTURE MODELS FOR CLUSTERING I-VECTOR DATA
    Seshadri, Shreyas
    Remes, Ulpu
    Rasanen, Okko
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5470 - 5474