Unsupervised learning of Dirichlet process mixture models with missing data

被引:0
|
作者
Xunan ZHANG [1 ]
Shiji SONG [1 ]
Lei ZHU [2 ]
Keyou YOU [1 ]
Cheng WU [1 ]
机构
[1] Department of Automation, Tsinghua University
[2] China Ocean Mineral Resources R&D Association
基金
中国国家自然科学基金;
关键词
Dirichlet processes; missing data; clustering; variational Bayesian; image analysis;
D O I
暂无
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
This study presents a novel approach to unsupervised learning for clustering with missing data.We first extend a finite mixture model to the infinite case by considering Dirichlet process mixtures, which can automatically determine the number of mixture components or clusters. Furthermore, we view the missing features as latent variables and compute the posterior distributions using the variational Bayesian expectation maximization algorithm, which optimizes the evidence lower bound on the complete-data log marginal likelihood. We demonstrate the performance on several artificial data sets with missing values. The experimental results indicate that the proposed method outperforms some classic imputation methods. We finally present an application to seabed hydrothermal sulfide color images analysis problem.
引用
收藏
页码:161 / 174
页数:14
相关论文
共 50 条
  • [31] Distributed Inference for Dirichlet Process Mixture Models
    Ge, Hong
    Chen, Yutian
    Wan, Moquan
    Ghahramani, Zoubin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 2276 - 2284
  • [32] DIRICHLET PROCESS MIXTURE MODELS WITH MULTIPLE MODALITIES
    Paisley, John
    Carin, Lawrence
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1613 - 1616
  • [33] Background Subtraction with Dirichlet Process Mixture Models
    Haines, Tom S. F.
    Xiang, Tao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (04) : 670 - 683
  • [34] Collapsed Variational Dirichlet Process Mixture Models
    Kurihara, Kenichi
    Welling, Max
    Teh, Yee Whye
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 2796 - 2801
  • [35] High Dimensional Data Clustering by means of Distributed Dirichlet Process Mixture Models
    Meguelati, Khadidja
    Fontez, Benedicte
    Hilgert, Nadine
    Masseglia, Florent
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 890 - 899
  • [36] Bayesian curve fitting and clustering with Dirichlet process mixture models for microarray data
    Ju-Hyun Park
    Minjung Kyung
    Journal of the Korean Statistical Society, 2019, 48 : 207 - 220
  • [37] Bayesian curve fitting and clustering with Dirichlet process mixture models for microarray data
    Park, Ju-Hyun
    Kyung, Minjung
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2019, 48 (02) : 207 - 220
  • [38] Data Clustering using Variational Learning of Finite Scaled Dirichlet Mixture Models
    Hieu Nguyen
    Azam, Muhammad
    Bouguila, Nizar
    2019 IEEE 28TH INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2019, : 1391 - 1396
  • [39] Unsupervised Variational Learning of Finite Generalized Inverted Dirichlet Mixture Models with Feature Selection and Component Splitting
    Maanicshah, Kamal
    Ali, Samr
    Fan, Wentao
    Bouguila, Nizar
    IMAGE ANALYSIS AND RECOGNITION (ICIAR 2019), PT II, 2019, 11663 : 94 - 105
  • [40] Hybrid Dirichlet mixture models for functional data
    Petrone, Sonia
    Guindani, Michele
    Gelfand, Alan E.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2009, 71 : 755 - 782