Unsupervised nested Dirichlet finite mixture model for clustering

被引:0
|
作者
Fares Alkhawaja
Nizar Bouguila
机构
[1] Concordia University,Concordia Institute for Information Systems Engineering
来源
Applied Intelligence | 2023年 / 53卷
关键词
Nested Dirichlet distribution; Dirichlet-tree distribution; Minimum message length; Finite mixtures; Hierarchical learning;
D O I
暂无
中图分类号
学科分类号
摘要
The Dirichlet distribution is widely used in the context of mixture models. Despite its flexibility, it still suffers from some limitations, such as its restrictive covariance matrix and its direct proportionality between its mean and variance. In this work, a generalization over the Dirichlet distribution, namely the Nested Dirichlet distribution, is introduced in the context of finite mixture model providing more flexibility and overcoming the mentioned drawbacks, thanks to its hierarchical structure. The model learning is based on the generalized expectation-maximization algorithm, where parameters are initialized with the method of moments and estimated through the iterative Newton-Raphson method. Moreover, the minimum message length criterion is proposed to determine the best number of components that describe the data clusters by the finite mixture model. The Nested Dirichlet distribution is proven to be part of the exponential family, which offers several advantages, such as the calculation of several probabilistic distances in closed forms. The performance of the Nested Dirichlet mixture model is compared to the Dirichlet mixture model, the generalized Dirichlet mixture model, and the Convolutional Neural Network as a deep learning network. The excellence of the powerful proposed framework is validated through this comparison via challenging datasets. The hierarchical feature of the model is applied to real-world challenging tasks such as hierarchical cluster analysis and hierarchical feature learning, showing a significant improvement in terms of accuracy.
引用
收藏
页码:25232 / 25258
页数:26
相关论文
共 50 条
  • [31] A Dirichlet process mixture model for clustering longitudinal gene expression data
    Sun, Jiehuan
    Herazo-Maya, Jose D.
    Kaminski, Naftali
    Zhao, Hongyu
    Warren, Joshua L.
    STATISTICS IN MEDICINE, 2017, 36 (22) : 3495 - 3506
  • [32] A Spatial Dirichlet Process Mixture Model for Clustering Population Genetics Data
    Reich, Brian J.
    Bondell, Howard D.
    BIOMETRICS, 2011, 67 (02) : 381 - 390
  • [33] Urban Activity Clustering Method Based on Dirichlet Process Mixture Model
    Chen Z.
    Jiaotong Yunshu Xitong Gongcheng Yu Xinxi/Journal of Transportation Systems Engineering and Information Technology, 2020, 20 (06): : 247 - 252
  • [34] Unsupervised Learning Using Variational Inference on Finite Inverted Dirichlet Mixture Models with Component Splitting
    Maanicshah, Kamal
    Amayri, Manar
    Bouguila, Nizar
    Fan, Wentao
    WIRELESS PERSONAL COMMUNICATIONS, 2021, 119 (02) : 1817 - 1844
  • [35] Unsupervised Learning Using Variational Inference on Finite Inverted Dirichlet Mixture Models with Component Splitting
    Kamal Maanicshah
    Manar Amayri
    Nizar Bouguila
    Wentao Fan
    Wireless Personal Communications, 2021, 119 : 1817 - 1844
  • [36] IMAGE DATABASE CATEGORIZATION USING ROBUST UNSUPERVISED LEARNING OF FINITE GENERALIZED DIRICHLET MIXTURE MODELS
    Ben Ismail, M. Maher
    Frigui, Hichem
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [37] Data Clustering using Online Variational Learning of Finite Scaled Dirichlet Mixture Models
    Nguyen, Hieu
    Kalra, Meeta
    Azam, Muhammad
    Bouguila, Nizar
    2019 IEEE 20TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2019), 2019, : 267 - 274
  • [38] Model Selection and Estimation of a Finite Shifted-Scaled Dirichlet Mixture Model
    Alsuroji, Rua
    Zamzami, Nuha
    Bouguila, Nizar
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 707 - 713
  • [39] Unsupervised Variational Learning of Finite Generalized Inverted Dirichlet Mixture Models with Feature Selection and Component Splitting
    Maanicshah, Kamal
    Ali, Samr
    Fan, Wentao
    Bouguila, Nizar
    IMAGE ANALYSIS AND RECOGNITION (ICIAR 2019), PT II, 2019, 11663 : 94 - 105
  • [40] A hybrid SEM algorithm for high-dimensional unsupervised learning using a finite generalized dirichlet mixture
    Bouguila, Nizar
    Ziou, Djemel
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (09) : 2657 - 2668