Unsupervised nested Dirichlet finite mixture model for clustering

被引:0
|
作者
Fares Alkhawaja
Nizar Bouguila
机构
[1] Concordia University,Concordia Institute for Information Systems Engineering
来源
Applied Intelligence | 2023年 / 53卷
关键词
Nested Dirichlet distribution; Dirichlet-tree distribution; Minimum message length; Finite mixtures; Hierarchical learning;
D O I
暂无
中图分类号
学科分类号
摘要
The Dirichlet distribution is widely used in the context of mixture models. Despite its flexibility, it still suffers from some limitations, such as its restrictive covariance matrix and its direct proportionality between its mean and variance. In this work, a generalization over the Dirichlet distribution, namely the Nested Dirichlet distribution, is introduced in the context of finite mixture model providing more flexibility and overcoming the mentioned drawbacks, thanks to its hierarchical structure. The model learning is based on the generalized expectation-maximization algorithm, where parameters are initialized with the method of moments and estimated through the iterative Newton-Raphson method. Moreover, the minimum message length criterion is proposed to determine the best number of components that describe the data clusters by the finite mixture model. The Nested Dirichlet distribution is proven to be part of the exponential family, which offers several advantages, such as the calculation of several probabilistic distances in closed forms. The performance of the Nested Dirichlet mixture model is compared to the Dirichlet mixture model, the generalized Dirichlet mixture model, and the Convolutional Neural Network as a deep learning network. The excellence of the powerful proposed framework is validated through this comparison via challenging datasets. The hierarchical feature of the model is applied to real-world challenging tasks such as hierarchical cluster analysis and hierarchical feature learning, showing a significant improvement in terms of accuracy.
引用
收藏
页码:25232 / 25258
页数:26
相关论文
共 50 条
  • [41] Clustering of Laser Measurements via the Dirichlet Process Mixture Model for Object Tracking
    Lee, Yung-Chou
    Hsiao, Tesheng
    Chang, Chih-Tang
    2012 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2012, : 837 - 842
  • [42] Object Clustering With Dirichlet Process Mixture Model for Data Association in Monocular SLAM
    Wei, Songlin
    Chen, Guodong
    Chi, Wenzheng
    Wang, Zhenhua
    Sun, Lining
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 70 (01) : 594 - 603
  • [43] ChromDMM: a Dirichlet-multinomial mixture model for clustering heterogeneous epigenetic data
    Osmala, Maria
    Eraslan, Gokcen
    Lahdesmaki, Harri
    BIOINFORMATICS, 2022, 38 (16) : 3863 - 3870
  • [44] Improved Dirichlet mixture model clustering algorithm for medical data anomaly detection
    Wu, Lili
    Ali, Majid Khan Majahar
    Shan, Fam Pei
    Tian, Ying
    Tao, Li
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2025, 25 (01) : 11 - 21
  • [45] Speaker Clustering Based on Utterance-oriented Dirichlet Process Mixture Model
    Tawara, Naohiro
    Watanabe, Shinji
    Ogawa, Tetsuji
    Kobayashi, Tetsunori
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2916 - +
  • [46] Tensor Dirichlet Process Multinomial Mixture Model with Graphs for Passenger Trajectory Clustering
    Li, Ziyue
    Yan, Hao
    Zhang, Chen
    Ketter, Wolfgang
    Tsung, Fugee
    PROCEEDINGS OF THE 6TH ACM SIGSPATIAL INTERNATIONAL WORKSHOP ON AI FOR GEOGRAPHIC KNOWLEDGE DISCOVERY, GEOAI 2023, 2023, : 121 - 128
  • [47] Simultaneous inference for multiple testing and clustering via a Dirichlet, process mixture model
    Dahl, David B.
    Mo, Qianxing
    Vannucci, Marina
    STATISTICAL MODELLING, 2008, 8 (01) : 23 - 39
  • [48] A Dirichlet Multinomial Mixture Model-based Approach for Short Text Clustering
    Yin, Jianhua
    Wang, Jianyong
    PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 233 - 242
  • [49] Robust simultaneous positive data clustering and unsupervised feature selection using generalized inverted Dirichlet mixture models
    Al Mashrgy, Mohamed
    Bdiri, Taoufik
    Bouguila, Nizar
    KNOWLEDGE-BASED SYSTEMS, 2014, 59 : 182 - 195
  • [50] Assessing Search and Unsupervised Clustering Algorithms in Nested Sampling
    Maillard, Lune
    Finocchi, Fabio
    Trassinelli, Martino
    ENTROPY, 2023, 25 (02)