Unsupervised nested Dirichlet finite mixture model for clustering

被引:2
|
作者
Alkhawaja, Fares [1 ]
Bouguila, Nizar [1 ]
机构
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ, Canada
关键词
Nested Dirichlet distribution; Dirichlet-tree distribution; Minimum message length; Finite mixtures; Hierarchical learning; GENERALIZED DIRICHLET; INFORMATION; FRAMEWORK;
D O I
10.1007/s10489-023-04888-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Dirichlet distribution is widely used in the context of mixture models. Despite its flexibility, it still suffers from some limitations, such as its restrictive covariance matrix and its direct proportionality between its mean and variance. In this work, a generalization over the Dirichlet distribution, namely the Nested Dirichlet distribution, is introduced in the context of finite mixture model providing more flexibility and overcoming the mentioned drawbacks, thanks to its hierarchical structure. The model learning is based on the generalized expectation-maximization algorithm, where parameters are initialized with the method of moments and estimated through the iterative Newton-Raphson method. Moreover, the minimum message length criterion is proposed to determine the best number of components that describe the data clusters by the finite mixture model. The Nested Dirichlet distribution is proven to be part of the exponential family, which offers several advantages, such as the calculation of several probabilistic distances in closed forms. The performance of the Nested Dirichlet mixture model is compared to the Dirichlet mixture model, the generalized Dirichlet mixture model, and the Convolutional Neural Network as a deep learning network. The excellence of the powerful proposed framework is validated through this comparison via challenging datasets. The hierarchical feature of the model is applied to real-world challenging tasks such as hierarchical cluster analysis and hierarchical feature learning, showing a significant improvement in terms of accuracy.
引用
收藏
页码:25232 / 25258
页数:27
相关论文
共 50 条
  • [21] A Dirichlet Model of Alignment Cost in Mixed-Membership Unsupervised Clustering
    Liu, Xiran
    Kopelman, Naama M.
    Rosenberg, Noah A.
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2023, 32 (03) : 1145 - 1159
  • [22] A Dirichlet Mixture Model of Hawkes Processes for Event Sequence Clustering
    Xu, Hongteng
    Zha, Hongyuan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [23] Data Clustering using Variational Learning of Finite Scaled Dirichlet Mixture Models
    Hieu Nguyen
    Azam, Muhammad
    Bouguila, Nizar
    [J]. 2019 IEEE 28TH INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2019, : 1391 - 1396
  • [24] Unsupervised Clustering of Depth Images using Watson Mixture Model
    Hasnat, Md Abul
    Alata, Olivier
    Tremeau, Alain
    [J]. 2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 214 - 219
  • [25] High-dimensional unsupervised selection and estimation of a finite generalized Dirichlet mixture model based on minimum message length
    Bouguila, Nizar
    Ziou, Djemel
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (10) : 1716 - 1731
  • [26] Clustering distributions with the marginalized nested Dirichlet process
    Zuanetti, Daiane Aparecida
    Muller, Peter
    Zhu, Yitan
    Yang, Shengjie
    Ji, Yuan
    [J]. BIOMETRICS, 2018, 74 (02) : 584 - 594
  • [27] Nested Gibbs sampling for mixture-of-mixture model and its application to speaker clustering
    Tawara, Naohiro
    Ogawa, Tetsuji
    Watanabe, Shinji
    Kobayashi, Tetsunori
    [J]. APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2016, 5
  • [28] Clustering disaggregated load profiles using a Dirichlet process mixture model
    Granell, Ramon
    Axon, Colin J.
    Wallom, David C. H.
    [J]. ENERGY CONVERSION AND MANAGEMENT, 2015, 92 : 507 - 516
  • [29] An Adaptive Dirichlet Multinomial Mixture Model for Short Text Streaming Clustering
    Duan, Ruting
    Li, Chunping
    [J]. 2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 49 - 55
  • [30] Urban Activity Clustering Method Based on Dirichlet Process Mixture Model
    Chen, Zhong
    [J]. Jiaotong Yunshu Xitong Gongcheng Yu Xinxi/Journal of Transportation Systems Engineering and Information Technology, 2020, 20 (06): : 247 - 252