Model Selection and Estimation of a Finite Shifted-Scaled Dirichlet Mixture Model

Cited by: 9
Authors
Alsuroji, Rua [1, 2]
Zamzami, Nuha [1, 3]
Bouguila, Nizar [1]
Affiliations
[1] Concordia Univ, CIISE, Montreal, PQ, Canada
[2] Umm Al Qura Univ, Coll Comp & Informat Syst, Mecca, Saudi Arabia
[3] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah, Saudi Arabia
Keywords
Data clustering; Medical sciences; Mixture models; Shifted-scaled Dirichlet distribution; Unsupervised learning; Writer identification;
DOI
10.1109/ICMLA.2018.00112
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes an unsupervised learning algorithm for a finite mixture model of shifted-scaled Dirichlet distributions. Parameters are estimated by maximum likelihood using the Newton-Raphson method. We address the limited flexibility of the Dirichlet distribution by adding a set of location parameters alongside the scale parameter, which yields more flexible probability models. We evaluate the model's ability to categorize data using both synthetic data and real data from three applications: in medical science, to help select a wart treatment method; in business, to detect the reasons behind employee absenteeism; and in writer identification, to determine the author of off-line handwritten documents. We also compare the model's performance against scaled Dirichlet, classic Dirichlet, and Gaussian mixture models. Finally, experimental results are presented on the selected datasets. In addition, we apply the minimum message length criterion to determine the optimal number of components in each dataset.
Pages: 707-713
Number of pages: 7