Robust simultaneous positive data clustering and unsupervised feature selection using generalized inverted Dirichlet mixture models

被引:31
|
作者
Al Mashrgy, Mohamed [1 ]
Bdiri, Taoufik [1 ]
Bouguila, Nizar [2 ]
机构
[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ H3G 1T7, Canada
[2] Concordia Univ, CIISE, Montreal, PQ H3G 1T7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Positive data; Generalized inverted Dirichlet; Finite mixture; Feature selection; Outliers; Model selection; Images clustering; VARIABLE SELECTION; REGRESSION;
D O I
10.1016/j.knosys.2014.01.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The discovery, extraction and analysis of knowledge from data rely generally upon the use of unsupervised learning methods, in particular clustering approaches. Much recent research in clustering and data engineering has focused on the consideration of finite mixture models which allow to reason in the face of uncertainty and to learn by example. The adoption of these models becomes a challenging task in the presence of outliers and in the case of high-dimensional data which necessitates the deployment of feature selection techniques. In this paper we tackle simultaneously the problems of cluster validation (i.e. model selection), feature selection and outliers rejection when clustering positive data. The proposed statistical framework is based on the generalized inverted Dirichlet distribution that offers a more practical and flexible alternative to the inverted Dirichlet which has a very restrictive covariance structure. The learning of the parameters of the resulting model is based on the minimization of a message length objective incorporating prior knowledge. We use synthetic data and real data generated from challenging applications, namely visual scenes and objects clustering, to demonstrate the feasibility and advantages of the proposed method. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:182 / 195
页数:14
相关论文
共 50 条
  • [1] Unsupervised feature and model selection for generalized Dirichlet mixture models
    Boutemedjet, Sabri
    Bouguila, Nizar
    Ziou, Djemel
    [J]. IMAGE ANALYSIS AND RECOGNITION, PROCEEDINGS, 2007, 4633 : 330 - +
  • [2] Unsupervised Variational Learning of Finite Generalized Inverted Dirichlet Mixture Models with Feature Selection and Component Splitting
    Maanicshah, Kamal
    Ali, Samr
    Fan, Wentao
    Bouguila, Nizar
    [J]. IMAGE ANALYSIS AND RECOGNITION (ICIAR 2019), PT II, 2019, 11663 : 94 - 105
  • [3] Positive vectors clustering using inverted Dirichlet finite mixture models
    Bdiri, Taoufik
    Bouguila, Nizar
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (02) : 1869 - 1882
  • [4] Simultaneous Bayesian clustering and feature selection using RJMCMC-based learning of finite generalized Dirichlet mixture models
    Elguebaly, Tarek
    Bouguila, Nizar
    [J]. SIGNAL PROCESSING, 2013, 93 (06) : 1531 - 1546
  • [5] Simultaneous feature selection and clustering using mixture models
    Law, MHC
    Figueiredo, MAT
    Jain, AK
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (09) : 1154 - 1166
  • [6] Unsupervised clustering and feature weighting based on Generalized Dirichlet mixture modeling
    Ben Ismail, Mohamed Maher
    Frigui, Hichem
    [J]. INFORMATION SCIENCES, 2014, 274 : 35 - 54
  • [7] Variational learning of a Dirichlet process of generalized Dirichlet distributions for simultaneous clustering and feature selection
    Fan, Wentao
    Bouguila, Nizar
    [J]. PATTERN RECOGNITION, 2013, 46 (10) : 2754 - 2769
  • [8] IMAGE DATABASE CATEGORIZATION USING ROBUST UNSUPERVISED LEARNING OF FINITE GENERALIZED DIRICHLET MIXTURE MODELS
    Ben Ismail, M. Maher
    Frigui, Hichem
    [J]. 2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [9] Online variational learning of generalized Dirichlet mixture models with feature selection
    Fan, Wentao
    Bouguila, Nizar
    [J]. NEUROCOMPUTING, 2014, 126 : 166 - 179
  • [10] Dirichlet Process Mixture of Generalized Inverted Dirichlet Distributions for Positive Vector Data With Extended Variational Inference
    Ma, Zhanyu
    Lai, Yuping
    Xie, Jiyang
    Meng, Deyu
    Kleijn, W. Bastiaan
    Guo, Jun
    Yu, Jingyi
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6089 - 6102