Count Data Clustering using Unsupervised Localized Feature Selection and Outliers Rejection

被引:1
|
作者
Bouguila, Nizar [1 ]
机构
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Quebec City, PQ, Canada
关键词
Mixture models; count data; outliers; feature selection; clustering; texture; images categorization; DIRICHLET MIXTURE MODEL; SCENE;
D O I
10.1109/ICTAI.2011.174
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an unsupervised statistical model for simultaneous clustering, feature selection and outlier rejection in the case of count data. The proposed model is based on a finite discrete mixture to which a uniform component is added to ensure robustness to outliers and noise. The consideration of a finite mixture model is justified by its flexibility, its solid grounding in the theory of statistics and its competitive results. We derive a complete maximum a posteriori learning approach that does not require a priori knowledge about the number of outliers and the number of clusters. A rigorous expectation maximization (EM) algorithm, based on the formulation of a maximum a posteriori (MAP) estimation, is also provided. We report experimental results of applying our model to the challenging problems of visual scenes categorization and texture discrimination.
引用
收藏
页码:1020 / 1027
页数:8
相关论文
共 50 条
  • [1] Simultaneous Non-gaussian Data Clustering, Feature Selection and Outliers Rejection
    Bouguila, Nizar
    Ziou, Djemel
    Boutemedjet, Sabri
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, 2011, 6744 : 364 - 369
  • [2] Unsupervised Feature Selection with Feature Clustering
    Cheung, Yiu-ming
    Jia, Hong
    2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 1, 2012, : 9 - 15
  • [3] Localized feature selection for clustering
    Li, Yuanhong
    Dong, Ming
    Hua, Jing
    PATTERN RECOGNITION LETTERS, 2008, 29 (01) : 10 - 18
  • [4] Unsupervised feature selection for balanced clustering
    Zhou, Peng
    Chen, Jiangyong
    Fan, Mingyu
    Du, Liang
    Shen, Yi-Dong
    Li, Xuejun
    KNOWLEDGE-BASED SYSTEMS, 2020, 193
  • [5] An Unsupervised Attribute Clustering Algorithm for Unsupervised Feature Selection
    Zhou, Pei-Yuan
    Chan, Keith C. C.
    PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (IEEE DSAA 2015), 2015, : 710 - 716
  • [6] Feature Selection Using Differential Evolution for Unsupervised Image Clustering
    Gutoski, Matheus
    Ribeiro, Manasses
    Romero Aquino, Nelson Marcelo
    Hattori, Leandro Takeshi
    Lazzaretti, Andre Eugenio
    Lopes, Heitor Silverio
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2018, PT I, 2018, 10841 : 376 - 385
  • [7] Unsupervised Feature Selection for Proportional Data Clustering via Expectation Propagation
    Fan, Wentao
    Bouguila, Nizar
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [8] A Novel Unsupervised Feature Selection Method for Bioinformatics Data Sets through Feature Clustering
    Li, Guangrong
    Hu, Xiaohua
    Shen, Xiajiong
    Chen, Xin
    Li, Zhoujun
    2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, : 41 - +
  • [9] Unsupervised Feature Selection for Histogram-Valued Symbolic Data Using Hierarchical Conceptual Clustering
    Ichino, Manabu
    Umbleja, Kadri
    Yaguchi, Hiroyuki
    STATS, 2021, 4 (02): : 359 - 384
  • [10] Big data analysis using a parallel ensemble clustering architecture and an unsupervised feature selection approach
    Wang, Yubo
    Saraswat, Shelesh Krishna
    Komari, Iraj Elyasi
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (01) : 270 - 282