Count Data Clustering using Unsupervised Localized Feature Selection and Outliers Rejection

被引:1
|
作者
Bouguila, Nizar [1 ]
机构
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Quebec City, PQ, Canada
关键词
Mixture models; count data; outliers; feature selection; clustering; texture; images categorization; DIRICHLET MIXTURE MODEL; SCENE;
D O I
10.1109/ICTAI.2011.174
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an unsupervised statistical model for simultaneous clustering, feature selection and outlier rejection in the case of count data. The proposed model is based on a finite discrete mixture to which a uniform component is added to ensure robustness to outliers and noise. The consideration of a finite mixture model is justified by its flexibility, its solid grounding in the theory of statistics and its competitive results. We derive a complete maximum a posteriori learning approach that does not require a priori knowledge about the number of outliers and the number of clusters. A rigorous expectation maximization (EM) algorithm, based on the formulation of a maximum a posteriori (MAP) estimation, is also provided. We report experimental results of applying our model to the challenging problems of visual scenes categorization and texture discrimination.
引用
收藏
页码:1020 / 1027
页数:8
相关论文
共 50 条
  • [31] Network Anomaly Detection Using Unsupervised Feature Selection and Density Peak Clustering
    Ni, Xiejun
    He, Daojing
    Chan, Sammy
    Ahmad, Farooq
    APPLIED CRYPTOGRAPHY AND NETWORK SECURITY, ACNS 2016, 2016, 9696 : 212 - 227
  • [32] Unsupervised feature selection via discrete spectral clustering and feature weights
    Shang, Ronghua
    Kong, Jiarui
    Wang, Lujuan
    Zhang, Weitong
    Wang, Chao
    Li, Yangyang
    Jiao, Licheng
    NEUROCOMPUTING, 2023, 517 : 106 - 117
  • [33] Integration of dense subgraph finding with feature clustering for unsupervised feature selection
    Bandyopadhyay, Sanghamitra
    Bhadra, Tapas
    Mitra, Pabitra
    Maulik, Ujjwal
    PATTERN RECOGNITION LETTERS, 2014, 40 : 104 - 112
  • [34] Unsupervised feature selection using feature similarity
    Mitra, P
    Murthy, CA
    Pal, SK
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (03) : 301 - 312
  • [35] Robust simultaneous positive data clustering and unsupervised feature selection using generalized inverted Dirichlet mixture models
    Al Mashrgy, Mohamed
    Bdiri, Taoufik
    Bouguila, Nizar
    KNOWLEDGE-BASED SYSTEMS, 2014, 59 : 182 - 195
  • [36] CGUFS: A clustering-guided unsupervised feature selection algorithm for gene expression data
    Xu, Zhaozhao
    Yang, Fangyuan
    Wang, Hong
    Sun, Junding
    Zhu, Hengde
    Wang, Shuihua
    Zhang, Yudong
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (09)
  • [37] A finite mixture model for simultaneous high-dimensional clustering, localized feature selection and outlier rejection
    Bouguila, Nizar
    Almakadmeh, Khaled
    Boutemedjet, Sabri
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (07) : 6641 - 6656
  • [38] Localized Graph-Based Feature Selection for Clustering
    Zhang, Zhihong
    Hancock, Edwin R.
    IMAGE ANALYSIS AND RECOGNITION, PT I, 2012, 7324 : 1 - 10
  • [39] Unsupervised feature selection using clustering ensembles and population based incremental learning algorithm
    Hong, Yi
    Kwong, Sam
    Chang, Yuchou
    Ren, Qingsheng
    PATTERN RECOGNITION, 2008, 41 (09) : 2742 - 2756
  • [40] Unsupervised feature selection in linked biological data
    Hoseini, Elham
    Mansoori, Eghbal G.
    PATTERN ANALYSIS AND APPLICATIONS, 2019, 22 (03) : 999 - 1013