Count Data Clustering using Unsupervised Localized Feature Selection and Outliers Rejection

被引:1
|
作者
Bouguila, Nizar [1 ]
机构
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Quebec City, PQ, Canada
关键词
Mixture models; count data; outliers; feature selection; clustering; texture; images categorization; DIRICHLET MIXTURE MODEL; SCENE;
D O I
10.1109/ICTAI.2011.174
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an unsupervised statistical model for simultaneous clustering, feature selection and outlier rejection in the case of count data. The proposed model is based on a finite discrete mixture to which a uniform component is added to ensure robustness to outliers and noise. The consideration of a finite mixture model is justified by its flexibility, its solid grounding in the theory of statistics and its competitive results. We derive a complete maximum a posteriori learning approach that does not require a priori knowledge about the number of outliers and the number of clusters. A rigorous expectation maximization (EM) algorithm, based on the formulation of a maximum a posteriori (MAP) estimation, is also provided. We report experimental results of applying our model to the challenging problems of visual scenes categorization and texture discrimination.
引用
收藏
页码:1020 / 1027
页数:8
相关论文
共 50 条
  • [41] Unsupervised Hierarchical Feature Selection on Networked Data
    Zhang, Yuzhe
    Chen, Chen
    Luo, Minnan
    Li, Jundong
    Yan, Caixia
    Zheng, Qinghua
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT III, 2020, 12114 : 137 - 153
  • [42] Unsupervised feature selection for large data sets
    de Amorim, Renato Cordeiro
    PATTERN RECOGNITION LETTERS, 2019, 128 : 183 - 189
  • [43] Unsupervised feature selection in linked biological data
    Elham Hoseini
    Eghbal G. Mansoori
    Pattern Analysis and Applications, 2019, 22 : 999 - 1013
  • [44] An unsupervised approach for feature selection in linked data
    Hoseini, Elham
    Mansoori, Eghbal G.
    2016 24TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2016, : 1881 - 1886
  • [45] Unsupervised feature selection based extreme learning machine for clustering
    Jichao Chen
    Yijie Zeng
    Yue Li
    Guang-Bin Huang
    NEUROCOMPUTING, 2020, 386 : 198 - 207
  • [46] A comparative study on unsupervised feature selection methods for text clustering
    Liu, LY
    Kang, JC
    Yu, J
    Wang, ZL
    PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 597 - 601
  • [47] Unsupervised Feature Selection through Fitness Proportionate Sharing Clustering
    Yan, Xuyang
    Homaifar, Abdollah
    Awogbami, Gabriel
    Girma, Abenezer
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 1355 - 1360
  • [48] Manifold Regularized Robust Unsupervised Feature Selection for Image Clustering
    Shi, Yuqing
    Du, Shiqiang
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 11161 - 11165
  • [49] Robust sparse orthogonal basis clustering for unsupervised feature selection
    Miao, Jianyu
    Zhao, Jingjing
    Yang, Tiejun
    Tian, Yingjie
    Shi, Yong
    Xu, Mingliang
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 274
  • [50] Application of rough set theory to feature selection for unsupervised clustering
    Questier, F
    Arnaut-Rollier, I
    Walczak, B
    Massart, DL
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2002, 63 (02) : 155 - 167