Count Data Clustering using Unsupervised Localized Feature Selection and Outliers Rejection

被引:1
|
作者
Bouguila, Nizar [1 ]
机构
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Quebec City, PQ, Canada
关键词
Mixture models; count data; outliers; feature selection; clustering; texture; images categorization; DIRICHLET MIXTURE MODEL; SCENE;
D O I
10.1109/ICTAI.2011.174
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an unsupervised statistical model for simultaneous clustering, feature selection and outlier rejection in the case of count data. The proposed model is based on a finite discrete mixture to which a uniform component is added to ensure robustness to outliers and noise. The consideration of a finite mixture model is justified by its flexibility, its solid grounding in the theory of statistics and its competitive results. We derive a complete maximum a posteriori learning approach that does not require a priori knowledge about the number of outliers and the number of clusters. A rigorous expectation maximization (EM) algorithm, based on the formulation of a maximum a posteriori (MAP) estimation, is also provided. We report experimental results of applying our model to the challenging problems of visual scenes categorization and texture discrimination.
引用
收藏
页码:1020 / 1027
页数:8
相关论文
共 50 条
  • [21] Filter Feature Selection for Unsupervised Clustering of Designer Drugs Using DFT Simulated IR Spectra Data
    He, Kedan
    ACS OMEGA, 2021, 6 (47): : 32151 - 32165
  • [22] Unsupervised feature selection for text data
    Wiratunga, Nirmalie
    Lothian, Rob
    Massie, Stewart
    ADVANCES IN CASE-BASED REASONING, PROCEEDINGS, 2006, 4106 : 340 - 354
  • [23] Unsupervised Feature Selection for Linked Data
    Nemade, Rachana T.
    Makhijani, Richa
    2014 RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (ICRAIE), 2014,
  • [24] Unsupervised Feature Selection for Noisy Data
    Mahdavi, Kaveh
    Labarta, Jesus
    Gimenez, Judit
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2019, 2019, 11888 : 79 - 94
  • [25] Text clustering with feature selection by using statistical data
    Li, Yanjun
    Luo, Congnan
    Chung, Soon M.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (05) : 641 - 652
  • [26] Feature selection in unsupervised context: Clustering based approach
    Klepaczko, A
    Materka, A
    Computer Recognition Systems, Proceedings, 2005, : 219 - 226
  • [27] Empirical Study on Unsupervised Feature Selection for Document Clustering
    Mackute-Varoneckiene, Ausra
    Krilavicius, Tomas
    HUMAN LANGUAGE TECHNOLOGIES - THE BALTIC PERSPECTIVE, BALTIC HLT 2014, 2014, 268 : 107 - +
  • [28] Spectral Clustering Based Unsupervised Feature Selection Algorithms
    Xie J.-Y.
    Ding L.-J.
    Wang M.-Z.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (04): : 1009 - 1024
  • [29] Unsupervised Simultaneous Orthogonal Basis Clustering Feature Selection
    Han, Dongyoon
    Kim, Junmo
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 5016 - 5023
  • [30] Subspace Clustering via Joint Unsupervised Feature Selection
    Dong, Wenhua
    Wu, Xiao-Jun
    Li, Hui
    Feng, Zhen-Hua
    Kittler, Josef
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3892 - 3898