Unidimensional Clustering of Discrete Data Using Latent Tree Models

Authors
Liu, April H. [1]
Poon, Leonard K. M. [2]
Zhang, Nevin L. [1]
Affiliations
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] Hong Kong Inst Educ, Dept Math & Informat Technol, Hong Kong, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper is concerned with model-based clustering of discrete data. Latent class models (LCMs) are usually used for the task. An LCM consists of a latent variable and a number of attributes, and it makes the overly restrictive assumption that the attributes are mutually independent given the latent variable. We propose a novel method to relax this assumption. The key idea is to partition the attributes into groups such that the correlations among the attributes in each group can be properly modeled by a single latent variable. The latent variables for the attribute groups are then used to build a number of models, and one of them is chosen to produce the clustering results. Extensive empirical studies have been conducted to compare the new method with LCM and with several methods that are not model-based (K-means, kernel K-means, and spectral clustering). The new method outperforms the alternative methods in most cases, and the differences are often large.
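
To make the independence assumption concrete, the following is a minimal sketch of how a plain LCM (the baseline the abstract describes, not the proposed method) assigns a record to a cluster. Under local independence, the joint probability of a record and a class is the class prior times the product of per-attribute conditional probabilities. The function name `lcm_posterior` and all parameter values are hypothetical and chosen only for illustration; they do not come from the paper.

```python
from math import prod

def lcm_posterior(prior, cpts, x):
    """Posterior P(Y = k | x) under a latent class model.

    prior[k]      : P(Y = k), the class prior
    cpts[i][k][v] : P(X_i = v | Y = k), one table per attribute
    x[i]          : observed value of attribute i
    """
    # Local independence: P(x, Y = k) = P(Y = k) * prod_i P(x_i | Y = k)
    joint = [
        prior[k] * prod(cpts[i][k][x[i]] for i in range(len(x)))
        for k in range(len(prior))
    ]
    total = sum(joint)  # marginal likelihood P(x)
    return [j / total for j in joint]

# Hypothetical toy parameters: 2 latent classes, 3 binary attributes.
prior = [0.5, 0.5]
cpts = [
    [[0.9, 0.1], [0.2, 0.8]],  # attribute 0: rows are classes, columns are values
    [[0.8, 0.2], [0.3, 0.7]],  # attribute 1
    [[0.7, 0.3], [0.1, 0.9]],  # attribute 2
]

post = lcm_posterior(prior, cpts, [0, 0, 0])
# Hard clustering: pick the class with the largest posterior probability.
cluster = max(range(len(post)), key=post.__getitem__)
```

Because every attribute contributes an independent factor, strongly correlated attributes are counted as separate pieces of evidence for the same class. That is the restriction the abstract proposes to relax by grouping attributes under their own latent variables.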
Pages: 2771-2777
Number of pages: 7