Unidimensional Clustering of Discrete Data Using Latent Tree Models

被引:0
|
作者
Liu, April H. [1 ]
Poon, Leonard K. M. [2 ]
Zhang, Nevin L. [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] Hong Kong Inst Educ, Dept Math & Informat Technol, Hong Kong, Peoples R China
来源
PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2015年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper is concerned with model-based clustering of discrete data. Latent class models (LCMs) are usually used for the task. An LCM consists of a latent variable and a number of attributes. It makes the overly restrictive assumption that the attributes are mutually independent given the latent variable. We propose a novel method to relax the assumption. The key idea is to partition the attributes into groups such that correlations among the attributes in each group can be properly modeled by using one single latent variable. The latent variables for the attribute groups are then used to build a number of models and one of them is chosen to produce the clustering results. Extensive empirical studies have been conducted to compare the new method with LCM and several other methods (K-means, kernel K means and spectral clustering) that are not model-based. The new method outperforms the alternative methods in most cases and the differences are often large.
引用
收藏
页码:2771 / 2777
页数:7
相关论文
共 50 条
  • [11] Hierarchical clustering with discrete latent variable models and the integrated classification likelihood
    Etienne Côme
    Nicolas Jouvin
    Pierre Latouche
    Charles Bouveyron
    Advances in Data Analysis and Classification, 2021, 15 : 957 - 986
  • [12] Hierarchical clustering with discrete latent variable models and the integrated classification likelihood
    Come, Etienne
    Jouvin, Nicolas
    Latouche, Pierre
    Bouveyron, Charles
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2021, 15 (04) : 957 - 986
  • [13] Bayesian Pyramids: identifiable multilayer discrete latent structure models for discrete data
    Gu, Yuqi
    Dunson, David B.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2023, 85 (02) : 399 - 426
  • [14] CONTINUOUS AND DISCRETE LATENT STRUCTURE MODELS FOR ITEM RESPONSE DATA
    HAERTEL, EH
    PSYCHOMETRIKA, 1990, 55 (03) : 477 - 494
  • [15] Discrete latent Markov models for normally distributed response data
    Schmittmann, VD
    Dolan, CV
    Van der Maas, HLJ
    Neale, MC
    MULTIVARIATE BEHAVIORAL RESEARCH, 2005, 40 (04) : 461 - 488
  • [16] UNIDIMENSIONAL MEASUREMENT AND STRUCTURAL EQUATION MODELS WITH LATENT-VARIABLES
    DANES, JE
    MANN, OK
    JOURNAL OF BUSINESS RESEARCH, 1984, 12 (03) : 337 - 352
  • [17] Discrete Latent Variable Models
    Bartolucci, Francesco
    Pandolfi, Silvia
    Pennoni, Fulvia
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, 2022, 9 : 425 - 452
  • [18] Multiview Clustering: A Late Fusion Approach Using Latent Models
    Bruno, Eric
    Marchand-Maillet, Stephane
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 736 - 737
  • [19] On discrete data clustering
    Bouguila, Nizar
    EIGuebaly, Walid
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 503 - 510
  • [20] Predictive Discrete Latent Factor Models for Large Scale Dyadic Data
    Agarwal, Deepak
    Merugu, Srujana
    KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2007, : 26 - 35