Unidimensional Clustering of Discrete Data Using Latent Tree Models

被引:0
|
作者
Liu, April H. [1 ]
Poon, Leonard K. M. [2 ]
Zhang, Nevin L. [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] Hong Kong Inst Educ, Dept Math & Informat Technol, Hong Kong, Peoples R China
来源
PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2015年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper is concerned with model-based clustering of discrete data. Latent class models (LCMs) are usually used for the task. An LCM consists of a latent variable and a number of attributes. It makes the overly restrictive assumption that the attributes are mutually independent given the latent variable. We propose a novel method to relax the assumption. The key idea is to partition the attributes into groups such that correlations among the attributes in each group can be properly modeled by using one single latent variable. The latent variables for the attribute groups are then used to build a number of models and one of them is chosen to produce the clustering results. Extensive empirical studies have been conducted to compare the new method with LCM and several other methods (K-means, kernel K means and spectral clustering) that are not model-based. The new method outperforms the alternative methods in most cases and the differences are often large.
引用
收藏
页码:2771 / 2777
页数:7
相关论文
共 50 条
  • [31] Learning Mixed Latent Tree Models
    Zhou, Can
    Wang, Xiaofei
    Guo, Jianhua
    JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [32] A Survey on Latent Tree Models and Applications
    Mourad, Raphael
    Sinoquet, Christine
    Zhang, Nevin L.
    Liu, Tengfei
    Leray, Philippe
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2013, 47 : 157 - 203
  • [33] Learning mixed latent tree models
    Zhou, Can
    Wang, Xiaofei
    Guo, Jianhua
    Journal of Machine Learning Research, 2020, 21
  • [34] Clustering Discrete Choice Data
    Vicari, Donatella
    Alfo, Marco
    COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, : 369 - 378
  • [35] Approximate Clustering on Data Streams Using Discrete Cosine Transform
    Yu, Feng
    Oyana, Damalie
    Hou, Wen-Chi
    Wainer, Michael
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2010, 6 (01): : 67 - 78
  • [36] Using association patterns for discrete-valed data clustering
    Wong, Andrew K. C.
    Li, Gary C. L.
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND APPLICATIONS, 2007, : 410 - +
  • [38] Mining features for biomedical data using clustering tree ensembles
    Pliakos, Konstantinos
    Vens, Celine
    JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 85 : 40 - 48
  • [39] Effects of data reduction when using Gaussian Mixture Models in unidimensional biometric signals
    Tirado-Martin, Paloma
    Sanchez-Reillo, Raul
    Park, Jongwon
    2018 52ND ANNUAL IEEE INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY (ICCST), 2018, : 48 - 52
  • [40] Spatial modeling of brain connectivity data via latent distance models with nodes clustering
    Aliverti, Emanuele
    Durante, Daniele
    STATISTICAL ANALYSIS AND DATA MINING, 2019, 12 (03) : 185 - 196