共 50 条
Clustering of Mixed Data by Integrating Fuzzy, Probabilistic, and Collaborative Clustering Framework
被引:0
|作者:
Arkanath Pathak
Nikhil R. Pal
机构:
[1] Indian Institute of Technology Kharagpur,Electronics and Communication Sciences Unit
[2] Indian Statistical Institute,undefined
来源:
关键词:
Fuzzy clustering;
Mixed data;
Mixture models;
Collaborative clustering;
D O I:
暂无
中图分类号:
学科分类号:
摘要:
Clustering of numerical data is a very well researched problem and so is clustering of categorical data. However, when it comes to clustering of data with mixed attributes, the literature is not that rich. For numerical data, fuzzy clustering, in particular, the fuzzy c-means (FCM), is a very effective and popular algorithm, while for categorical data, use of mixture model is quite popular. In this paper, we propose a novel framework for clustering of mixed data which contains both numerical and categorical attributes. Our objective is to find the cluster substructures that are common to both the categorical and numerical data. Our formulation is inspired by the FCM algorithm (for dealing with numerical data), mixture models (for dealing with categorical data), and the collaborative clustering framework for aggregation of the two—it is an integrated approach that judiciously uses all three components. We use our algorithm on a few commonly used datasets and compare our results with those by some state of the art methods.
引用
收藏
页码:339 / 348
页数:9
相关论文