A k-means type clustering algorithm for subspace clustering of mixed numeric and categorical datasets

Cited by: 43
Authors
Ahmad, Amir [1]
Dey, Lipika [2]
Affiliations
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Rabigh, Saudi Arabia
[2] Tata Consultancy Serv, Innovat Labs, New Delhi, India
Keywords
Clustering; Subspace clustering; Mixed data; Categorical data
DOI
10.1016/j.patrec.2011.02.017
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Almost all subspace clustering algorithms proposed so far are designed for numeric datasets. In this paper, we present a k-means type clustering algorithm that finds clusters in data subspaces in mixed numeric and categorical datasets. In this method, we compute the contribution of each attribute to each cluster. We propose a new cost function for a k-means type algorithm. One advantage of the algorithm is that its complexity is linear in the number of data points. The algorithm is also useful for describing cluster formation in terms of the attributes' contributions to the different clusters. The algorithm is tested on various synthetic and real datasets to show its effectiveness. The clustering results are explained using the attribute weights in the clusters, and are also compared with published results. © 2011 Elsevier B.V. All rights reserved.
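The abstract does not give the cost function or the weight formula, so the following is only a minimal sketch of the general idea (a k-means type loop with per-cluster attribute weights on mixed data), not the authors' algorithm. Everything in it is an assumption made for illustration: the name `subspace_kmeans_sketch`, squared difference for numeric attributes, simple mismatch for categorical attributes, and weights set proportional to the inverse within-cluster dispersion with a smoothing constant `eps`. Numeric attributes are assumed to be pre-scaled to a comparable range.

```python
from collections import Counter


def weighted_distance(x, center, w, numeric_idx, categorical_idx):
    """Attribute-weighted distance between a point and a cluster center."""
    d = sum(w[j] * (x[j] - center[j]) ** 2 for j in numeric_idx)
    # Simple 0/1 mismatch for categorical attributes (an assumption).
    d += sum(w[j] * (x[j] != center[j]) for j in categorical_idx)
    return d


def subspace_kmeans_sketch(data, init_indices, numeric_idx, categorical_idx,
                           max_iter=20, eps=1e-3):
    """Hypothetical k-means type loop with one weight vector per cluster."""
    k, m = len(init_indices), len(data[0])
    centers = [list(data[i]) for i in init_indices]
    weights = [[1.0 / m] * m for _ in range(k)]  # start with uniform weights
    labels = None
    for _ in range(max_iter):
        # Assignment step: one pass over the data, so O(n * k * m) per iteration.
        new_labels = [
            min(range(k), key=lambda c: weighted_distance(
                x, centers[c], weights[c], numeric_idx, categorical_idx))
            for x in data
        ]
        if new_labels == labels:
            break  # assignments stable: converged
        labels = new_labels
        for c in range(k):
            members = [x for x, lab in zip(data, labels) if lab == c]
            if not members:
                continue  # keep the old center and weights for an empty cluster
            # Center update: mean for numeric, mode for categorical attributes.
            for j in numeric_idx:
                centers[c][j] = sum(x[j] for x in members) / len(members)
            for j in categorical_idx:
                centers[c][j] = Counter(x[j] for x in members).most_common(1)[0][0]
            # Weight update (assumed heuristic): low dispersion -> high weight.
            dispersion = [0.0] * m
            for j in numeric_idx:
                dispersion[j] = sum((x[j] - centers[c][j]) ** 2 for x in members)
            for j in categorical_idx:
                dispersion[j] = sum(x[j] != centers[c][j] for x in members)
            inv = [1.0 / (dj + eps) for dj in dispersion]
            total = sum(inv)
            weights[c] = [v / total for v in inv]
    return labels, centers, weights


# Toy data: attribute 0 (numeric) and 1 (categorical) separate the groups,
# attribute 2 (numeric) is noise.
data = [
    (0.10, "a", 0.5), (0.12, "a", 0.1), (0.09, "a", 0.9),
    (0.90, "b", 0.4), (0.91, "b", 0.2), (0.88, "b", 0.8),
]
labels, centers, weights = subspace_kmeans_sketch(
    data, init_indices=[0, 3], numeric_idx=[0, 2], categorical_idx=[1])
```

In this toy run the per-cluster weight vectors can be read as a description of each cluster: the noise attribute receives the smallest weight in both clusters, which mirrors the abstract's point that attribute weights help explain how the clusters are formed.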
Pages: 1062-1069
Number of pages: 8
Related papers
50 in total
  • [21] Unsupervised K-Means Clustering Algorithm
    Sinaga, Kristina P.
    Yang, Miin-Shen
    [J]. IEEE ACCESS, 2020, 8 : 80716 - 80727
  • [22] Granular K-means Clustering Algorithm
    Zhou, Chenglong
    Chen, Yuming
    Zhu, Yidong
    [J]. Computer Engineering and Applications, 2023, 59 (13) : 317 - 324
  • [23] An Improved K-means Clustering Algorithm
    Wang Yintong
    Li Wanlong
    Gao Rujia
    [J]. 2012 WORLD AUTOMATION CONGRESS (WAC), 2012,
  • [24] The MinMax k-Means clustering algorithm
    Tzortzis, Grigorios
    Likas, Aristidis
    [J]. PATTERN RECOGNITION, 2014, 47 (07) : 2505 - 2516
  • [25] Modified K-means clustering algorithm
    Li, Wei
    [J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 4, PROCEEDINGS, 2008, : 618 - 621
  • [26] Modified k-Means Clustering Algorithm
    Patel, Vaishali R.
    Mehta, Rupa G.
    [J]. COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY, 2011, 250 : 307 - +
  • [27] Improved K-means clustering algorithm
    Zhang, Zhe
    Zhang, Junxi
    Xue, Huifeng
    [J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 169 - 172
  • [28] The global k-means clustering algorithm
    Likas, A
    Vlassis, N
    Verbeek, JJ
    [J]. PATTERN RECOGNITION, 2003, 36 (02) : 451 - 461
  • [29] A k-means based clustering algorithm
    Bloisi, Domenico Daniele
    Iocchi, Luca
    [J]. COMPUTER VISION SYSTEMS, PROCEEDINGS, 2008, 5008 : 109 - 118
  • [30] Improved Algorithm for the k-means Clustering
    Zhang, Sheng
    Wang, Shouqiang
    [J]. PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 4717 - 4720