A subspace hierarchical clustering algorithm for categorical data

被引:1
|
作者
Carbonera, Joel Luis [1 ]
Abel, Mara [1 ]
机构
[1] Univ Fed Rio Grande do Sul, Porto Alegre, RS, Brazil
关键词
K-MEANS;
D O I
10.1109/ICTAI.2019.00077
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a soft subspace hierarchical clustering for dealing with categorical data. The proposed algorithm extends the traditional agglomerative hierarchical clustering approach for identifying clusters of categorical data in subspaces. The algorithm adopts a correlation-based approach for measuring the relevance of each categorical attribute during the clustering process. We performed experiments on six well-known datasets, comparing the performance of our algorithms with the original agglomerative algorithm for hierarchical clustering and other five partitional subspace clustering algorithms, using two well-known evaluation metrics: accuracy and f-measure. According to the experiments, the proposed algorithm outperforms the original one. Besides that, the proposed algorithm outperforms most of the partitional algorithms, while provides additional advantages.
引用
收藏
页码:509 / 516
页数:8
相关论文
共 50 条
  • [1] Parallel Hierarchical Subspace Clustering of Categorical Data
    Pang, Ning
    Zhang, Jifu
    Zhang, Chaowei
    Qin, Xiao
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2019, 68 (04) : 542 - 555
  • [2] Kernel Subspace Clustering Algorithm for Categorical Data
    Xu, Kun-Peng
    Chen, Li-Fei
    Sun, Hao-Jun
    Wang, Bei-Zhan
    [J]. Ruan Jian Xue Bao/Journal of Software, 2020, 31 (11): : 3492 - 3505
  • [3] A hierarchical clustering algorithm for categorical sequence data
    Oh, SJ
    Kim, JY
    [J]. INFORMATION PROCESSING LETTERS, 2004, 91 (03) : 135 - 140
  • [4] An entropy-based subspace clustering algorithm for categorical data
    Carbonera, Joel Luis
    Abel, Mara
    [J]. 2014 IEEE 26TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2014, : 272 - 277
  • [5] PARTCAT: A subspace clustering algorithm for high dimensional categorical data
    Gan, Guojun
    Wu, Jianhong
    Yang, Zijiang
    [J]. 2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 4406 - +
  • [6] A weighting k-modes algorithm for subspace clustering of categorical data
    Cao, Fuyuan
    Liang, Jiye
    Li, Deyu
    Zhao, Xingwang
    [J]. NEUROCOMPUTING, 2013, 108 : 23 - 30
  • [7] A Subspace Clustering Algorithm of Categorical Data Using Multiple Attribute Weights
    [J]. Zhang, Ji-Fu (jifuzh@sina.com), 2018, Science Press (44):
  • [8] A hierarchical clustering algorithm for categorical attributes
    Agarwal, Parul
    Alam, M. Afshar
    Biswas, Ranjit
    [J]. 2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS: ICCEA 2010, PROCEEDINGS, VOL 2, 2010, : 365 - 368
  • [9] Subspace Clustering with Feature Grouping for Categorical Data
    Jia, Hong
    Dong, Menghan
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, KSEM 2023, 2023, 14117 : 247 - 254
  • [10] Ordering of categorical data in hierarchical clustering
    Kazimianec, Michail
    [J]. DATABASES AND INFORMATION SYSTEMS, 2008, : 401 - 404