Attribute weights-based clustering centres algorithm for initialising K-modes clustering

Cited by: 2
Authors
Liwen Peng
Yongguo Liu
Affiliations
[1] University of Electronic Science and Technology of China, Knowledge and Data Engineering Laboratory of Chinese Medicine, School of Information and Software Engineering
Source
Cluster Computing | 2019 / Vol. 22
Keywords
Clustering centers; Weight; Density; Distance
DOI
Not available
Abstract
The K-modes algorithm, a partitional clustering technique, is a popular and effective method for clustering categorical data. However, its performance depends heavily on the initial clustering centres. Random selection of the initial clustering centres commonly leads to non-repeatable clustering results. Hence, a suitable choice of the initial clustering centres is crucial to achieving high-performance K-modes clustering. The present article develops an initialisation algorithm for K-modes. At initialisation, the distance between two instances is calculated after weighting the attributes of the instances. Many studies have shown that if clustering is based only on the distance or only on the density between instances, the clustering tends to revolve around a single centre or around outliers. Therefore, based on the attribute weights, we combine the distance and density measures to select the clustering centres. In experiments on several benchmark datasets from the UCI Machine Learning Repository, the new initialisation method outperformed the existing K-modes clustering methods.
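The abstract gives no formulas, so the following Python sketch only illustrates the general idea of combining an attribute-weighted distance with a density measure to pick initial centres. Everything specific in it is an assumption made for illustration and is not taken from the paper: the entropy-based weights, the density defined as one minus the mean weighted distance, the density-times-distance score, and the function names `attribute_weights`, `weighted_distance`, `density` and `init_centres`.

```python
import math
from collections import Counter


def attribute_weights(data):
    """Assumed weighting: normalised Shannon entropy of each attribute."""
    n, m = len(data), len(data[0])
    raw = []
    for j in range(m):
        counts = Counter(row[j] for row in data)
        raw.append(-sum((c / n) * math.log(c / n) for c in counts.values()))
    total = sum(raw)
    if total == 0:  # every attribute is constant: fall back to equal weights
        return [1.0 / m] * m
    return [r / total for r in raw]


def weighted_distance(x, y, weights):
    """Weighted simple-matching distance: sum the weights of mismatching attributes."""
    return sum(w for a, b, w in zip(x, y, weights) if a != b)


def density(i, data, weights):
    """Assumed density: one minus the mean weighted distance to all instances."""
    n = len(data)
    return 1.0 - sum(weighted_distance(data[i], row, weights) for row in data) / n


def init_centres(data, k):
    """Pick k initial centres: the densest instance first, then repeatedly the
    instance that maximises density times distance to the nearest chosen centre."""
    weights = attribute_weights(data)
    n = len(data)
    dens = [density(i, data, weights) for i in range(n)]
    chosen = [max(range(n), key=lambda i: dens[i])]
    while len(chosen) < k:
        def score(i):
            nearest = min(weighted_distance(data[i], data[c], weights) for c in chosen)
            return dens[i] * nearest
        candidates = [i for i in range(n) if i not in chosen]
        chosen.append(max(candidates, key=score))
    return [data[i] for i in chosen]


if __name__ == "__main__":
    toy = [("a", "x"), ("a", "x"), ("a", "y"),
           ("b", "z"), ("b", "z"), ("b", "w")]
    print(init_centres(toy, 2))
```

Multiplying density by the distance to the nearest chosen centre is one simple way to avoid both failure modes the abstract mentions: a dense instance close to an existing centre scores low, and so does a distant but isolated outlier. Because the selection is deterministic, repeated runs on the same data return the same centres.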
Pages: 6171 - 6179
Page count: 8
Related papers
50 in total
  • [21] Cluster center initialization algorithm for K-modes clustering
    Khan, Shehroz S.
    Ahmad, Amir
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (18) : 7444 - 7456
  • [22] A weighting k-modes algorithm for subspace clustering of categorical data
    Cao, Fuyuan
    Liang, Jiye
    Li, Deyu
    Zhao, Xingwang
    NEUROCOMPUTING, 2013, 108 : 23 - 30
  • [23] Approximation algorithms for K-modes clustering
    He, Zengyou
    Deng, Shengchun
    Xu, Xiaofei
    COMPUTATIONAL INTELLIGENCE, PT 2, PROCEEDINGS, 2006, 4114 : 296 - 302
  • [24] A load clustering algorithm based on discrete wavelet transform and fuzzy K-modes
    Zhang J.
    Zhang Y.
    Hong J.
    Gao H.
    Liu J.
    Dianli Zidonghua Shebei/Electric Power Automation Equipment, 2019, 39 (02) : 100 - 106 and 122
  • [25] DP-k-modes: A self-tuning k-modes clustering algorithm
    Xie, Juanying
    Wang, Mingzhao
    Lu, Xiaoxiao
    Liu, Xinglin
    Grant, Philip W.
    Pattern Recognition Letters, 2022, 158 : 117 - 124
  • [26] A genetic fuzzy k-Modes algorithm for clustering categorical data
    Gan, G.
    Wu, J.
    Yang, Z.
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 1615 - 1620
  • [27] Computation of Initial Modes for K-modes Clustering Algorithm using Evidence Accumulation
    Khan, Shehroz S.
    Kant, Shri
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 2784 - 2789
  • [28] A Global-Relationship Dissimilarity Measure for the k-Modes Clustering Algorithm
    Zhou, Hongfang
    Zhang, Yihui
    Liu, Yibin
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2017, 2017
  • [29] Initialization of K-Modes Clustering for Categorical Data
    Li Tao-ying
    Chen Yan
    Jin Zhi-hong
    Li Ye
    2013 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING (ICMSE), 2013, : 107 - 112
  • [30] A Moving Shape-based Robust Fuzzy K-modes Clustering Algorithm for Electricity Profiles
    Liu, Chang
    Wang, Xiaodi
    Huang, Yuan
    Liu, Youbo
    Li, Ran
    Li, Yang
    Liu, Junyong
    ELECTRIC POWER SYSTEMS RESEARCH, 2020, 187