Automated Attribute Weighting Fuzzy k-Centers Algorithm for Categorical Data Clustering

Cited by: 4
Authors
Mau, Toan Nguyen [1]
Huynh, Van-Nam [1]
Affiliations
[1] Japan Adv Inst Sci & Technol, Sch Adv Sci & Technol, Nomi, Ishikawa, Japan
Keywords
Fuzzy clustering; Categorical data; k-representatives; k-centers; MODES ALGORITHM;
DOI
10.1007/978-3-030-85529-1_17
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Cluster analysis plays an important role in exploring the correlations in data by dividing datasets into separate clusters so that similar objects are located in the same cluster. Moreover, fuzzy cluster analysis can reveal the mixtures of clusters in datasets containing multiple distributions. The outcome of a clustering method is largely determined by its definition of similarity, so the similarity measure is central to the formation of fuzzy clusters. The similarity between two objects is usually calculated as the mean of the differences across multiple dimensions. However, the dissimilarity in some dimensions has little or no effect on the fuzzy clustering outcome. In this study, we explore such impacts for fuzzy clustering of data with categorical attributes. Accordingly, the impact of each attribute on each fuzzy cluster is calculated using an optimizer, and the overlapping dissimilar values are then adjusted by the corresponding weights. We propose to apply this approach to the Fk-centers clustering algorithm, and the experimental results show that our proposed method can achieve higher fuzzy silhouette scores than other related works. These results demonstrate the applicability of the proposed method in real-world applications.
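The abstract does not spell out the weighting formula or the optimizer, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a fuzzy center is stored as one value-frequency distribution per categorical attribute, and that each cluster carries one non-negative weight per attribute. The function names `fuzzy_center` and `weighted_dissimilarity`, the fuzzifier `m`, and the weight values are all hypothetical.

```python
from collections import Counter


def fuzzy_center(objects, memberships, m=2.0):
    """Build one value distribution per attribute, weighted by membership**m.

    Assumption: this frequency-based center is illustrative only; the
    paper's exact center definition is not given in the abstract.
    """
    n_attrs = len(objects[0])
    center = []
    for j in range(n_attrs):
        dist = Counter()
        for obj, u in zip(objects, memberships):
            dist[obj[j]] += u ** m
        total = sum(dist.values())
        center.append({value: w / total for value, w in dist.items()})
    return center


def weighted_dissimilarity(obj, center, weights):
    """Sum of per-attribute mismatches, each scaled by its attribute weight.

    The mismatch on an attribute is 1 minus the relative frequency of the
    object's value in the center, so a value the center never saw counts 1.
    """
    return sum(
        w * (1.0 - dist.get(value, 0.0))
        for value, dist, w in zip(obj, center, weights)
    )


# Two objects that agree on the first attribute and differ on the second.
objects = [("a", "x"), ("a", "y")]
center = fuzzy_center(objects, memberships=[1.0, 1.0])

# Equal weights: only the second attribute contributes any dissimilarity.
d_equal = weighted_dissimilarity(("a", "x"), center, weights=[0.5, 0.5])

# Down-weighting the second attribute to zero removes its influence.
d_ignored = weighted_dissimilarity(("a", "x"), center, weights=[1.0, 0.0])
```

In an attribute-weighting scheme of this kind, an attribute whose values are spread evenly within a cluster would typically receive a small weight, so that its mismatches contribute little to the distance used for the membership update.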
Pages: 205-217
Page count: 13
Related papers
50 in total
  • [1] K-centers algorithm for clustering mixed type data
    Zhao, Wei-Dong
    Dai, Wei-Hui
    Tang, Chun-Bin
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, 4426 : 1140 - +
  • [2] A novel attribute weighting algorithm for clustering high-dimensional categorical data
    Bai, Liang
    Liang, Jiye
    Dang, Chuangyin
    Cao, Fuyuan
    PATTERN RECOGNITION, 2011, 44 (12) : 2843 - 2861
  • [3] A weighting k-modes algorithm for subspace clustering of categorical data
    Cao, Fuyuan
    Liang, Jiye
    Li, Deyu
    Zhao, Xingwang
    NEUROCOMPUTING, 2013, 108 : 23 - 30
  • [4] A fuzzy k-modes algorithm for clustering categorical data
    Huang, ZX
    Ng, MK
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1999, 7 (04) : 446 - 452
  • [5] A genetic fuzzy k-Modes algorithm for clustering categorical data
    Gan, G.
    Wu, J.
    Yang, Z.
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 1615 - 1620
  • [6] Optimal fully dynamic k-centers clustering
    Bateni, MohammadHossein
    Esfandiari, Hossein
    Mirrokni, Vahab
    Jayaram, Rajesh
    arXiv, 2021
  • [7] Categorical data clustering: A correlation-based approach for unsupervised attribute weighting
    Carbonera, Joel Luis
    Abel, Mara
    2014 IEEE 26TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2014, : 259 - 263
  • [8] A fuzzy k-prototype clustering algorithm for mixed numeric and categorical data
    Ji, Jinchao
    Pang, Wei
    Zhou, Chunguang
    Han, Xiao
    Wang, Zhe
    KNOWLEDGE-BASED SYSTEMS, 2012, 30 : 129 - 135
  • [9] Multiobjective clustering algorithm with fuzzy centroids for categorical data
    Zhou Z.
    Zhu S.
    Zhang D.
    1600, Science Press (53): : 2594 - 2606
  • [10] Fuzzy Clustering Ensemble Algorithm for Partitioning Categorical Data
    Li, Taoying
    Chen, Yan
    2009 INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING, PROCEEDINGS, 2009, : 170 - 174