Automated Attribute Weighting Fuzzy k-Centers Algorithm for Categorical Data Clustering

Cited by: 4
Authors
Mau, Toan Nguyen [1]
Huynh, Van-Nam [1]
Affiliations
[1] Japan Adv Inst Sci & Technol, Sch Adv Sci & Technol, Nomi, Ishikawa, Japan
Keywords
Fuzzy clustering; Categorical data; k-representatives; k-centers; MODES ALGORITHM;
DOI
10.1007/978-3-030-85529-1_17
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Cluster analysis plays an important role in exploring the correlations in data by dividing datasets into separate clusters so that similar objects are located in the same cluster. Moreover, fuzzy cluster analysis can reveal mixtures of clusters in datasets containing multiple distributions. The outcome of a clustering method is largely determined by its definition of similarity, so the similarity measure is critical to the formation of fuzzy clusters. The similarity between two objects is usually calculated as the mean of the differences across multiple dimensions. However, the dissimilarity in some dimensions has little or no effect on the fuzzy clustering outcome. In this study, we explore such impacts for fuzzy clustering of data with categorical attributes. Accordingly, the impact of each attribute on each fuzzy cluster is calculated using an optimizer, and the overlapping dissimilar values are then adjusted by the corresponding weights. We propose applying this approach to the Fk-centers clustering algorithm, and the experimental results show that the proposed method can achieve higher fuzzy silhouette scores than other related works. These results demonstrate the applicability of the proposed method to real-world applications.
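As a rough illustration of the idea the abstract describes (per-attribute weights scaling a categorical dissimilarity, which then drives fuzzy memberships), the following Python sketch may help. It is not the authors' formulation: the frequency-based center representation, the function names `weighted_dissimilarity` and `fuzzy_memberships`, the fixed example weights, and the fuzzy c-means style membership update are all assumptions made for illustration, and the optimizer that learns the weights is not shown.

```python
from typing import Dict, List, Sequence

# Assumption: a cluster center stores, for each attribute, a map from
# category value to its relative frequency in the cluster
# (in the spirit of k-representatives; not taken from the paper).
Center = List[Dict[str, float]]


def weighted_dissimilarity(obj: Sequence[str],
                           center: Center,
                           weights: Sequence[float]) -> float:
    """Sum over attributes of weight * (1 - frequency of the object's value)."""
    return sum(
        w * (1.0 - freq.get(value, 0.0))
        for value, freq, w in zip(obj, center, weights)
    )


def fuzzy_memberships(dissims: Sequence[float], m: float = 2.0) -> List[float]:
    """Fuzzy c-means style update: membership is proportional to
    (1 / dissimilarity) ** (1 / (m - 1)), normalized to sum to 1.
    Assumes m > 1."""
    zeros = [d == 0.0 for d in dissims]
    if any(zeros):
        # An object that coincides with one or more centers is shared
        # equally among those centers and gets zero membership elsewhere.
        share = 1.0 / sum(zeros)
        return [share if z else 0.0 for z in zeros]
    exponent = 1.0 / (m - 1.0)
    inverse = [(1.0 / d) ** exponent for d in dissims]
    total = sum(inverse)
    return [v / total for v in inverse]


if __name__ == "__main__":
    obj = ("red", "small")
    centers = [
        [{"red": 0.8, "blue": 0.2}, {"small": 0.5, "large": 0.5}],
        [{"red": 0.1, "blue": 0.9}, {"small": 0.6, "large": 0.4}],
    ]
    # Hypothetical per-cluster attribute weights; in the paper these come
    # from an optimizer, here they are fixed by hand.
    weights = [[0.9, 0.1], [0.5, 0.5]]
    dissims = [weighted_dissimilarity(obj, c, w)
               for c, w in zip(centers, weights)]
    print(fuzzy_memberships(dissims))
```

In this sketch an attribute with a small weight contributes little to the dissimilarity, so a mismatch on it barely changes the memberships, which is the effect the abstract attributes to attribute weighting.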
Pages: 205-217
Number of pages: 13
Related papers
50 in total
  • [31] Fuzzy Rough Attribute Reduction for Categorical Data
    Wang, Changzhong
    Wang, Yan
    Shao, Mingwen
    Qian, Yuhua
    Chen, Degang
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (05) : 818 - 830
  • [32] Clustering of Categorical Data Using Intuitionistic Fuzzy k-modes
    Mehta, Darshan
    Tripathy, B. K.
    PROCEEDINGS OF SIXTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2016), VOL 1, 2017, 546 : 254 - 263
  • [33] Many-objective fuzzy centroids clustering algorithm for categorical data
    Zhu, Shuwei
    Xu, Lihong
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 96 : 230 - 248
  • [34] Attribute value weighting in k-modes clustering
    He, Zengyou
    Xu, Xiaofei
    Deng, Shengchun
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (12) : 15365 - 15369
  • [35] Formulations of fuzzy clustering for categorical data
    Umayahara, Kazutaka
    Miyamoto, Sadaaki
    Nakamori, Yoshiteru
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2005, 1 (01): : 83 - 94
  • [36] Fuzzy rough clustering for categorical data
    Xu, Shuliang
    Liu, Shenglan
    Zhou, Jian
    Feng, Lin
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (11) : 3213 - 3223
  • [37] Fuzzy rough clustering for categorical data
    Shuliang Xu
    Shenglan Liu
    Jian Zhou
    Lin Feng
    International Journal of Machine Learning and Cybernetics, 2019, 10 : 3213 - 3223
  • [38] Fuzzy clustering for categorical multivariate data
    Oh, CH
    Honda, K
    Ichihashi, H
    JOINT 9TH IFSA WORLD CONGRESS AND 20TH NAFIPS INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS. 1-5, 2001, : 2154 - 2159
  • [39] An New Algorithm-based Rough Set for Selecting Clustering Attribute in Categorical Data
    Baroud, Muftah Mohamed Jomah
    Hashim, Siti Zaiton Mohd
    Zainal, Anazida
    Ahnad, Jamilah
    2020 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2020, : 1358 - 1364
  • [40] Fuzzy clustering with weighting of data variables
    Keller, A
    Klawonn, F
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2000, 8 (06) : 735 - 746