Automated Attribute Weighting Fuzzy k-Centers Algorithm for Categorical Data Clustering

被引:4
|
作者
Mau, Toan Nguyen [1 ]
Huynh, Van-Nam [1 ]
机构
[1] Japan Adv Inst Sci & Technol, Sch Adv Sci & Technol, Nomi, Ishikawa, Japan
关键词
Fuzzy clustering; Categorical data; k-representatives; k-centers; MODES ALGORITHM;
D O I
10.1007/978-3-030-85529-1_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cluster analysis plays an important role in exploring the correlations in data by dividing datasets into separate clusters so that similar objects are located in the same cluster. Moreover, fuzzy cluster analysis can reveal the mixtures of clusters in datasets containing multiple distributions. Certainly, the outcome of clustering methods is approximately determined by the similarity definition. Thus, the similarity measurement is exceedingly important to the formation of fuzzy clusters. In fact, the similarity between two objects is mostly calculated by the mean of differences across multiple dimensions. However, the dissimilarity in some dimensions has little or no effect on the fuzzy clustering outcome. In this study, we explore such impacts for fuzzy clustering of data with categorical attributes. Accordingly, the impact of each attribute on each fuzzy cluster is calculated using an optimizer, and the overlapping dissimilar values are then adjusted by the corresponding weights. We propose to apply this approach to the Fk-centers clustering algorithm, and the experimental results show that our proposed method can achieve higher fuzzy silhouette scores than other related works. These results demonstrate the applicability of deploying of the proposed method in real-world application.
引用
收藏
页码:205 / 217
页数:13
相关论文
共 50 条
  • [41] Fuzzy clustering with weighting of data variables
    Keller, Annette
    Klawonn, Frank
    International Journal of Uncertainty, Fuzziness and Knowlege-Based Systems, 2000, 8 (06): : 735 - 746
  • [42] CLUSTERING CATEGORICAL DATA BASED ON COMBINATIONS OF ATTRIBUTE VALUES
    Do, Hee-Jung
    Kim, Jae Yearn
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2009, 5 (12A): : 4393 - 4405
  • [43] An Incremental Clustering with Attribute Unbalance Considered for Categorical Data
    Chen, Jize
    Yang, Zhimin
    Yin, Jian
    Yang, Xiaobo
    Huang, Li
    COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, 2009, 51 : 433 - +
  • [44] Weighted Numerical and Categorical Attribute Clustering in Data Streams
    Liang, Wen-Bin
    Wang, Chang-Dong
    Lai, Jian-Huang
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 3066 - 3072
  • [45] A k-mean clustering algorithm for mixed numeric and categorical data
    Ahmad, Amir
    Dey, Lipika
    DATA & KNOWLEDGE ENGINEERING, 2007, 63 (02) : 503 - 527
  • [46] TW-k-Means: Automated Two-Level Variable Weighting Clustering Algorithm for Multiview Data
    Chen, Xiaojun
    Xu, Xiaofei
    Huang, Joshua Zhexue
    Ye, Yunming
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (04) : 932 - 944
  • [47] Fuzzy clustering of categorical data using fuzzy centroids
    Kim, DW
    Lee, KH
    Lee, D
    PATTERN RECOGNITION LETTERS, 2004, 25 (11) : 1263 - 1271
  • [48] HABOS clustering algorithm for categorical data
    Wu, Sen (wusen@manage.ustb.edu.cn), 2016, Science Press (38):
  • [49] Clustering algorithm for Boolean and categorical data
    Liu, H.
    Deng, H.
    Lu, S.
    Huazhong Ligong Daxue Xuebao/Journal Huazhong (Central China) University of Science and Technology, 2001, 29 (03): : 30 - 32
  • [50] Fuzzy Co-clustering with Automated Variable Weighting
    Laclau, Charlotte
    de Carvalho, Francisco de A. T.
    Nadif, Mohamed
    2015 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2015), 2015,