A MD fuzzy k-modes Algorithm for Clustering Categorical Matrix-Object Data; [基于分类型矩阵对象数据的MD fuzzy k-modes聚类算法]

被引:0
|
作者
Li S. [1 ]
Zhang M. [1 ]
Cao F. [2 ]
机构
[1] School of Mathematical Sciences, Shanxi University, Taiyuan
[2] School of Computer and Information Technology, Shanxi University, Taiyuan
基金
中国国家自然科学基金;
关键词
Cluster centers; Clustering; Dissimilarity measure; Matrix-object data; MD fuzzy k-modes algorithm;
D O I
10.7544/issn1000-1239.2019.20180737
中图分类号
学科分类号
摘要
Traditional algorithms generally cluster single-valued attributed data. However, in practice, each attribute of the data object is described by more than one feature vector. For example, customers may purchase multiple products at the same time as they shop. An object described by multiple feature vectors is called a matrix object and such data are called matrix-object data. At present, the research work on clustering algorithms for categorical matrix- object data is relatively rare, and there are still many issues to be settled. In this paper, we propose a new matrix-object data fuzzy k-modes (MD fuzzy k-modes) algorithm that uses the fuzzy k-modes clustering process to cluster categorical matrix-object data. In the proposed algorithm, we introduce the fuzzy factor β with the concept of fuzzy set. The dissimilarity measure between two categorical matrix-objects is redefined, and the heuristic updating algorithm of the cluster centers is provided. Finally, the effectiveness of the MD fuzzy k-modes algorithm is verified on the five real-world data sets, and the relationship between fuzzy factor β and membership w is analyzed. Therefore, in the era of big data, clustering multiple records by using the MD fuzzy k-modes algorithm can make it easier to find customers' spending habits and preferences, so as to make more targeted recommendation. © 2019, Science Press. All right reserved.
引用
收藏
页码:1325 / 1337
页数:12
相关论文
共 50 条
  • [1] 基于分类型矩阵对象数据的MD fuzzy k-modes聚类算法
    李顺勇
    张苗苗
    曹付元
    [J]. 计算机研究与发展, 2019, 56 (06) : 1325 - 1337
  • [2] A fuzzy k-modes algorithm for clustering categorical data
    Huang, ZX
    Ng, MK
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1999, 7 (04) : 446 - 452
  • [3] A genetic fuzzy k-Modes algorithm for clustering categorical data
    Gan, G.
    Wu, J.
    Yang, Z.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 1615 - 1620
  • [4] Clustering of Categorical Data Using Intuitionistic Fuzzy k-modes
    Mehta, Darshan
    Tripathy, B. K.
    [J]. PROCEEDINGS OF SIXTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2016), VOL 1, 2017, 546 : 254 - 263
  • [5] Block Fuzzy K-modes Clustering Algorithm
    Yang, Miin-Shen
    Lin, Chih-Ying
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 384 - 389
  • [6] MD-SPKM: A set pair k-modes clustering algorithm for incomplete categorical matrix data
    Zhang, Chunying
    Gao, Ruiyan
    Wang, Jiahao
    Chen, Song
    Liu, Fengchun
    Ren, Jing
    Feng, Xiaoze
    [J]. INTELLIGENT DATA ANALYSIS, 2021, 25 (06) : 1507 - 1524
  • [7] A Global K-modes Algorithm for Clustering Categorical Data
    Bai Tian
    Kulikowski, C. A.
    Gong Leiguang
    Yang Bin
    Huang Lan
    Zhou Chunguang
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2012, 21 (03) : 460 - 465
  • [8] A genetic k-modes algorithm for clustering categorical data
    Gan, GJ
    Yang, ZJ
    Wu, JH
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 195 - 202
  • [9] Genetic intuitionistic weighted fuzzy k-modes algorithm for categorical data
    Kuo, R. J.
    Thi Phuong Quyen Nguyen
    [J]. NEUROCOMPUTING, 2019, 330 : 116 - 126
  • [10] Rough Set Based Fuzzy K-Modes for Categorical Data
    Saha, Indrajit
    Sarkar, Jnanendra Prasad
    Maulik, Ujjwal
    [J]. SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, (SEMCCO 2012), 2012, 7677 : 323 - 330