A Modified Initialization Method to Find an Initial Center for Fuzzy K-Modes Clustering

被引:0
|
作者
Saranya, S. [1 ]
Jayanthi, P. [1 ]
机构
[1] Kongu Engn Coll, Dept CSE, Perundurai, Tamil Nadu, India
关键词
Fuzzy K-Modes clustering; Outlier Detection Technique; Initial cluster center; Initial_Distance; Initial_Entropy; MEANS ALGORITHM; SETS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data mining is used to analyzing the data and extracts the information from large data sets. Clustering is a technique which is used to grouping the objects which is having similar characters. In existing system, K-Means clustering is used to cluster the numerical data but it does not cluster the categorical data. To overcome the limitation of using numeric data only in the clustering process, F Jiang proposed the initialization method for K-modes clustering in which the initial cluster centres were calculated using Ini_Distance and Ini_Entropy methods were used. To improve the performance further, a new modified model is proposed for Fuzzy K-modes clustering. In this model, a modified Initial_Distance and Initial_Entropy Initialization method are used. It ensures that K initial cluster centers are not selected from the same cluster and selected initial cluster centers are not outliers. Also, one of a data pre-processing methods-Data cleaning is used to increase the quality of clustering results and a new modified weight matching distance (WMD) is calculated by using distance between two objects at the time initialization. An experiment result is demonstrated with five data sets taken from UCI machine learning repository and the results proved the effectiveness of the initialization method.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Cluster center initialization algorithm for K-modes clustering
    Khan, Shehroz S.
    Ahmad, Amir
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (18) : 7444 - 7456
  • [2] Initialization of K-Modes Clustering for Categorical Data
    Li Tao-ying
    Chen Yan
    Jin Zhi-hong
    Li Ye
    [J]. 2013 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING (ICMSE), 2013, : 107 - 112
  • [3] Initialization of K-modes clustering using outlier detection techniques
    Jiang, Feng
    Liu, Guozhu
    Du, Junwei
    Sui, Yuefei
    [J]. INFORMATION SCIENCES, 2016, 332 : 167 - 183
  • [4] Block Fuzzy K-modes Clustering Algorithm
    Yang, Miin-Shen
    Lin, Chih-Ying
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 384 - 389
  • [5] CLEKMODES: a modified k-modes clustering algorithm
    Mastrogiannis, N.
    Giannikos, I.
    Boutsinas, B.
    Antzoulatos, G.
    [J]. JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2009, 60 (08) : 1085 - 1095
  • [6] K-modes clustering
    Chaturvedi, A
    Green, PE
    Carroll, JD
    [J]. JOURNAL OF CLASSIFICATION, 2001, 18 (01) : 35 - 55
  • [7] K-modes Clustering
    Anil Chaturvedi
    Paul E. Green
    J. Douglas Caroll
    [J]. Journal of Classification, 2001, 18 : 35 - 55
  • [8] A fuzzy k-modes algorithm for clustering categorical data
    Huang, ZX
    Ng, MK
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1999, 7 (04) : 446 - 452
  • [9] Feature-Weighted Fuzzy K-Modes Clustering
    Nataliani, Yessica
    Yang, Miin-Shen
    [J]. 2019 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, METAHEURISTICS & SWARM INTELLIGENCE (ISMSI 2019), 2019, : 63 - 68
  • [10] Clustering of Categorical Data Using Intuitionistic Fuzzy k-modes
    Mehta, Darshan
    Tripathy, B. K.
    [J]. PROCEEDINGS OF SIXTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2016), VOL 1, 2017, 546 : 254 - 263