K-Harmonic means type clustering algorithm for mixed datasets

被引:38
|
作者
Ahmad, Amir [1 ,2 ]
Hashmi, Sarosh [1 ]
机构
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Rabigh, Saudi Arabia
[2] United Arab Emirates Univ, Coll Informat Technol, Al Ain, U Arab Emirates
关键词
Clustering; Categorical attributes; Numeric attributes; Mixed data; K-Harmonic means clustering; CENTER INITIALIZATION ALGORITHM; CATEGORICAL-DATA;
D O I
10.1016/j.asoc.2016.06.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
K-means type clustering algorithms for mixed data that consists of numeric and categorical attributes suffer from cluster center initialization problem. The final clustering results depend upon the initial cluster centers. Random cluster center initialization is a popular initialization technique. However, clustering results are not consistent with different cluster center initializations. K-Harmonic means clustering algorithm tries to overcome this problem for pure numeric data. In this paper, we extend the K-Harmonic means clustering algorithm for mixed datasets. We propose a definition for a cluster center and a distance measure. These cluster centers and the distance measure are used with the cost function of K-Harmonic means clustering algorithm in the proposed algorithm. Experiments were carried out with pure categorical datasets and mixed datasets. Results suggest that the proposed clustering algorithm is quite insensitive to the cluster center initialization problem. Comparative studies with other clustering algorithms show that the proposed algorithm produce better clustering results. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:39 / 49
页数:11
相关论文
共 50 条
  • [41] A novel hybrid clustering approach based on K-harmonic means using robust design
    Yeh, Wei-Chang
    Lai, Chyh-Ming
    Chang, Kuei-Hu
    [J]. NEUROCOMPUTING, 2016, 173 : 1720 - 1732
  • [42] Spectral co-clustering documents and words using fuzzy K-harmonic means
    Liu, Na
    Chen, Fei
    Lu, Mingyu
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2013, 4 (01) : 75 - 83
  • [43] Possibilistic Fuzzy K-Harmonic Means Clustering of Fourier Transform Infrared Spectra of Tea
    Wu Bin
    Wang Da-zhi
    Wu Xiao-hong
    Jia Hong-wen
    [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2018, 38 (03) : 745 - 749
  • [44] Spectral co-clustering documents and words using fuzzy K-harmonic means
    Na Liu
    Fei Chen
    Mingyu Lu
    [J]. International Journal of Machine Learning and Cybernetics, 2013, 4 : 75 - 83
  • [45] An efficient hybrid data clustering method based on K-harmonic means and Particle Swarm Optimization
    Yang, Fengqin
    Sun, Tieli
    Zhang, Changhai
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (06) : 9847 - 9852
  • [46] Generalized K-Harmonic means - Boosting in unsupervised learning
    Zhang, Bin
    [J]. HP Laboratories Technical Report, 2000, (137):
  • [47] 模糊K-Harmonic Means聚类算法
    赵恒
    杨万海
    张高煜
    [J]. 西安电子科技大学学报, 2005, (04) : 603 - 606
  • [48] Speaker Identification with Vector Quantization and K-Harmonic Means
    Yazici, Mustafa
    Ulutas, Mustafa
    [J]. 2014 22ND SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2014, : 2134 - 2137
  • [49] Efficient collaborative filtering using particle swarm optimization and K-harmonic means algorithm
    Xu, Chonghuan
    Ju, Chunhua
    Qiang, Xiaodan
    [J]. Journal of Computational and Theoretical Nanoscience, 2015, 12 (12) : 6334 - 6342
  • [50] Improved K-Harmonic Means in Wireless Sensor Networks
    Goel, Rashi
    Gupta, Prabhav
    Yadav, Rajesh Kumar
    [J]. PROCEEDINGS OF THE 2017 IEEE 15TH STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2017, : 275 - 279