Density-sensitive fuzzy kernel maximum entropy clustering algorithm

被引:26
|
作者
Tao, Xinmin [1 ]
Wang, Ruotong [1 ]
Chang, Rui [1 ]
Li, Chenxi [1 ]
机构
[1] Northeast Forestry Univ, Coll Engn Technol, Harbin 150040, Heilongjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering; Relative density-based weight; Maximum entropy clustering algorithm; Robustness; C-MEANS; IMAGE SEGMENTATION; SELECTION;
D O I
10.1016/j.knosys.2018.12.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Maximum entropy clustering algorithm (ME) has lately received great attention for its high performance in large-scale data clustering and simplicity in implementation. However, previous studies have demonstrated that different clusters obtained by traditional ME tend to converge to the same one during its process of iteration affected by regularization coefficient and these cluster centers are subject to bias due to its sensitivity to different distributions of objects. These drawbacks of traditional ME can result in its failure of revealing the natural groupings in most datasets, especially in non-Gaussian distributed datasets. In order to address those limitations, we present a novel density-sensitive fuzzy kernel maximum entropy clustering algorithm in this paper. In the proposed approach, to accommodate non-Gaussian distributed cases, the dataset to be clustered in the original space is firstly implicitly mapped into high-dimensional feature space through the kernel function. By introducing the kernel function-based similarity terms in the update formula of the cluster centers, the effect of the objects not belonging to the current cluster on the update of its corresponding center can be counteracted, and simultaneously the influence of regularization coefficient on the clustering result is restricted as well, which can effectively overcome the convergence of the different clusters encountered by traditional ME. In addition, in order to prevent cluster centers from biases caused by the different distribution of the objects in the feature space, the relative density-based weights are also incorporated into the cost function, which can help the proposed approach produce more reasonable and accurate clustering results. In the experiments, the influence of the different parameters on the clustering performance is discussed in detail and some suggestions are also provided. Theoretical analysis and experimental results on several synthetic datasets, UCI benchmark datasets and generated large MNIST handwritten digits datasets demonstrate that the proposed approach is superior to other existing clustering techniques with good robustness. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:42 / 57
页数:16
相关论文
共 50 条
  • [21] DENSITY-SENSITIVE SEMISUPERVISED INFERENCE
    Azizyan, Martin
    Singh, Aarti
    Wasserman, Larry
    ANNALS OF STATISTICS, 2013, 41 (02): : 751 - 771
  • [22] Clustering based on kernel density estimation: nearest local maximum searching algorithm
    Wang, WJ
    Tan, YX
    Jiang, JH
    Lu, JZ
    Shen, GL
    Yu, RQ
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2004, 72 (01) : 1 - 8
  • [23] Maximum weighted entropy clustering algorithm
    Lao, Li
    Wu, Xiaoming
    Cheng, Lingpeng
    Zhu, Xuefeng
    PROCEEDINGS OF THE 2006 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, 2006, : 1022 - 1025
  • [24] An Algorithm of Maximum Entropy Fuzzy Clustering Based on Improved Particle Swarm Optimization
    Su, Rijian
    Kong, Li
    Cheng, Jingjing
    Su, Rijian
    Song, Shengli
    PROCEEDINGS OF THE 2011 INTERNATIONAL CONFERENCE ON INFORMATICS, CYBERNETICS, AND COMPUTER ENGINEERING (ICCE2011), VOL 2: INFORMATION SYSTEMS AND COMPUTER ENGINEERING, 2011, 111 : 323 - +
  • [25] A Rough Fuzzy Kernel Clustering Algorithm
    Ouyang Hao
    Wang Ri Feng
    Wang Zhi Wen
    Huang Zhen Jin
    2015 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION PROBLEM-SOLVING (ICCP), 2015, : 501 - 505
  • [26] An Algorithm of Maximum Entropy Fuzzy Clustering Based on Improved Particle Swarm Optimization
    Su, Rijian
    Kong, Li
    Cheng, Jingjing
    Song, Shengli
    2010 INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT (CCCM2010), VOL II, 2010, : 157 - 160
  • [27] A modified K-means clustering with a density-sensitive distance metric
    Wang, Ling
    Bo, Liefeng
    Jiao, Licheng
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, PROCEEDINGS, 2006, 4062 : 544 - 551
  • [28] Improvement fuzzy kernel clustering algorithm
    Zhang, Sen
    Zhu, Mei-Ling
    Hou, Guang-Kui
    Beijing Gongye Daxue Xuebao/Journal of Beijing University of Technology, 2012, 38 (09): : 1408 - 1411
  • [29] A Robust Fuzzy Kernel Clustering Algorithm
    Zhang Chen
    Xia Shixiong
    Liu Bing
    APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (03): : 1005 - 1012
  • [30] DENSITY-SENSITIVE INSTABILITIES IN MAGNETOSPHERE
    CORNWALL, JM
    JOURNAL OF ATMOSPHERIC AND TERRESTRIAL PHYSICS, 1976, 38 (11): : 1111 - 1114