Fuzzy C-means plus plus : Fuzzy C-means with effective seeding initialization

被引:76
|
作者
Stetco, Adrian [1 ]
Zeng, Xiao-Jun [1 ]
Keane, John [1 ]
机构
[1] Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, England
基金
英国工程与自然科学研究理事会;
关键词
Cluster analysis; Fuzzy C-means clustering; Initialization;
D O I
10.1016/j.eswa.2015.05.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fuzzy C-means has been utilized successfully in a wide range of applications, extending the clustering capability of the K-means to datasets that are uncertain, vague and otherwise hard to cluster. This paper introduces the Fuzzy C-means++ algorithm which, by utilizing the seeding mechanism of the K-means++ algorithm, improves the effectiveness and speed of Fuzzy C-means. By careful seeding that disperses the initial cluster centers through the data space, the resulting Fuzzy C-means++ approach samples starting cluster representatives during the initialization phase. The cluster representatives are well spread in the input space, resulting in both faster convergence times and higher quality solutions. Implementations in R of standard Fuzzy C-means and Fuzzy C-means++ are evaluated on various data sets. We investigate the cluster quality and iteration count as we vary the spreading factor on a series of synthetic data sets. We run the algorithm on real world data sets and to account for the non-determinism inherent in these algorithms we record multiple runs while choosing different k parameter values. The results show that the proposed method gives significant improvement in convergence times (the number of iterations) of up to 40 (2.1 on average) times the standard on synthetic datasets and, in general, an associated lower cost function value and Xie-Beni value. A proof sketch of the logarithmically bounded expected cost function value is given. (C) 2015 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license (http:// creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:7541 / 7548
页数:8
相关论文
共 50 条
  • [1] An new initialization method for fuzzy c-means algorithm
    Kaiqi Zou
    Zhiping Wang
    Ming Hu
    Fuzzy Optimization and Decision Making, 2008, 7 : 409 - 416
  • [2] An new initialization method for fuzzy c-means algorithm
    Zou, Kaiqi
    Wang, Zhiping
    Hu, Ming
    FUZZY OPTIMIZATION AND DECISION MAKING, 2008, 7 (04) : 409 - 416
  • [3] A Novel Initialization Algorithm for Fuzzy C-means Problem
    Liu, Qian
    Liu, Jianxin
    Li, Min
    Zhou, Yang
    THEORY AND APPLICATIONS OF MODELS OF COMPUTATION, TAMC 2020, 2020, 12337 : 215 - 225
  • [4] Application of Hard C-means and Fuzzy C-means in data fusion
    Tang Ai-Hong
    Cai Li-An
    Zhang You-Mei
    DIGITAL MANUFACTURING & AUTOMATION III, PTS 1 AND 2, 2012, 190-191 : 265 - 268
  • [5] Conditional fuzzy C-Means
    Pedrycz, W
    PATTERN RECOGNITION LETTERS, 1996, 17 (06) : 625 - 631
  • [6] Fuzzy c-Means Herding
    Runkler, Thomas A.
    PROCEEDINGS OF THE JOINT 2009 INTERNATIONAL FUZZY SYSTEMS ASSOCIATION WORLD CONGRESS AND 2009 EUROPEAN SOCIETY OF FUZZY LOGIC AND TECHNOLOGY CONFERENCE, 2009, : 149 - 154
  • [7] Vector fuzzy C-means
    Hadi, Mahdipour
    Morteza, Khademi
    Hadi, Sadoghi Yazdi
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2013, 24 (02) : 363 - 381
  • [8] Rough C-means and Fuzzy Rough C-means for Colour Quantisation
    Schaefer, Gerald
    Hu, Qinghua
    Zhou, Huiyu
    Peters, James F.
    Hassanien, Aboul Ella
    FUNDAMENTA INFORMATICAE, 2012, 119 (01) : 113 - 120
  • [9] The MinMax Fuzzy C-Means
    Mashayekhi, Yoosof
    Nazerfard, Ehsan
    Rahbar, Arman
    Mahmood, Samira Shirzadeh Haji
    2019 IEEE 5TH CONFERENCE ON KNOWLEDGE BASED ENGINEERING AND INNOVATION (KBEI 2019), 2019, : 210 - 215
  • [10] A Comparative Study of K-Means, K-Means plus plus and Fuzzy C-Means Clustering Algorithms
    Kapoor, Akanksha
    Singhal, Abhishek
    2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2017,