A Pillar Algorithm for K-Means Optimization by Distance Maximization for Initial Centroid Designation

被引:17
|
作者
Barakbah, Ali Ridho [1 ]
Kiyoki, Yasushi [2 ,3 ]
机构
[1] Elect Engn Polytech Inst Technol, Informat Technol Dept, Soft Comp Lab, Surabaya, Indonesia
[2] Keio Univ, Multi Database & Multimedia Database Lab, Tokyo, Japan
[3] Keio Univ, Fac Environm Informat, Tokyo, Japan
关键词
D O I
10.1109/CIDM.2009.4938630
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering performance of the K-mcans greatly relies upon the correctness of the initial centroids. Usually the initial centroids for the K-means clustering are determined randomly so that the determined centroids may reach the nearest local minima, not the global optimum. This paper proposes a new approach to optimizing the designation of initial centroids for K-means clustering. This approach is inspired by the thought process of determining a set of pillars' locations in order to make a stable house or building. We consider the pillars' placement which should be located as far as possible from each other to withstand against the pressure distribution of a roof, as identical to the number of centroids amongst the data distribution. Therefore, our proposed approach in this paper designates positions of initial centroids by using the farthest accumulated distance between them. First, the accumulated distance metric between all data points and their grand mean is created. The first initial centroid which has maximum accumulated distance metric is selected from the data points. The next initial centroids are designated by modifying the accumulated distance metric between each data point and all previous initial centroids, and then, a data point which has the maximum distance is selected as a new initial centroid. This iterative process is needed so that all the initial centroids are designated. This approach also has a mechanism to avoid outlier data being chosen as the initial centroids. The experimental results show effectiveness of the proposed algorithm for improving the clustering results of K-means clustering.
引用
收藏
页码:61 / 68
页数:8
相关论文
共 50 条
  • [1] Centronit: Initial Centroid Designation Algorithm for K-Means Clustering
    Barakbah, Ali Ridho
    Arai, Kohei
    [J]. EMITTER-INTERNATIONAL JOURNAL OF ENGINEERING TECHNOLOGY, 2014, 2 (01) : 50 - 62
  • [2] Initial Centroid Selection Method for an Enhanced K-means Clustering Algorithm
    Aamer, Youssef
    Benkaouz, Yahya
    Ouzzif, Mohammed
    Bouragba, Khalid
    [J]. UBIQUITOUS NETWORKING, UNET 2019, 2020, 12293 : 182 - 190
  • [3] Tackling Initial Centroid of K-Means with Distance Part (DP-KMeans)
    Ilham, Ahmad
    Ibrahim, Danny
    Assaffat, Luqman
    Solichan, Achmad
    [J]. 2018 INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT INFORMATICS (SAIN), 2018, : 185 - 189
  • [4] An Initial Centroid Selection Method based on Radial and Angular Coordinates for K-means Algorithm
    Rahim, Md Shamsur
    Ahmed, Tanvir
    [J]. 2017 20TH INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2017,
  • [5] Initial Centroid Selection Optimization for K-Means with Genetic Algorithm to Enhance Clustering of Transcribed Arabic Broadcast News Documents
    Maghawry, Ahmed Mohamed
    Omar, Yasser
    Badr, Amr
    [J]. APPLIED COMPUTATIONAL INTELLIGENCE AND MATHEMATICAL METHODS: COMPUTATIONAL METHODS IN SYSTEMS AND SOFTWARE 2017, VOL. 2, 2018, 662 : 86 - 101
  • [6] Bayesian K-Means as a "Maximization-Expectation" Algorithm
    Welling, Max
    Kurihara, Kenichi
    [J]. PROCEEDINGS OF THE SIXTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2006, : 474 - +
  • [7] Bayesian k-Means as a "Maximization-Expectation" Algorithm
    Kurihara, Kenichi
    Welling, Max
    [J]. NEURAL COMPUTATION, 2009, 21 (04) : 1145 - 1172
  • [8] Interpretation and optimization of the k-means algorithm
    Kristian Sabo
    Rudolf Scitovski
    [J]. Applications of Mathematics, 2014, 59 : 391 - 406
  • [9] K-means algorithm with a novel distance measure
    Abudalfa, Shadi I.
    Mikki, Mohammad
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2013, 21 (06) : 1665 - 1684
  • [10] Interpretation and optimization of the k-means algorithm
    Sabo, Kristian
    Scitovski, Rudolf
    [J]. APPLICATIONS OF MATHEMATICS, 2014, 59 (04) : 391 - 406