A K-means Based Genetic Algorithm for Data Clustering

被引:4
|
作者
Pizzuti, Clara [1 ]
Procopio, Nicola [1 ]
机构
[1] Natl Res Council Italy CNR, Inst High Performance Comp & Networking ICAR, Via P Bucci 7-11, I-87036 Arcavacata Di Rende, CS, Italy
关键词
D O I
10.1007/978-3-319-47364-2_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A genetic algorithm, that exploits the K-means principles for dividing objects in groups having high similarity, is proposed. The method evolves a population of chromosomes, each representing a division of objects in a different number of clusters. A group-based crossover, enriched with the one-step K-means operator, and a mutation strategy that reassigns objects to clusters on the base of their distance to the clusters computed so far, allow the approach to determine the best number of groups present in the dataset. The method has been experimented with four different fitness functions on both synthetic and real-world datasets, for which the ground-truth division is known, and compared with the K-means method. Results show that the approach obtains higher values of evaluation indexes than that obtained by the K-means method.
引用
收藏
页码:211 / 222
页数:12
相关论文
共 50 条
  • [21] Data clustering using K-Means based on Crow Search Algorithm
    K Lakshmi
    N Karthikeyani Visalakshi
    S Shanthi
    Sādhanā, 2018, 43
  • [22] A fast K-Means clustering algorithm based on grid data reduction
    Li, Daqi
    Shen, Junyi
    Chen, Hongmin
    2008 IEEE AEROSPACE CONFERENCE, VOLS 1-9, 2008, : 2273 - +
  • [23] Data clustering using K-Means based on Crow Search Algorithm
    Lakshmi, K.
    Visalakshi, N. Karthikeyani
    Shanthi, S.
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2018, 43 (11):
  • [24] Enhanced Data Lake Clustering Design based on K-means Algorithm
    Kachaoui, Jabrane
    Belangour, Abdessamad
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (04) : 547 - 554
  • [25] The SKM Algorithm: A K-Means Algorithm for Clustering Sequential Data
    Dias, Jose G.
    Cortinhal, Maria Joao
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2008, PROCEEDINGS, 2008, 5290 : 173 - 182
  • [26] An efficient K-means clustering algorithm for tall data
    Capo, Marco
    Perez, Aritz
    Lozano, Jose A.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (03) : 776 - 811
  • [27] An extension of the K-means algorithm to clustering skewed data
    Volodymyr Melnykov
    Xuwen Zhu
    Computational Statistics, 2019, 34 : 373 - 394
  • [28] An efficient K-means clustering algorithm for tall data
    Marco Capó
    Aritz Pérez
    Jose A. Lozano
    Data Mining and Knowledge Discovery, 2020, 34 : 776 - 811
  • [29] Parallelization of K-Means Clustering Algorithm for Data Mining
    Jiang, Hao
    Yu, Liyan
    4TH ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS (ITA 2017), 2017, 12
  • [30] An extension of the K-means algorithm to clustering skewed data
    Melnykov, Volodymyr
    Zhu, Xuwen
    COMPUTATIONAL STATISTICS, 2019, 34 (01) : 373 - 394