A K-means Based Genetic Algorithm for Data Clustering

Cited by: 4
Authors
Pizzuti, Clara [1]
Procopio, Nicola [1]
Affiliation
[1] Natl Res Council Italy CNR, Inst High Performance Comp & Networking ICAR, Via P Bucci 7-11, I-87036 Arcavacata Di Rende, CS, Italy
DOI
10.1007/978-3-319-47364-2_21
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A genetic algorithm that exploits the K-means principles for dividing objects into groups having high similarity is proposed. The method evolves a population of chromosomes, each representing a division of the objects into a different number of clusters. A group-based crossover, enriched with the one-step K-means operator, and a mutation strategy that reassigns objects to clusters on the basis of their distance to the clusters computed so far allow the approach to determine the best number of groups present in the dataset. The method has been tested with four different fitness functions on both synthetic and real-world datasets for which the ground-truth division is known, and compared with the K-means method. Results show that the approach obtains higher values of the evaluation indexes than those obtained by the K-means method.
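The abstract names a one-step K-means operator without giving its details. As a rough illustration only, the Python sketch below assumes a label-based chromosome (one cluster label per object) and Euclidean distance; neither assumption is confirmed by this record, and the function names `centroids` and `one_step_kmeans` as well as the toy data are hypothetical.

```python
import math


def centroids(points, labels):
    """Mean of the points assigned to each cluster label."""
    groups = {}
    for point, label in zip(points, labels):
        groups.setdefault(label, []).append(point)
    return {
        label: tuple(sum(coord) / len(members) for coord in zip(*members))
        for label, members in groups.items()
    }


def one_step_kmeans(points, labels):
    """Reassign every point to its nearest centroid, exactly once.

    Assumption: a cluster that loses all its points simply disappears,
    so the number of clusters encoded by the chromosome can shrink.
    """
    cents = centroids(points, labels)
    return [
        min(cents, key=lambda label: math.dist(point, cents[label]))
        for point in points
    ]


# Hypothetical toy data: two well-separated groups in the plane.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
# A deliberately poor chromosome that splits the data into three clusters.
chromosome = [0, 0, 1, 1, 2, 2]
refined = one_step_kmeans(points, chromosome)
print(refined)  # [0, 0, 0, 2, 2, 2]
```

In this toy run the mixed cluster 1 empties out after a single reassignment step, which hints at how such an operator could let the number of clusters vary between chromosomes rather than being fixed in advance.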
Pages: 211-222
Number of pages: 12
Related papers
50 in total
  • [11] Soil data clustering by using K-means and fuzzy K-means algorithm
    Hot, Elma
    Popovic-Bugarin, Vesna
    2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 890 - 893
  • [12] Weighted K-means Clustering Analysis Based on Improved Genetic Algorithm
    Zhang, Tongjie
    Cao, Yan
    Mu, Xiangwei
    SENSORS, MECHATRONICS AND AUTOMATION, 2014, 511-512 : 904 - 908
  • [13] Research of K-means clustering method based on parallel genetic algorithm
    Dai, Wenhua
    Jiao, Cuizhen
    He, Tingting
    2007 THIRD INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, VOL II, PROCEEDINGS, 2007, : 158 - +
  • [14] An improved genetic k-means algorithm for optimal clustering
    Guo, Hai-Xiang
    Zhu, Ke-Jun
    Gao, Si-Wei
    Liu, Ting
    ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 793 - +
  • [15] An Improved Genetic K-Means Algorithm for Spatial Clustering
    Wang, Yuanni
    Ge, Fei
    PROGRESS IN INTELLIGENCE COMPUTATION AND APPLICATIONS, 2008, : 123 - 126
  • [16] A genetic algorithm with gene rearrangement for K-means clustering
    Chang, Dong-Xia
    Zhang, Xian-Da
    Zheng, Chang-Wen
    PATTERN RECOGNITION, 2009, 42 (07) : 1210 - 1222
  • [17] Optimization of K-Means clustering Using Genetic Algorithm
    Irfan, Shadab
    Dwivedi, Gaurav
    Ghosh, Subhajit
    2017 INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES FOR SMART NATION (IC3TSN), 2017, : 157 - 162
  • [18] A GPS location data clustering approach based on a niche genetic algorithm and hybrid K-means
    Ma, Hongjiang
    Zhou, Xiangbing
    INTELLIGENT DATA ANALYSIS, 2019, 23 : S175 - S198
  • [19] A Clustering Method Based on K-Means Algorithm
    Li, Youguo
    Wu, Haiyan
    INTERNATIONAL CONFERENCE ON SOLID STATE DEVICES AND MATERIALS SCIENCE, 2012, 25 : 1104 - 1109
  • [20] A Fuzzy Clustering Algorithm Based on K-means
    Yan, Zhen
    Pi, Dechang
    ECBI: 2009 INTERNATIONAL CONFERENCE ON ELECTRONIC COMMERCE AND BUSINESS INTELLIGENCE, PROCEEDINGS, 2009, : 523 - 528