K-means algorithm based on particle swarm optimization for web document clustering

Authors
Xiao, L. Z. [1]
Shao, Z. Q. [1]
Gu, X. M. [1]
Affiliation
[1] East China University of Science and Technology, College of Information Science and Engineering, Shanghai 200237, People's Republic of China
DOI
Not available
Chinese Library Classification number
O29 [Applied mathematics]
Discipline classification code
070104
Abstract
K-means has been studied as a clustering algorithm for Web documents. However, because it lacks global search ability, its results are not satisfactory. Particle swarm optimization (PSO) is an evolutionary computation technique based on swarm intelligence with high global search ability. This paper therefore proposes a K-means algorithm based on PSO (PSO-KM). The vector space model (VSM) is used to represent the documents, and the Compressed Sparse Row (CSR) format is used to store the data. Computational experiments on three web document datasets tested the performance of the hybrid algorithm, with the F-measure and entropy used to evaluate clustering quality. Compared with K-means and with a K-means algorithm based on a genetic algorithm (GA-KM), PSO-KM obtains better clustering solutions than either, and its run time is less than that of GA-KM.
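The abstract names the document representation (VSM) and the storage format (CSR) without giving details. The following is a minimal sketch of how term vectors might be stored in CSR form, not the authors' implementation. Whitespace tokenization, raw term counts as weights, and the helper names `build_csr`, `row`, and `cosine` are all assumptions made for illustration; the paper's actual weighting scheme is not stated here.

```python
from collections import Counter
import math


def build_csr(docs):
    """Build term-count vectors for docs and store them in CSR form.

    Returns (vocab, data, indices, indptr). Row i occupies
    data[indptr[i]:indptr[i + 1]], with its column ids in the
    matching slice of indices.
    Assumption: whitespace tokenization and raw counts as weights.
    """
    vocab = {}  # term -> column index, assigned in first-seen order
    data, indices, indptr = [], [], [0]
    for text in docs:
        counts = Counter(text.lower().split())
        for term in counts:
            vocab.setdefault(term, len(vocab))
        # Store only the non-zero entries of this row, sorted by column.
        for col, freq in sorted((vocab[t], f) for t, f in counts.items()):
            indices.append(col)
            data.append(float(freq))
        indptr.append(len(data))
    return vocab, data, indices, indptr


def row(csr, i):
    """Return row i of the CSR matrix as a {column: value} dict."""
    _, data, indices, indptr = csr
    start, end = indptr[i], indptr[i + 1]
    return dict(zip(indices[start:end], data[start:end]))


def cosine(csr, i, j):
    """Cosine similarity between rows i and j, touching non-zeros only."""
    a, b = row(csr, i), row(csr, j)
    dot = sum(v * b[c] for c, v in a.items() if c in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical toy corpus, for illustration only.
docs = [
    "web document clustering",
    "document clustering with k-means",
    "particle swarm",
]
csr = build_csr(docs)
```

Because document vectors are mostly zeros, CSR keeps only three arrays (values, column indices, and row pointers), and similarity computations iterate over the stored non-zero entries rather than the full vocabulary.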
Pages: 980-984
Number of pages: 5
Related papers
50 in total
  • [31] New initialization approaches for the k-means and particle swarm optimization based clustering algorithms
    Cinaroglu, Sinem
    Bulut, Hasan
    JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, 2018, 33 (02): : 413 - 422
  • [32] A population-based clustering technique using particle swarm optimization and k-means
    Ben Niu
    Qiqi Duan
    Jing Liu
    Lijing Tan
    Yanmin Liu
    Natural Computing, 2017, 16 : 45 - 59
  • [33] Improved Gene Clustering Based on Particle Swarm Optimization, K-Means, and Cluster Matching
    Lam, Yau-King
    Tsang, P. W. M.
    Leung, Chi-Sing
    NEURAL INFORMATION PROCESSING, PT I, 2011, 7062 : 654 - +
  • [34] A population-based clustering technique using particle swarm optimization and k-means
    Niu, Ben
    Duan, Qiqi
    Liu, Jing
    Tan, Lijing
    Liu, Yanmin
    NATURAL COMPUTING, 2017, 16 (01) : 45 - 59
  • [35] New initialization approaches for the k-means and particle swarm optimization based clustering algorithms
    K-ortalamalar ve parçacık sürü optimizasyonu tabanlı kümeleme algoritmaları için yeni ilklendirme yaklaşımları
    Bulut, Hasan (hasan.bulut@ege.edu.tr), 2018, Gazi Universitesi (33):
  • [36] An Improved K-means Algorithm for Document Clustering
    Wu, Guohua
    Lin, Hairong
    Fu, Ershuai
    Wang, Liuyang
    2015 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND MECHANICAL AUTOMATION (CSMA), 2015, : 65 - 69
  • [37] Harmony K-means algorithm for document clustering
    Mahdavi, Mehrdad
    Abolhassani, Hassan
    DATA MINING AND KNOWLEDGE DISCOVERY, 2009, 18 (03) : 370 - 391
  • [38] Harmony K-means algorithm for document clustering
    Mehrdad Mahdavi
    Hassan Abolhassani
    Data Mining and Knowledge Discovery, 2009, 18 : 370 - 391
  • [39] K-means Clustering Optimization Algorithm Based on MapReduce
    Li, Zhihua
    Song, Xudong
    Zhu, Wenhui
    Chen, Yanxia
    PROCEEDINGS OF THE 2015 INTERNATIONAL SYMPOSIUM ON COMPUTERS & INFORMATICS, 2015, 13 : 198 - 203
  • [40] K-means Algorithm Based on Particle Swarm Optimization for the Identification of Rock Discontinuity Sets
    Yanyan Li
    Qing Wang
    Jianping Chen
    Liming Xu
    Shengyuan Song
    Rock Mechanics and Rock Engineering, 2015, 48 : 375 - 385