Research of text clustering based on hybrid Parallel Genetic Algorithm

被引:0
|
作者
Dai, Wenhua [1 ]
Rao, Guizhen [1 ]
He, Tingting
机构
[1] Cent China Normal Univ, Dept Comp Sci, Wuhan 430079, Peoples R China
关键词
Parallel Genetic Algorithm; K-means Clustering; text clustering; vector space model; feature extraction;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
K-means Clustering Algorithm is sensitive to the choice of the initial cluster center, and easy to fall into a local optimal solution. In order to avoid this kind of flaw, we proposed Hybrid Parallel Genetic Algorithm. In this method, we expressed the documents set into Vector Space Model and randomly chose initial clustering center to form chromosome among document vectors. Combined with the efficiency of K-means Algorithm and the global optimization ability of Parallel Genetic Algorithm, we can provide a higher efficiency and precision for text clustering by means of heredity, mutation in the community, and parallel evolution, intermarriage among communities. Experiments indicate that Hybrid Parallel Genetic Algorithm has higher accuracy and global optimization ability than the other text clustering methods like K-means Algorithm, Genetic Algorithm and so on.
引用
收藏
页码:28 / 31
页数:4
相关论文
共 50 条
  • [31] Research on Parallel Hybrid Genetic Algorithm based on Multi-group in Job Shop Scheduling
    Yan, Cunliang
    Shi, Weifeng
    Zhao, Ruilin
    [J]. ADVANCED COMPOSITE MATERIALS, PTS 1-3, 2012, 482-484 : 2227 - +
  • [32] Research on Genetic and Simulated Annealing Algorithm for Multiple Sequence Alignment Based on Hybrid Parallel Computation
    Li, Longsheng
    Liu, Yu
    [J]. PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL AND ELECTRICAL ENGINEERING (AMEE 2017), 2017, 87 : 205 - 208
  • [33] NK Hybrid Genetic Algorithm for Clustering
    Tinos, Renato
    Zhao, Liang
    Chicano, Francisco
    Whitley, Darrell
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2018, 22 (05) : 748 - 761
  • [34] An improved hybrid genetic clustering algorithm
    Liu, YG
    Peng, J
    Chen, KF
    Zhang, Y
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 3955 : 192 - 202
  • [35] Research on Text Clustering Algorithm Based on Improved K_means
    Li Xinwu
    [J]. 2009 ETP INTERNATIONAL CONFERENCE ON FUTURE COMPUTER AND COMMUNICATION (FCC 2009), 2009, : 19 - 22
  • [36] Research on Hadoop-based Massive short text clustering algorithm
    Zhao, Qiang
    Shi, Yuliang
    Qing, Zepeng
    [J]. FOURTH INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2019, 11198
  • [37] Research on Big Data Text Clustering Algorithm Based on Swarm Intelligence
    Li, Xiaorong
    Shu, Zhinian
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [38] Research on Text Clustering Algorithm Based on K_means and SOM
    Li Xinwu
    [J]. 2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION WORKSHOP: IITA 2008 WORKSHOPS, PROCEEDINGS, 2008, : 341 - 344
  • [39] A Hybrid Data Clustering Using Firefly Algorithm Based Improved Genetic Algorithm
    Maheshwar
    Kaushik, Keshav
    Arora, Vikram
    [J]. SECOND INTERNATIONAL SYMPOSIUM ON COMPUTER VISION AND THE INTERNET (VISIONNET'15), 2015, 58 : 249 - 256
  • [40] PGAC: A parallel genetic algorithm for data clustering
    Lo Bosco, G
    [J]. CAMP 2005: Seventh International Workshop on Computer Architecture for Machine Perception , Proceedings, 2005, : 283 - 287