Research of text clustering based on hybrid Parallel Genetic Algorithm

被引:0
|
作者
Dai, Wenhua [1 ]
Rao, Guizhen [1 ]
He, Tingting
机构
[1] Cent China Normal Univ, Dept Comp Sci, Wuhan 430079, Peoples R China
关键词
Parallel Genetic Algorithm; K-means Clustering; text clustering; vector space model; feature extraction;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
K-means Clustering Algorithm is sensitive to the choice of the initial cluster center, and easy to fall into a local optimal solution. In order to avoid this kind of flaw, we proposed Hybrid Parallel Genetic Algorithm. In this method, we expressed the documents set into Vector Space Model and randomly chose initial clustering center to form chromosome among document vectors. Combined with the efficiency of K-means Algorithm and the global optimization ability of Parallel Genetic Algorithm, we can provide a higher efficiency and precision for text clustering by means of heredity, mutation in the community, and parallel evolution, intermarriage among communities. Experiments indicate that Hybrid Parallel Genetic Algorithm has higher accuracy and global optimization ability than the other text clustering methods like K-means Algorithm, Genetic Algorithm and so on.
引用
收藏
页码:28 / 31
页数:4
相关论文
共 50 条
  • [1] Research on Text Feature Clustering Based on Improved Parallel Genetic Algorithm
    Jiang, Mingyang
    Fan, Xiaojing
    Pei, Zhili
    Zhang, Zhifeng
    [J]. PROCEEDINGS OF 2018 TENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2018, : 235 - 238
  • [2] Research on Text Clustering Based on Hybrid Simulated-Genetic Algorithm
    Wu, Xiao-qin
    [J]. COMPUTER SCIENCE AND TECHNOLOGY (CST2016), 2017, : 580 - 588
  • [3] Research on Text Feature Extraction Based on Hybrid Parallel Genetic Algorithm
    Dai, Wenhua
    Jiao, Cuizhen
    He, Tingting
    [J]. 2007 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-15, 2007, : 5581 - +
  • [4] Research on The parallel Text Clustering Algorithm Based on the Semantic Tree
    Liu, Gangfeng
    Wang, Yunlan
    Zhao, Tianhai
    Li, Dongyang
    [J]. 2011 6TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND CONVERGENCE INFORMATION TECHNOLOGY (ICCIT), 2012, : 400 - 403
  • [5] Research of Adaptive Text Fuzzy Clustering Method Based on Genetic Algorithm
    Dai, Wenhua
    Jiao, Cuizhen
    He, Tingting
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE INFORMATION COMPUTING AND AUTOMATION, VOLS 1-3, 2008, : 1270 - +
  • [6] Text Summarization Using Hybrid Parallel Genetic Algorithm
    Tang, Xinlai
    Wang, XiaoRong
    Wang, Meng
    [J]. COMPUTATIONAL MATERIALS SCIENCE, PTS 1-3, 2011, 268-270 : 1073 - +
  • [7] Text Summarization Using Hybrid Parallel Genetic Algorithm
    Tang, Xinlai
    Wang, XiaoRong
    [J]. ADVANCED MATERIALS AND INFORMATION TECHNOLOGY PROCESSING, PTS 1-3, 2011, 271-273 : 154 - +
  • [8] A hybrid genetic based clustering algorithm
    Liu, YG
    Chen, KF
    Li, XM
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1677 - 1682
  • [9] A parallel text document clustering algorithm based on neighbors
    Li, Yanjun
    Luo, Congnan
    Chung, Soon M.
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (02): : 933 - 948
  • [10] A parallel text document clustering algorithm based on neighbors
    Yanjun Li
    Congnan Luo
    Soon M. Chung
    [J]. Cluster Computing, 2015, 18 : 933 - 948