Research of text segmentation based on parallel genetic algorithm

Cited by: 0
Authors
Zhao, Yu [1]
Cai, Wandong [1]
Fan, Na [1]
Liu, Nian [2]
Affiliations
[1] College of Computer Science, Northwestern Polytechnical University, Xi'an 710072, China
[2] Library, Xi'an University of Architecture and Technology, Xi'an 710055, China
Keywords
Chinese information processing - Global optimal solutions - Internal cohesion - Latent semantics - Multi-objective optimization problem - Objective functions - Parallel genetic algorithms - Text segmentation
DOI
Not available
Abstract
To address the data sparseness of short texts, an algorithm that draws on knowledge from an external corpus is proposed to improve the accuracy of text segmentation. It proceeds in two steps: Gibbs sampling is used to estimate the LDA model corresponding to the corpus, and the latent semantic structure of the text is then inferred from that LDA model. Two objective functions, internal cohesion and external dissimilarity, are then defined to transform text segmentation into a multi-objective optimization problem. A parallel genetic algorithm based on these objective functions is employed to obtain the global optimal solution for text segmentation. According to the experiments, the proposed algorithm achieves higher accuracy than the MDA and LDA-based methods in the case of data sparseness.
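The optimization step described in the abstract can be illustrated with a rough Python sketch. It is not the authors' implementation: the abstract gives no formulas, so everything below is an assumption made for illustration. In particular, each sentence is assumed to already have a topic-distribution vector (for example, inferred from an LDA model); cohesion is taken as the mean cosine similarity of sentences to their segment centroid; dissimilarity is taken as one minus the cosine similarity of adjacent segment centroids; the two objectives are combined with a simple weighted sum rather than a true multi-objective method; and the "parallel" part is imitated by an island model whose islands are stepped sequentially. All function names and parameters are hypothetical.

```python
import math
import random

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def split(vectors, boundaries):
    """boundaries[i] == 1 places a segment break after sentence i."""
    segments, current = [], [vectors[0]]
    for vec, cut in zip(vectors[1:], boundaries):
        if cut:
            segments.append(current)
            current = []
        current.append(vec)
    segments.append(current)
    return segments

def centroid(segment):
    return [sum(col) / len(segment) for col in zip(*segment)]

def cohesion(segments):
    # Assumed definition: mean similarity of each sentence to its segment centroid.
    sims = [cosine(v, centroid(s)) for s in segments for v in s]
    return sum(sims) / len(sims)

def dissimilarity(segments):
    # Assumed definition: mean (1 - cosine) between adjacent segment centroids.
    if len(segments) < 2:
        return 0.0
    cents = [centroid(s) for s in segments]
    gaps = [1.0 - cosine(a, b) for a, b in zip(cents, cents[1:])]
    return sum(gaps) / len(gaps)

def fitness(vectors, boundaries, w=0.5):
    # Weighted-sum scalarization of the two objectives (a simplification).
    segs = split(vectors, boundaries)
    return w * cohesion(segs) + (1.0 - w) * dissimilarity(segs)

def evolve(vectors, islands=4, pop_size=12, generations=30,
           migrate_every=5, seed=0):
    """Island-model GA over boundary bit strings; needs >= 2 sentences."""
    rng = random.Random(seed)
    n = len(vectors) - 1
    score = lambda c: fitness(vectors, c)
    pops = [[[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size)]
            for _ in range(islands)]
    for gen in range(1, generations + 1):
        for i, pop in enumerate(pops):
            pop.sort(key=score, reverse=True)
            parents = pop[: pop_size // 2]      # truncation selection, elitist
            children = []
            while len(parents) + len(children) < pop_size:
                a, b = rng.sample(parents, 2)
                cut = rng.randrange(1, n) if n > 1 else 0
                child = a[:cut] + b[cut:]       # one-point crossover
                if rng.random() < 0.2:          # bit-flip mutation
                    child[rng.randrange(n)] ^= 1
                children.append(child)
            pops[i] = parents + children
        if gen % migrate_every == 0:
            # Ring migration: each island's worst is replaced by the
            # previous island's best.
            bests = [max(pop, key=score) for pop in pops]
            for i, pop in enumerate(pops):
                worst = min(range(pop_size), key=lambda k: score(pop[k]))
                pop[worst] = list(bests[i - 1])
    return max((c for pop in pops for c in pop), key=score)

# Toy input: three sentences on one topic followed by three on another.
toy = [[0.9, 0.1]] * 3 + [[0.1, 0.9]] * 3
best = evolve(toy)
print(best, round(fitness(toy, best), 3))
```

In a real parallel setting each island could run in its own process and exchange individuals only at migration points; the sequential loop here keeps the sketch short and deterministic for a given seed.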
Pages: 40-44
Related papers
50 in total
  • [21] Research of Adaptive Text Fuzzy Clustering Method Based on Genetic Algorithm
    Dai, Wenhua
    Jiao, Cuizhen
    He, Tingting
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE INFORMATION COMPUTING AND AUTOMATION, VOLS 1-3, 2008, : 1270 - +
  • [22] Research on text knowledge acquisition of product design based on genetic algorithm
    1600, TeknoScienze, Viale Brianza,22, Milano, 20127, Italy (28):
  • [23] Research on Text Knowledge Acquisition of Product Design based on Genetic Algorithm
    Zhang, Shuai
    Zuo, Tiefeng
    Jin, Jiuzhi
    Wang, Zhen
    AGRO FOOD INDUSTRY HI-TECH, 2017, 28 (03): : 2312 - 2316
  • [24] Research on Intelligent Generating Test Paper Based on Parallel Genetic Algorithm
    Li, Jianjun
    Wang, Meng
    SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING: THEORY AND PRACTICE, VOL 2, 2012, 115 : 161 - +
  • [25] Research on Scheduling Strategy in Parallel Applications Based on a Hybrid Genetic Algorithm
    Gao, Ren
    Zhou, Huaibei
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 5731 - +
  • [26] Research and application of Distributed Parallel Genetic Algorithm Based on PC Cluster
    Liu, Keyan
    Sheng, Wanxing
    Li, Yunhua
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2007, 7 (02): : 157 - 163
  • [27] Research and Application of News-text Similarity Algorithm based on Chinese word segmentation
    Guan, Wei
    Zhang, Pengzhou
    2013 3RD INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, COMMUNICATIONS AND NETWORKS (CECNET), 2013, : 484 - 487
  • [28] Research on text segmentation based on topic analysis
    Liu, Ming
    Wang, Xiao-Long
    Liu, Yuan-Chao
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2009, 37 (02): : 278 - 284
  • [29] Genetic Algorithm Based Tree Segmentation
    Varjovi, Mahdi Hatami
    Altun, Sara
    Talu, Muhammed Fatih
    Yeroglu, Celaleddin
    2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP), 2018,
  • [30] Research on method of text classification rule extraction based on genetic algorithm and entropy
    Computer Engineering Department of Nanhai Campus, South China Normal University, Foshan 528225, China
    Zhongshan Daxue Xuebao, 2007, 5 (18-21+24):