A Network Decomposition-based Text Clustering Algorithm for Topic Detection

被引:1
|
作者
Meng, Zuqiang [1 ]
Shen, Shimo [1 ]
Chen, Qiulian [1 ]
机构
[1] Guangxi Univ, Sch Comp Elect & Informat, Nanning 530004, Peoples R China
关键词
topic detection; text clustering; network; k-means algorithm; vector space model;
D O I
10.4028/www.scientific.net/AMM.239-240.1318
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text clustering is one of the most popular topic detection techniques. However, the existing text clustering approaches require that each document has to be partitioned to one and only one cluster. This is not reasonable in some cases for there exist some documents which should not used to constitute topics. This paper firstly models a text document set as a network and designs a method for decomposing such a network, and then proposes a truly original text clustering algorithm for topic detection, called a network decomposition-based text clustering algorithm for topic detection (NDTCATD). The proposed algorithm ensures that meaningless documents can not be used to constitute topics. Experimental results show that NDTCATD is much better than bisecting k-means algorithm in terms of overall similarity and average cluster similarity. Therefore the proposed algorithm is reasonable and effective and is especially suitable for topic detection.
引用
收藏
页码:1318 / 1323
页数:6
相关论文
共 50 条
  • [31] Decomposition-based Bayesian network structure learning algorithm using local topology information
    Dai, Jingguo
    Ren, Jia
    Du, Wencai
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 195
  • [32] News Text Topic Clustering Optimized Method Based on TF-IDF Algorithm on Spark
    Zhou, Zhuo
    Qin, Jiaohua
    Xiang, Xuyu
    Tan, Yun
    Liu, Qiang
    Xiong, Neal N.
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 62 (01): : 217 - 231
  • [33] A decomposition-based ant colony optimization algorithm for the multi-objective community detection
    Ji, Ping
    Zhang, Shanxin
    Zhou, ZhiPing
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (01) : 173 - 188
  • [34] A Decomposition-Based Multiobjective Chemical Reaction Optimization Algorithm for Community Detection in Complex Networks
    Hongye Li
    Wei Gan
    [J]. International Journal of Computational Intelligence Systems, 2020, 13 : 524 - 537
  • [35] Low rank decomposition-based anomaly detection
    Chen, Shih-Yu
    Yang, Shiming
    Kalpakis, Konstantinos
    Chang, Chein-, I
    [J]. ALGORITHMS AND TECHNOLOGIES FOR MULTISPECTRAL, HYPERSPECTRAL, AND ULTRASPECTRAL IMAGERY XIX, 2013, 8743
  • [36] Application of Density Clustering Algorithm Based on SNN in the Topic Analysis of Microblogging Text: A Case of Smog
    Lu, Yonghe
    Luo, Jiayi
    [J]. INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, 2019, 868 : 955 - 972
  • [37] A Novel Hybrid Clustering Algorithm for Microblog Topic Detection
    Geng, Xiao
    Zhang, Yanmei
    Jiao, Yuhang
    Mei, Yinan
    [J]. 2ND INTERNATIONAL CONFERENCE ON MATERIALS SCIENCE, RESOURCE AND ENVIRONMENTAL ENGINEERING (MSREE 2017), 2017, 1890
  • [38] A Decomposition-Based Multiobjective Chemical Reaction Optimization Algorithm for Community Detection in Complex Networks
    Li, Hongye
    Gan, Wei
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2020, 13 (01) : 524 - 537
  • [39] A decomposition-based ant colony optimization algorithm for the multi-objective community detection
    Ping Ji
    Shanxin Zhang
    ZhiPing Zhou
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 173 - 188
  • [40] An Algorithm of Topic Distillation Based on Anchor Text
    Jiang Kai-zhong
    Lu Zhao
    Wu Yuan-qiong
    Gu Jun-zhong
    [J]. PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, 2008, : 11 - +