A Network Decomposition-based Text Clustering Algorithm for Topic Detection

被引:1
|
作者
Meng, Zuqiang [1 ]
Shen, Shimo [1 ]
Chen, Qiulian [1 ]
机构
[1] Guangxi Univ, Sch Comp Elect & Informat, Nanning 530004, Peoples R China
关键词
topic detection; text clustering; network; k-means algorithm; vector space model;
D O I
10.4028/www.scientific.net/AMM.239-240.1318
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text clustering is one of the most popular topic detection techniques. However, the existing text clustering approaches require that each document has to be partitioned to one and only one cluster. This is not reasonable in some cases for there exist some documents which should not used to constitute topics. This paper firstly models a text document set as a network and designs a method for decomposing such a network, and then proposes a truly original text clustering algorithm for topic detection, called a network decomposition-based text clustering algorithm for topic detection (NDTCATD). The proposed algorithm ensures that meaningless documents can not be used to constitute topics. Experimental results show that NDTCATD is much better than bisecting k-means algorithm in terms of overall similarity and average cluster similarity. Therefore the proposed algorithm is reasonable and effective and is especially suitable for topic detection.
引用
收藏
页码:1318 / 1323
页数:6
相关论文
共 50 条
  • [1] Topic text detection by clustering algorithm for social network media
    Sha, Sha
    [J]. International Journal of Networking and Virtual Organisations, 2024, 30 (03) : 246 - 256
  • [2] Clustering Based Topic Events Detection on Text Stream
    Li, Chunshan
    Ye, Yunming
    Zhang, Xiaofeng
    Chu, Dianhui
    Deng, Shengchun
    Xu, Xiaofei
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT 1, 2014, 8397 : 42 - 52
  • [3] A Topic-based Dynamic Clustering Algorithm for Text Stream
    Rao, Y.
    Li, X. J.
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INDUSTRIAL ENGINEERING (AIIE 2015), 2015, 123 : 480 - 483
  • [4] Topic Detection from Microblog Based on Text Clustering and Topic Model Analysis
    Huang, Siqi
    Yang, Yitao
    Li, Huakang
    Sun, Guozi
    [J]. 2014 ASIA-PACIFIC SERVICES COMPUTING CONFERENCE (APSCC), 2014, : 88 - 92
  • [5] Network Topic Detection Model Based on Text Reconstructions
    Zhu, Zhenfang
    Wang, Peipei
    Jia, Zhiping
    Xiao, Hairong
    Zhang, Guangyuan
    Liang, Hao
    [J]. INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2013, 37 (04): : 367 - 372
  • [6] A Tensor Decomposition-Based Anomaly Detection Algorithm for Hyperspectral Image
    Zhang, Xing
    Wen, Gongjian
    Dai, Wei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (10): : 5801 - 5820
  • [7] Wavelet packet decomposition-based fuzzy clustering algorithm for gene expression data
    Cui, Guangzhao
    Cao, Xianghong
    Wang, Yanfeng
    Cao, Lingzhi
    Huang, Buyi
    Yang, Cunxiang
    [J]. 2006 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS, 2006, : 1027 - +
  • [8] A novel topic clustering algorithm based on graph neural network for question topic diversity
    Wu, Yongliang
    Wang, Xuejun
    Zhao, Wenbin
    Lv, Xiaofeng
    [J]. INFORMATION SCIENCES, 2023, 629 : 685 - 702
  • [9] Iterative QR Decomposition-Based Parallel Diversity Noncoherent Detection Algorithm
    Wang, Jieling
    Zhou, Bin
    Zhao, Mao
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2023, 132 (04) : 2823 - 2838
  • [10] Iterative QR Decomposition-Based Parallel Diversity Noncoherent Detection Algorithm
    Jieling Wang
    Bin Zhou
    Mao Zhao
    [J]. Wireless Personal Communications, 2023, 132 : 2823 - 2838