A text clustering algorithm based on find of density peaks

被引:3
|
作者
Liu, Peiyu [1 ]
Liu, Yingying [2 ]
Hou, Xiuyan [2 ]
Li, Qingqing [2 ]
Zhu, Zhenfang [3 ]
机构
[1] Shandong Yingcai Univ, Jinan, Peoples R China
[2] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan, Peoples R China
[3] Shandong Jiaotong Univ, Sch Informat Sci & Elect Engn, Jinan, Peoples R China
关键词
Density; Text clustering; Feature term; Vector distance; Similarity;
D O I
10.1109/ITME.2015.103
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The text clustering is one of core issues in the field of text mining and information retrieval. The clustering algorithm is divided into four categories: the partitioned clustering algorithm, the hierarchical clustering algorithm, density-based clustering algorithm, as well as intelligence clustering algorithm, but at present, many of which cannot meet the demand of speed and self-adapting about text clustering. Therefore this paper proposed a text clustering algorithm based on find of density peaks. The algorithm was implemented by the calculation of text distance and density, which was in accordance with calculation of the text vector similarity. SVM was used to express text to obtain the vector mapping for the similarity calculation. The next work was the finding of the local density and the distance from points of higher density of each text, removing the noise points, selecting the cluster center. The remaining points were assigned into the cluster which its nearest cluster center represented. According to several sets of contrast experiment, the density-based text clustering has an advantage of reliability and robustness.
引用
下载
收藏
页码:348 / 352
页数:5
相关论文
共 50 条
  • [21] PARALLEL CLUSTERING BY FAST SEARCH AND FIND OF DENSITY PEAKS
    Ji Chengheng
    Lei Yongmei
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2016, : 563 - 567
  • [22] Adaptive Clustering by Fast Search and Find of Density Peaks
    Chen, Yuanyuan
    Ge, Lina
    Zhang, Guifen
    Zhou, Yongquan
    INTELLIGENT COMPUTING METHODOLOGIES, PT III, 2022, 13395 : 802 - 813
  • [23] A Novel Density Peaks Clustering Algorithm Based on Local Reachability Density
    Hanqing Wang
    Bin Zhou
    Jianyong Zhang
    Ruixue Cheng
    International Journal of Computational Intelligence Systems, 2020, 13 : 690 - 697
  • [24] A Novel Density Peaks Clustering Algorithm Based on Local Reachability Density
    Wang, Hanqing
    Zhou, Bin
    Zhang, Jianyong
    Cheng, Ruixue
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2020, 13 (01) : 690 - 697
  • [25] Manifold Density Peaks Clustering Algorithm
    Xu, Xiaohua
    Ju, Yongsheng
    Liang, Yali
    He, Ping
    2015 THIRD INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA, 2015, : 311 - 318
  • [26] Survey on Density Peaks Clustering Algorithm
    Xu X.
    Ding S.-F.
    Ding L.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (05): : 1800 - 1816
  • [27] A Turkish Text Classification Based Feature Selection and Density Peaks Clustering
    Zorarpaci, Ezgi
    2023 31ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2023,
  • [28] Parallel Implementation of Density Peaks Clustering Algorithm Based on Spark
    Liu, Rui
    Li, Xiaoge
    Du, Liping
    Zhi, Shuting
    Wei, Mian
    ADVANCES IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2017, 107 : 442 - 447
  • [29] RFDPC: Density Peaks Clustering Algorithm Based on Resultant Force
    Zhang, Yongzhong
    Huang, Hexiao
    Du, Jie
    Ma, Yan
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [30] An Improvement of Density Peaks Clustering Algorithm Based on KNN and Gravitation
    Sun, Jianyang
    Liu, Guanjun
    2021 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS 2021), 2021, : 234 - 239