A text clustering algorithm based on find of density peaks

被引:3
|
作者
Liu, Peiyu [1 ]
Liu, Yingying [2 ]
Hou, Xiuyan [2 ]
Li, Qingqing [2 ]
Zhu, Zhenfang [3 ]
机构
[1] Shandong Yingcai Univ, Jinan, Peoples R China
[2] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan, Peoples R China
[3] Shandong Jiaotong Univ, Sch Informat Sci & Elect Engn, Jinan, Peoples R China
关键词
Density; Text clustering; Feature term; Vector distance; Similarity;
D O I
10.1109/ITME.2015.103
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The text clustering is one of core issues in the field of text mining and information retrieval. The clustering algorithm is divided into four categories: the partitioned clustering algorithm, the hierarchical clustering algorithm, density-based clustering algorithm, as well as intelligence clustering algorithm, but at present, many of which cannot meet the demand of speed and self-adapting about text clustering. Therefore this paper proposed a text clustering algorithm based on find of density peaks. The algorithm was implemented by the calculation of text distance and density, which was in accordance with calculation of the text vector similarity. SVM was used to express text to obtain the vector mapping for the similarity calculation. The next work was the finding of the local density and the distance from points of higher density of each text, removing the noise points, selecting the cluster center. The remaining points were assigned into the cluster which its nearest cluster center represented. According to several sets of contrast experiment, the density-based text clustering has an advantage of reliability and robustness.
引用
下载
收藏
页码:348 / 352
页数:5
相关论文
共 50 条
  • [31] GDPC: Gravitation-based Density Peaks Clustering algorithm
    Jiang, Jianhua
    Hao, Dehao
    Chen, Yujun
    Parmar, Milan
    Li, Keqin
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2018, 502 : 345 - 355
  • [32] An Improved Density Peaks-Based Graph Clustering Algorithm
    Chen, Lei
    Zheng, Heding
    Liu, Zhaohua
    Li, Qing
    Guo, Lian
    Liang, Guangsheng
    ADVANCES IN INTERNET, DATA & WEB TECHNOLOGIES (EIDWT-2022), 2022, 118 : 68 - 80
  • [33] DPCG: an efficient density peaks clustering algorithm based on grid
    Xiao Xu
    Shifei Ding
    Mingjing Du
    Yu Xue
    International Journal of Machine Learning and Cybernetics, 2018, 9 : 743 - 754
  • [34] Density Peaks Based Clustering Algorithm for Overlapping Community Detection
    Liu, Hongtao
    Zhao, Chaoyue
    Tian, Yuan
    Yang, Juan
    PROCEEDINGS OF 2016 12TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2016, : 1 - 8
  • [35] An improved density peaks clustering based on sparrow search algorithm
    Chen, Yaru
    Zhou, Jie
    He, Xingshi
    Luo, Xinglong
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (08): : 11017 - 11037
  • [36] Density Peaks Clustering Based on Improved RNA Genetic Algorithm
    Ren, Liyan
    Zang, Wenke
    HUMAN CENTERED COMPUTING, HCC 2017, 2018, 10745 : 28 - 33
  • [37] DPCG: an efficient density peaks clustering algorithm based on grid
    Xu, Xiao
    Ding, Shifei
    Du, Mingjing
    Xue, Yu
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (05) : 743 - 754
  • [38] Density Peaks Clustering Algorithm Based on K Nearest Neighbors
    Yin, Shihao
    Wu, Runxiu
    Li, Peiwu
    Liu, Baohong
    Fu, Xuefeng
    ADVANCES IN INTELLIGENT SYSTEMS AND COMPUTING (ECC 2021), 2022, 268 : 129 - 144
  • [39] Hierarchical clustering algorithm based on natural local density peaks
    Cai, Fapeng
    Feng, Ji
    Yang, Degang
    Chen, Zhongshang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 7989 - 8004
  • [40] A novel density peaks clustering algorithm based on Hopkins statistic
    Zhang, Ruilin
    Miao, Zhenguo
    Tian, Ye
    Wang, Hongpeng
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 201