An Improved Method Based on the Density and K-means Nearest Neighbor Text Clustering Algorithm

Cited by: 0
Authors
Fan, Xiaojing [1]
Jiang, Mingyang [2]
Pei, Zhili [2]
Qiao, Shicheng [2]
Lian, Jie [2]
Wang, Chaoyong [3]
Affiliations
[1] Inner Mongolia Univ Nationalities, Coll Mech & Engn, Tongliao, Peoples R China
[2] Inner Mongolia Univ Nationalities, Coll Comp Sci & Technol, Tongliao, Peoples R China
[3] Jilin Teachers Inst Engn & Technol, Changchun 130052, Peoples R China
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
The k-means algorithm is sensitive to the choice of initial cluster centers and to outliers. To address these shortcomings, we propose a density-based improvement to k-means. The density-based step sets a neighborhood and a minimum number of objects that the neighborhood must contain, which excludes isolated points, and it takes non-repeating core points as the initial cluster centers. We use the ratio of the within-class distance to the between-class distance as the criterion evaluation function, and take the number of clusters that minimizes this criterion function as the best number of clusters. These improvements effectively overcome the shortcomings of the k-means algorithm. Finally, several application examples show that the improved algorithm achieves higher accuracy than the original clustering algorithm and helps produce clusters that are compact within each class and well separated between classes.
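The two ideas in the abstract, density-based seeding and choosing the number of clusters by a distance ratio, can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the names `core_points`, `pick_seeds`, `kmeans`, `criterion`, and `choose_k`, the parameters `eps`, `min_pts`, and `k_max`, the use of Euclidean distance, the reading of "non-repeating" as "more than `eps` apart", the exact form of the ratio, and the toy 2-D data are all hypothetical. For text, the points would be document vectors, which the abstract does not specify.

```python
import math

def core_points(points, eps, min_pts):
    """Keep points whose eps-neighborhood (self included) holds >= min_pts points.
    Assumption: all other points are treated as isolated and dropped."""
    return [p for p in points
            if sum(1 for q in points if math.dist(p, q) <= eps) >= min_pts]

def pick_seeds(cores, k, eps):
    """Greedily take up to k core points that lie more than eps apart
    (one possible reading of 'non-repeating core points')."""
    seeds = []
    for c in cores:
        if all(math.dist(c, s) > eps for s in seeds):
            seeds.append(c)
        if len(seeds) == k:
            break
    return seeds

def kmeans(points, seeds, max_iter=100):
    """Plain k-means started from the given seeds."""
    centers = [tuple(s) for s in seeds]
    clusters = []
    for _ in range(max_iter):
        clusters = [[] for _ in centers]
        for p in points:
            j = min(range(len(centers)), key=lambda i: math.dist(p, centers[i]))
            clusters[j].append(p)
        # An empty cluster keeps its previous center.
        new_centers = [tuple(sum(x) / len(c) for x in zip(*c)) if c else centers[i]
                       for i, c in enumerate(clusters)]
        if new_centers == centers:
            break
        centers = new_centers
    return centers, clusters

def criterion(centers, clusters):
    """Assumed criterion: mean point-to-own-center distance divided by
    mean pairwise distance between centers (smaller is better)."""
    n = sum(len(c) for c in clusters)
    within = sum(math.dist(p, ctr) for ctr, c in zip(centers, clusters) for p in c) / n
    pairs = [(a, b) for i, a in enumerate(centers) for b in centers[i + 1:]]
    between = sum(math.dist(a, b) for a, b in pairs) / len(pairs)
    return within / between

def choose_k(points, eps, min_pts, k_max):
    """Try k = 2..k_max and return (k, centers, clusters) with the smallest ratio."""
    cores = core_points(points, eps, min_pts)
    best = None
    for k in range(2, k_max + 1):
        seeds = pick_seeds(cores, k, eps)
        if len(seeds) < k:  # not enough well-separated core points
            break
        centers, clusters = kmeans(cores, seeds)
        score = criterion(centers, clusters)
        if best is None or score < best[0]:
            best = (score, k, centers, clusters)
    return None if best is None else best[1:]

# Toy data: three tight groups plus one isolated point at (50, 50).
data = [(0, 0), (0, 1), (1, 0), (1, 1),
        (10, 10), (10, 11), (11, 10), (11, 11),
        (0, 10), (0, 11), (1, 10), (1, 11),
        (50, 50)]
best_k, centers, clusters = choose_k(data, eps=1.5, min_pts=3, k_max=5)
print(best_k, sorted(centers))
```

On this toy data the isolated point is discarded before clustering and the ratio is smallest at three clusters. Dropping every non-core point is a simplification chosen for this sketch; a density-based method could instead keep border points that fall inside a core point's neighborhood.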
Pages: 312-315
Number of pages: 4