An Improved Method Based on the Density and K-means Nearest Neighbor Text Clustering Algorithm

被引:0
|
作者
Fan, Xiaojing [1 ]
Jiang, Mingyang [2 ]
Pei, Zhili [2 ]
Qiao, Shicheng [2 ]
Lian, Jie [2 ]
Wang, Chaoyong [3 ]
机构
[1] Inner Mongolia Univ Nationalities, Coll Mech & Engn, Tongliao, Peoples R China
[2] Inner Mongolia Univ Nationalities, Coll Comp Sci & Technol, Tongliao, Peoples R China
[3] Jilin Teachers Inst Engn & Technol, Changchun 130052, Peoples R China
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For k-means algorithm to the initial cluster centers sensitive to outliers shortcomings, we propose a density-based method to improve the k-means algorithm. Density-based methods are used, by setting the neighborhood and the neighborhood of the object that contains at least to exclude isolated point, and will not repeat the core point as the initial cluster centers We use the ratio of the distance between the distance and class within the class as a criterion evaluation function, the number of clusters to obtain the minimum value of the criterion function as the best number of clusters. These improvements effectively overcome the shortcomings of K-means algorithm. Finally, a few examples of the improved algorithm introduces specific application examples show that the improved algorithm has a higher accuracy than the original clustering algorithm, can help achieve tight class within the class room away from the clustering effect.
引用
下载
收藏
页码:312 / 315
页数:4
相关论文
共 50 条
  • [31] A K-means Optimized Clustering Algorithm Based on Improved Genetic Algorithm
    Pu, Qiu-Mei
    Wu, Qiong
    Li, Qian
    Lecture Notes in Electrical Engineering, 2022, 801 LNEE : 133 - 140
  • [32] Improved rough K-means clustering algorithm based on firefly algorithm
    Ye, Tingyu
    Ye, Jun
    Wang, Lei
    INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2023, 17 (01) : 1 - 12
  • [33] A Nonuniform Clustering Routing Algorithm Based on an Improved K-Means Algorithm
    Tang, Xinliang
    Zhang, Man
    Yu, Pingping
    Liu, Wei
    Cao, Ning
    Xu, Yunfeng
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 64 (03): : 1725 - 1739
  • [34] K-means clustering algorithm based on improved flower pollination algorithm
    Jiang, Shuhao
    Wang, Mengyuan
    Guo, Jichang
    Wang, Mengqian
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (03)
  • [35] An Improved K-means text clustering algorithm By Optimizing initial cluster centers
    Xiong, Caiquan
    Hua, Zhen
    Lv, Ke
    Li, Xuan
    2016 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CCBD), 2016, : 265 - 268
  • [36] Digital image clustering based on improved k-means algorithm
    Gao Xi
    Hu Zi-mu
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2020, 35 (02) : 173 - 179
  • [37] An Improved Sampling K-means Clustering Algorithm Based on MapReduce
    Zhang Ya-ling
    Wang Ya-nan
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017,
  • [38] Improved K-means clustering algorithm based on user tag
    Tang J.
    Journal of Convergence Information Technology, 2010, 5 (10) : 124 - 130
  • [39] Video Classification Based On the Improved K-Means Clustering Algorithm
    Peng, Taile
    Zhang, Zhen
    Shen, Ke
    Jiang, Tao
    2019 5TH INTERNATIONAL CONFERENCE ON ENVIRONMENTAL SCIENCE AND MATERIAL APPLICATION, 2020, 440
  • [40] Load Forecasting Based on Improved K-means Clustering Algorithm
    Wang Yanbo
    Liu Li
    Pang Xinfu
    Fan Enpeng
    2018 CHINA INTERNATIONAL CONFERENCE ON ELECTRICITY DISTRIBUTION (CICED), 2018, : 2751 - 2755