An Improved Method Based on the Density and K-means Nearest Neighbor Text Clustering Algorithm

Authors
Fan, Xiaojing [1]
Jiang, Mingyang [2]
Pei, Zhili [2]
Qiao, Shicheng [2]
Lian, Jie [2]
Wang, Chaoyong [3]
Affiliations
[1] Inner Mongolia Univ Nationalities, Coll Mech & Engn, Tongliao, Peoples R China
[2] Inner Mongolia Univ Nationalities, Coll Comp Sci & Technol, Tongliao, Peoples R China
[3] Jilin Teachers Inst Engn & Technol, Changchun 130052, Peoples R China
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
To address the k-means algorithm's sensitivity to the initial cluster centers and to outliers, we propose a density-based improvement. The method sets a neighborhood radius and the minimum number of objects that a neighborhood must contain, uses these two parameters to exclude isolated points, and takes non-repeating core points as the initial cluster centers. We use the ratio of the within-class distance to the between-class distance as the criterion evaluation function, and take the number of clusters that minimizes this function as the best number of clusters. These improvements effectively overcome the shortcomings of the k-means algorithm. Finally, application examples show that the improved algorithm achieves higher accuracy than the original clustering algorithm and helps produce clusters that are compact within each class and well separated between classes.
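The approach described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `eps`, `min_pts`, `k_max`, and all function names are hypothetical. Treating every non-core point as isolated, choosing core seeds that lie more than `eps` apart, and measuring within-class and between-class distance as mean Euclidean distances are all simplifications chosen for the sketch.

```python
import numpy as np

def density_init(X, eps, min_pts, k):
    """Return (core_mask, seeds): core points and up to k core seeds more than eps apart."""
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    counts = (D <= eps).sum(axis=1)            # neighborhood size, the point itself included
    core_mask = counts >= min_pts              # assumption: non-core points count as isolated
    core_idx = np.flatnonzero(core_mask)
    order = core_idx[np.argsort(-counts[core_idx], kind="stable")]  # densest first
    seeds = []
    for i in order:
        if all(D[i, j] > eps for j in seeds):  # skip core points that repeat a chosen neighborhood
            seeds.append(int(i))
        if len(seeds) == k:
            break
    return core_mask, seeds

def assign(X, centers):
    """Index of the nearest center for every row of X."""
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def kmeans(X, centers, n_iter=100):
    """Plain Lloyd iterations starting from the given centers."""
    centers = centers.astype(float).copy()
    for _ in range(n_iter):
        labels = assign(X, centers)
        new = np.array([X[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
                        for j in range(len(centers))])
        if np.allclose(new, centers):
            break
        centers = new
    return assign(X, centers), centers

def criterion(X, labels, centers):
    """Assumed criterion: mean within-class distance / mean between-center distance."""
    within = np.linalg.norm(X - centers[labels], axis=1).mean()
    k = len(centers)
    between = np.mean([np.linalg.norm(centers[a] - centers[b])
                       for a in range(k) for b in range(a + 1, k)])
    return within / between

def density_kmeans(X, eps, min_pts, k_max):
    """Try k = 2..k_max and keep the k with the smallest criterion value."""
    X = np.asarray(X, dtype=float)
    best = None
    for k in range(2, k_max + 1):
        core_mask, seeds = density_init(X, eps, min_pts, k)
        if len(seeds) < k:                     # not enough distinct core seeds for this k
            break
        Xc = X[core_mask]
        labels_c, centers = kmeans(Xc, X[seeds])
        score = criterion(Xc, labels_c, centers)
        if best is None or score < best[0]:
            labels = np.full(len(X), -1)       # -1 marks excluded (isolated) points
            labels[core_mask] = labels_c
            best = (score, k, labels, centers)
    if best is None:
        raise ValueError("fewer than two usable core seeds; adjust eps or min_pts")
    return best[1], best[2], best[3]

# Toy data: three tight 3x3 grids of points plus one far-away outlier.
X = np.array([[cx + dx, cy + dy]
              for cx, cy in [(0, 0), (10, 0), (0, 10)]
              for dx in (-0.1, 0.0, 0.1)
              for dy in (-0.1, 0.0, 0.1)] + [[50.0, 50.0]])
best_k, labels, centers = density_kmeans(X, eps=0.5, min_pts=5, k_max=5)
```

In this sketch the outlier never becomes a seed and is left out of the clustering entirely, which is one possible reading of "excluding isolated points". A variant could instead assign excluded points to the nearest final center.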
Pages: 312-315
Number of pages: 4