A K-means Clustering with Optimized Initial Center Based on Hadoop Platform

被引:0
|
作者
Lin, Kunhui [1 ]
Li, Xiang [1 ]
Zhang, Zhongnan [1 ]
Chen, Jiahong [1 ]
机构
[1] Xiamen Univ, Software Sch, Xiamen, Peoples R China
关键词
MapReduce; K-means clustering; Initial center; Density; MAPREDUCE;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the explosive growth of data, the traditional clustering algorithms running on separate servers can not meet the demand. To solve the problem, more and more researchers implement the traditional clustering algorithms on the cloud computing platforms, especially for K-means clustering. But, few researchers pay attention to the K-means clustering structure, and most of researchers optimized the model of the cloud computing platform to raise the computing speed of K-means clustering. However the problem of instability caused by the random initial centers still exists. In this paper, we propose a K-means clustering algorithm with optimized initial centers based on data dimensional density. This method avoids the deficiency of the random initial centers and improves the stability of the Kmeans clustering. The experimental results show that the approach achieves a good performance on K-means, and improves the accuracy of K-means clustering on the test set.
引用
收藏
页码:263 / 266
页数:4
相关论文
共 50 条
  • [1] Optimization of K-means Clustering Algorithm Based on Hadoop Platform
    Duan, A. L.
    Xu, Z. X.
    Zhang, H. J.
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENVIRONMENTAL ENGINEERING (CSEE 2015), 2015, : 1195 - 1203
  • [2] An Improved K-means Clustering Algorithm Based on Hadoop Platform
    Hou, Xiangru
    [J]. CYBER SECURITY INTELLIGENCE AND ANALYTICS, 2020, 928 : 1101 - 1109
  • [3] Improved K-means Clustering Algorithm Based on the Optimized Initial Centriods
    Wang, Shunye
    [J]. 2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 450 - 453
  • [4] An Optimized Initialization Center K-means Clustering Algorithm based on Density
    Yuan, Qilong
    Shi, Haibo
    Zhou, Xiaofeng
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2015, : 790 - 794
  • [5] The Application of K-Means Clustering Algorithm Based on Hadoop
    Zhong, Yurong
    Liu, Dan
    [J]. PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA 2016), 2016, : 88 - 92
  • [6] K-means Clustering Algorithm with improved Initial Center
    Zhang Chen
    Xia Shixiong
    [J]. WKDD: 2009 SECOND INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, : 790 - 792
  • [7] K-means Clustering Algorithm with Refined Initial Center
    Chen, Xuhui
    Xu, Yong
    [J]. PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOLS 1-4, 2009, : 2203 - 2206
  • [8] An Optimized k-means Algorithm for Selecting Initial Clustering Centers
    Song, Jianhui
    Li, Xuefei
    Liu, Yanju
    [J]. INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2015, 9 (10): : 177 - 186
  • [9] K-means Optimization algorithms of initial clustering center based on regional density
    He, Yanxiang
    Cai, Rui
    Wu, Libing
    Li, Fei
    [J]. APPLIED SCIENCE, MATERIALS SCIENCE AND INFORMATION TECHNOLOGIES IN INDUSTRY, 2014, 513-517 : 478 - 482
  • [10] Improved initial clustering center selection algorithm for K-means
    Chen Lasheng
    Li Yuqiang
    [J]. 2017 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA 2017), 2017, : 275 - 279