Parallel Analysis on Clustering Algorithm Based on Hadoop Cloud Computing Platform

被引:0
|
作者
OuYang, Baicheng [1 ]
机构
[1] Guiyang Univ, Coll Math & Informat Sci, Guiyang 550003, Guizhou, Peoples R China
关键词
Hadoop cloud computing platform; Clustering algorithm; Parallel optimization;
D O I
暂无
中图分类号
F [经济];
学科分类号
02 ;
摘要
Ordinary single mode clustering algorithm has been unable to adapt to the current actual information processing requirements both in efficiency or computing complexity theory, and the clustering algorithm based on Hadoop cloud computing platform parallel optimization analysis will indicate the trend clearer. Hadoop is actually a kind of open source project based on the type covered by the Apache, and it is also the basic operation mode of cloud computing distributed platform. Based on Hadoop cloud computing platform, the distributed file system (HDFS) can be used to realize data storage, and Map Reduce programming parallel optimization can be used for huge amounts of data information. According to the characteristics of the general clustering algorithms, combine it with Map Reduce programming mode, it can ensure programming developers to implement parallel need for insight into the structure optimized communication process, which is the parallel optimization of clustering algorithm stated in this paper. In view of this, the following will be based on general K-means algorithm in the initial clustering center selects random features and computing results limitations shall be optimized and improved, further approach based on Hadoop cloud computing platform to realize the related application of actual project.
引用
收藏
页码:499 / 502
页数:4
相关论文
共 50 条
  • [1] Spectral clustering algorithm based on Hadoop cloud platform research
    Zhang, LiSheng
    Hou, Ling
    Lei, DaJiang
    [J]. PROCEEDINGS OF THE 2016 5TH INTERNATIONAL CONFERENCE ON ADVANCED MATERIALS AND COMPUTER SCIENCE, 2016, 80 : 495 - 498
  • [2] A parallel clustering algorithm for Logs Data Based on Hadoop Platform
    Huo, Jiuyuan
    Weng, Jian
    Qu, Hong
    [J]. 2019 THE 3RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPILATION, COMPUTING AND COMMUNICATIONS (HP3C 2019), 2019, : 90 - 94
  • [3] Research on parallel algorithm based on hadoop distributed computing platform
    Heilongjiang University of Technology, Jixi, China
    [J]. Int. J. Grid Distrib. Comput., 4 (163-170):
  • [4] Research on Private Cloud Platform of Seed Tracing Based on Hadoop Parallel Computing
    Li Dongming
    Li Yan
    Yuan Chao
    Chen Haochuan
    Zhang Lijuan
    [J]. PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 134 - 137
  • [5] OPTIMIZATION OF WEIGHTING ALGORITHM IN ENTERPRISE HRMS BASED ON CLOUD COMPUTING AND HADOOP PLATFORM
    Zhao, Genliang
    [J]. SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (05): : 3970 - 3978
  • [6] Implementation of a Parallel Algorithm Based on a Spark Cloud Computing Platform
    Wang, Longhui
    Wang, Yong
    Xie, Yudong
    [J]. ALGORITHMS, 2015, 8 (03): : 407 - 414
  • [7] The study of cloud computing experimental platform based on the Hadoop
    Sang, Jinge
    Yu, Haicun
    Yu, Guoli
    Li, Feng
    [J]. INFORMATION SCIENCE AND MANAGEMENT ENGINEERING, VOLS 1-3, 2014, 46 : 1251 - 1257
  • [8] Cloud Computing K-Means Text Clustering Filtering Algorithm based on Hadoop
    Huang Suyu
    [J]. Proceedings of the 2016 4th International Conference on Machinery, Materials and Information Technology Applications, 2016, 71 : 1516 - 1521
  • [9] Research on PageRank Algorithm parallel computing Based on Hadoop
    Yang, Pengfei
    Zhou, Liqing
    [J]. Proceedings of the 2016 4th International Conference on Mechanical Materials and Manufacturing Engineering (MMME 2016), 2016, 79 : 182 - 185
  • [10] A Research on Routing Scheduling of Cloud Computing Based on Adaptive Ant Colony Algorithm on Hadoop Platform
    Gao, Chen Zhi
    [J]. 2012 INTERNATIONAL ACADEMIC CONFERENCE OF ART ENGINEERING AND CREATIVE INDUSTRY (IACAE 2012), 2012, : 445 - 449