Affinity Propagation Clustering Algorithm based on Spark Platform

被引:0
|
作者
Zhang, Lijia [1 ]
Cheng, Lianglun [1 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Guangdong, Peoples R China
关键词
Affinity propagation; Resilient Distributed Datasets; Spark; Large scale dataset;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
With the explosive growing of data, there are challenges to deal with the large scale complex data. Many clustering algorithms have been proposed. Such as Affinity Propagation (AP) clustering Algorithm, AP takes similarity between pairs of data point as input measures. AP is a fast and efficient clustering algorithm for large dataset compared with the existing clustering algorithm. As the scale of data grows more explosively, the time efficiency of AP algorithm cannot be satisfied. Therefore, AP clustering algorithm based on Spark platform (Spark-AP) is proposed in this paper. Firstly, a dataset is partitioned into several Resilient Distributed Datasets (RDD) on a strategy and select the exemplars of each RDD. Then exemplars are merged and are used to next AP clustering algorithm, which forms a set of high-quality exemplars after convergence. Experiments show that Spark-AP performs better both in processing scale and processing time.
引用
收藏
页码:532 / 535
页数:4
相关论文
共 50 条
  • [21] Multi-objective Differential Evolution Algorithm Based on Affinity Propagation Clustering
    Qu, Dan
    Li, Hongyi
    Chen, Huafei
    IAENG International Journal of Applied Mathematics, 2023, 53 (04)
  • [22] Semi-supervised affinity propagation clustering algorithm based on stratified combination
    Zhang, Z. (zhangzhen2096@163.com), 2013, Science Press (35):
  • [23] A New Method for Grayscale Image Segmentation Based on Affinity Propagation Clustering Algorithm
    Du, Hui
    Wang, Yuping
    Duan, Lili
    2013 9TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2013, : 170 - 173
  • [24] Gravity Theory-Based Affinity Propagation Clustering Algorithm and Its Applications
    Wang, Limin
    Hao, Zhiyuan
    Han, Xuming
    Zhou, Ruihong
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2018, 25 (04): : 1125 - 1135
  • [25] CLUSTERING LARGE-SCALE DATA BASED ON MODIFIED AFFINITY PROPAGATION ALGORITHM
    Serdah, Ahmed M.
    Ashour, Wesam M.
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2016, 6 (01) : 23 - 33
  • [26] Voting Affinity Propagation Algorithm for Clustering XML Documents
    Wang, Xu
    Wei, Jinmao
    Fan, Baoquan
    Yang, Ting
    PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 1907 - 1913
  • [27] Semi-supervised Affinity Propagation Clustering Algorithm Based On Kernel Function
    Zhao Xiaoqiang
    Xie Yaping
    2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 3275 - 3279
  • [28] Research on the Parallelization of the DBSCAN Clustering Algorithm for Spatial Data Mining Based on the Spark Platform
    Huang, Fang
    Zhu, Qiang
    Zhou, Ji
    Tao, Jian
    Zhou, Xiaocheng
    Jin, Du
    Tan, Xicheng
    Wang, Lizhe
    REMOTE SENSING, 2017, 9 (12)
  • [29] Transfer affinity propagation-based clustering
    Hang, Wenlong
    Chung, Fu-lai
    Wang, Shitong
    INFORMATION SCIENCES, 2016, 348 : 337 - 356
  • [30] Affinity propagation clustering algorithm based on large-scale data-set
    Wang L.
    Zheng K.
    Tao X.
    Han X.
    International Journal of Computers and Applications, 2018, 40 (03) : 1 - 6