Affinity propagation clustering algorithm based on large-scale data-set

被引:34
|
作者
Wang L. [1 ]
Zheng K. [1 ]
Tao X. [2 ]
Han X. [3 ]
机构
[1] School of Management Science and Information Engineering, Jilin University Finance and Economics, Changchun
[2] School of Library Information and Archives Management Engineering, Jilin University, Changchun
[3] School of Computer Science and Engineering, Changchun University of Technology, Jiiln
基金
中国国家自然科学基金;
关键词
affinity propagation algorithm; density peak algorithm; Large-scale data-sets; structural similarity;
D O I
10.1080/1206212X.2018.1425184
中图分类号
学科分类号
摘要
Affinity Propagation (AP) algorithm is not effective in processing large-scale data-sets, so the paper purposed an affinity propagation clustering algorithm based on large scale data-set, called LD-AP. First, we use the idea of grid clustering to divide large data-sets into small datasets and running AP in them to ensure the center of clustering. Then, we introduced the structure similarity matrix to calculate the distance of the cluster center. At last, we used Density peak Clustering Algorithm (DP) algorithm to cluster the cluster again. The experimental results show that the improved algorithm is better than the original algorithm in the clustering effect and computation speed. © 2018, © 2018 Informa UK Limited, trading as Taylor & Francis Group.
引用
收藏
页码:1 / 6
页数:5
相关论文
共 50 条
  • [1] CLUSTERING LARGE-SCALE DATA BASED ON MODIFIED AFFINITY PROPAGATION ALGORITHM
    Serdah, Ahmed M.
    Ashour, Wesam M.
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2016, 6 (01) : 23 - 33
  • [2] An Improved Affinity Propagation Clustering Algorithm for Large-scale Data Sets
    Liu, Xiaonan
    Yin, Meijuan
    Luo, Junyong
    Chen, Wuping
    [J]. 2013 NINTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2013, : 894 - 899
  • [3] The Research on Large Scale Data Set Clustering Algorithm Based on Tag Set
    Chen, Qiang
    [J]. COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, (ISICA 2015), 2016, 575 : 365 - 372
  • [4] A stratified sampling based clustering algorithm for large-scale data
    Zhao, Xingwang
    Liang, Jiye
    Dang, Chuangyin
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 163 : 416 - 428
  • [5] Exploring gendered cycling behaviours within a large-scale behavioural data-set
    Beecham, Roger
    Wood, Jo
    [J]. TRANSPORTATION PLANNING AND TECHNOLOGY, 2014, 37 (01) : 83 - 97
  • [6] Fuzzy clustering algorithm based on multiple medoids for large-scale data
    Chen A.-G.
    Wang S.-T.
    [J]. Kongzhi yu Juece/Control and Decision, 2016, 31 (12): : 2122 - 2130
  • [7] Local and global approaches of affinity propagation clustering for large scale data
    Ding-yin Xia
    Fei Wu
    Xu-qing Zhang
    Yue-ting Zhuang
    [J]. Journal of Zhejiang University-SCIENCE A, 2008, 9 : 1373 - 1381
  • [8] Local and global approaches of affinity propagation clustering for large scale data
    Xia, Ding-yin
    Wu, Fei
    Zhang, Xu-qing
    Zhuang, Yue-ting
    [J]. JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE A, 2008, 9 (10): : 1373 - 1381
  • [10] A Local Approach of Adaptive Affinity Propagation Clustering for Large Scale Data
    Sun, Changyin
    Wang, Chenghong
    Song, Su
    Wang, Yifan
    [J]. IJCNN: 2009 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1- 6, 2009, : 161 - +