Affinity propagation clustering algorithm based on large-scale data-set

Cited by: 34
Authors
Wang L. [1]
Zheng K. [1]
Tao X. [2]
Han X. [3]
Affiliations
[1] School of Management Science and Information Engineering, Jilin University of Finance and Economics, Changchun
[2] School of Library Information and Archives Management Engineering, Jilin University, Changchun
[3] School of Computer Science and Engineering, Changchun University of Technology, Jilin
Funding
National Natural Science Foundation of China
Keywords
affinity propagation algorithm; density peak algorithm; large-scale data-sets; structural similarity
DOI
10.1080/1206212X.2018.1425184
Abstract
The Affinity Propagation (AP) algorithm is not effective at processing large-scale data-sets, so this paper proposes an affinity propagation clustering algorithm for large-scale data-sets, called LD-AP. First, we use the idea of grid clustering to divide a large data-set into small data-sets and run AP on each of them to determine the cluster centers. Then, we introduce a structural similarity matrix to calculate the distances between the cluster centers. Finally, we use the Density Peak (DP) clustering algorithm to cluster these centers again. The experimental results show that the improved algorithm outperforms the original algorithm in both clustering quality and computation speed. © 2018 Informa UK Limited, trading as Taylor & Francis Group.
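The first stage described in the abstract (grid partitioning followed by AP inside each cell) might be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' LD-AP implementation: the abstract does not give the grid size, the similarity measure, or the AP preference, so the 2-D points, the `cell_size` parameter, the negative squared Euclidean similarity, the median preference, and the function names `grid_partition`, `affinity_propagation`, and `local_exemplars` are all hypothetical choices. The later stages (structural similarity between centers and the DP re-clustering) are not shown because the abstract does not define them.

```python
from collections import defaultdict


def grid_partition(points, cell_size):
    """Group 2-D points into square grid cells of side `cell_size` (assumed layout)."""
    cells = defaultdict(list)
    for x, y in points:
        cells[(int(x // cell_size), int(y // cell_size))].append((x, y))
    return dict(cells)


def affinity_propagation(points, damping=0.5, iterations=100):
    """Plain AP on a small point set; returns indices of the chosen exemplars."""
    n = len(points)
    if n == 1:
        return [0]
    # Assumed similarity: negative squared Euclidean distance.
    s = [[-((p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2) for q in points] for p in points]
    # Assumed preference: median of the off-diagonal similarities.
    off = sorted(s[i][k] for i in range(n) for k in range(n) if i != k)
    pref = off[len(off) // 2]
    for i in range(n):
        s[i][i] = pref
    r = [[0.0] * n for _ in range(n)]  # responsibilities
    a = [[0.0] * n for _ in range(n)]  # availabilities
    for _ in range(iterations):
        # Responsibility: how well suited k is as exemplar for i, versus rivals.
        for i in range(n):
            for k in range(n):
                best = max(a[i][j] + s[i][j] for j in range(n) if j != k)
                r[i][k] = damping * r[i][k] + (1 - damping) * (s[i][k] - best)
        # Availability: accumulated support for k as an exemplar.
        for k in range(n):
            pos = [max(0.0, r[i][k]) for i in range(n)]
            total = sum(pos) - pos[k]  # sum over i' != k
            for i in range(n):
                new = total if i == k else min(0.0, r[k][k] + total - pos[i])
                a[i][k] = damping * a[i][k] + (1 - damping) * new
    exemplars = [k for k in range(n) if r[k][k] + a[k][k] > 0]
    if not exemplars:
        # Fallback (an assumption of this sketch): keep the single best candidate.
        exemplars = [max(range(n), key=lambda k: r[k][k] + a[k][k])]
    return exemplars


def local_exemplars(points, cell_size):
    """Run AP inside each grid cell and collect the exemplar points."""
    centers = []
    for cell_points in grid_partition(points, cell_size).values():
        for idx in affinity_propagation(cell_points):
            centers.append(cell_points[idx])
    return centers


# Toy data: two small groups that fall into different grid cells.
points = [(0.1, 0.2), (0.2, 0.1), (0.3, 0.3), (5.1, 5.2), (5.3, 5.1), (5.2, 5.4)]
cells = grid_partition(points, cell_size=1.0)
centers = local_exemplars(points, cell_size=1.0)
print(sorted(cells), centers)
```

Because AP's message passing costs on the order of n² per iteration, running it on many small cells instead of the whole data-set is what makes this divide step attractive; the collected `centers` would then be the input to the merging stages the abstract describes.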
Pages: 1-6
Number of pages: 5
Related papers
50 in total
  • [21] Probability of large-scale data set EM clustering algorithms based on partial information constraints
    Liu, Xiaoyan
    PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS, 2016, 81 : 1748 - 1751
  • [22] OpenBHB: a Large-Scale Multi-Site Brain MRI Data-set for Age Prediction and Debiasing
    Dufumier, Benoit
    Grigis, Antoine
    Victor, Julie
    Ambroise, Corentin
    Frouin, Vincent
    Duchesnay, Edouard
    NEUROIMAGE, 2022, 263
  • [23] Genetic Algorithm Based Clustering for Large-Scale Sensor Networks
    Lin, Hai
    Kong, Ruoshan
    Liu, Jiali
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2015, 15 (06) : 168 - 177
  • [24] Large-scale parallel data clustering
    Judd, D
    McKinley, PK
    Jain, AK
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (08) : 871 - 876
  • [25] Density Peaks Clustering Algorithm for Large-scale Data Based on Divide-and-Conquer Strategy
    Wang, Yining
    2021 3RD INTERNATIONAL CONFERENCE ON MACHINE LEARNING, BIG DATA AND BUSINESS INTELLIGENCE (MLBDBI 2021), 2021, : 416 - 419
  • [26] A Fast Granular-Ball-Based Density Peaks Clustering Algorithm for Large-Scale Data
    Cheng, Dongdong
    Li, Ya
    Xia, Shuyin
    Wang, Guoyin
    Huang, Jinlong
    Zhang, Sulan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 35 (12) : 1 - 14
  • [27] Granular-ball-based Fast Spectral Embedding Clustering Algorithm for Large-Scale Data
    Liu, Shushu
    Cheng, Dongdong
    Xie, Jiang
    ACM International Conference Proceeding Series, : 16 - 20
  • [28] KD-tree Based Clustering Algorithm for Fast Face Recognition on Large-scale Data
    Wang, Yuanyuan
    Lin, Yaping
    Yang, Junfeng
    SEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2015), 2015, 9631
  • [29] On the Clustering of Large-scale Data: A Matrix-based Approach
    Wang, Lijun
    Dong, Ming
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 139 - 144
  • [30] CLUSTERING STUDY BASED ON A LARGE DATA SET OF QUANTUM GENETIC SPECTRAL CLUSTERING ALGORITHM
    Jiang Yong
    Tan Huailiang
    Li Guangwen
    Zhou Hengwei
    2011 INTERNATIONAL CONFERENCE ON INSTRUMENTATION, MEASUREMENT, CIRCUITS AND SYSTEMS (ICIMCS 2011), VOL 3: COMPUTER-AIDED DESIGN, MANUFACTURING AND MANAGEMENT, 2011, : 435 - 440