Paralleled Fast Search and Find of Density Peaks Clustering Algorithm on GPUs with CUDA

被引:0
|
作者
Li, Mi [1 ]
Huang, Jie [1 ]
Wang, Jingpeng [1 ]
机构
[1] Tongji Univ, Sch Software Engn, Shanghai, Peoples R China
关键词
Clustering; FSFDP; CUDA; Shared memory; Stream; GPU clusters;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fast Search and Find of Density Peaks (FSFDP) is a newly proposed clustering algorithm that has already been successfully applied in many applications. However, this algorithm shows a dissatisfactory performance on large dataset due to the time-consuming calculation of the distance matrix and potentials. In this paper, we proposed a GPU-accelerated FSFDP with CUDA to improve its performance. Thread/block models and the shared memory usage are dedicatedly designed to maximize the utilization of GPUs' hardware resources, and a merge accumulation algorithm based on the odd and even positions of an array is introduced as well. Experimental results show that our parallel implementation of FSFDP can reach a 4.39X and a 15.75X speedup for the calculation of the distance matrix and potentials respectively compared to the serial program on a single CPU core. Higher speedup can be expected for data of larger scales until the device limits are reached. Besides, CUDA stream mechanism is also employed and extra time savings can be obtained by hiding the corresponding memory latency of multiple kernels in a twoway streams' scheduling. Moreover, we evaluate our GPU-based implementation on GPU clusters of 9 nodes and compared to one GPU node, the program can achieve a further 7.55X speedup.
引用
收藏
页码:313 / 318
页数:6
相关论文
共 50 条
  • [1] Paralleled fast search and find of density peaks clustering algorithm on GPUs with CUDA
    Li M.
    Huang J.
    Wang J.
    International Journal of Networked and Distributed Computing, 2016, 4 (3) : 173 - 181
  • [2] Clustering by fast search and find of density peaks
    Rodriguez, Alex
    Laio, Alessandro
    SCIENCE, 2014, 344 (6191) : 1492 - 1496
  • [3] ICFS: An Improved Fast Search and Find of Density Peaks Clustering Algorithm
    Gao, Jing
    Zhao, Liang
    Chen, Zhikui
    Li, Peng
    Xu, Han
    Hu, Yueming
    2016 IEEE 14TH INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, 14TH INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, 2ND INTL CONF ON BIG DATA INTELLIGENCE AND COMPUTING AND CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/DATACOM/CYBERSC, 2016, : 537 - 543
  • [4] Fuzzy clustering by fast search and find of density peaks
    Mehmood, Rashid
    Dawood, Hussain
    Bie, Rongfang
    Ahmad, Haseeb
    2015 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION, AND KNOWLEDGE IN THE INTERNET OF THINGS (IIKI), 2015, : 258 - 261
  • [5] PARALLEL CLUSTERING BY FAST SEARCH AND FIND OF DENSITY PEAKS
    Ji Chengheng
    Lei Yongmei
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2016, : 563 - 567
  • [6] Adaptive Clustering by Fast Search and Find of Density Peaks
    Chen, Yuanyuan
    Ge, Lina
    Zhang, Guifen
    Zhou, Yongquan
    INTELLIGENT COMPUTING METHODOLOGIES, PT III, 2022, 13395 : 802 - 813
  • [7] A clustering algorithm for fuzzy numbers based on fast search and find of density peaks
    Li, Ye
    Chen, Yiyan
    Li, Qun
    INTELLIGENT DATA ANALYSIS, 2019, 23 : S25 - S52
  • [8] A fuzzy mixed data clustering algorithm by fast search and find of density peaks
    Li, Ye
    Chen, Yiyan
    Li, Qun
    INTELLIGENT DATA ANALYSIS, 2019, 23 : S199 - S224
  • [9] Adaptive fuzzy clustering by fast search and find of density peaks
    Bie, Rongfang
    Mehmood, Rashid
    Ruan, Shanshan
    Sun, Yunchuan
    Dawood, Hussain
    PERSONAL AND UBIQUITOUS COMPUTING, 2016, 20 (05) : 785 - 793
  • [10] Adaptive fuzzy clustering by fast search and find of density peaks
    Rongfang Bie
    Rashid Mehmood
    Shanshan Ruan
    Yunchuan Sun
    Hussain Dawood
    Personal and Ubiquitous Computing, 2016, 20 : 785 - 793