GPU enhanced parallel computing for large scale data clustering

被引:15
|
作者
Cui, Xiaohui [1 ,4 ]
St Charles, Jesse [3 ]
Potok, Thomas [2 ]
机构
[1] Oak Ridge Natl Lab, Dept Energy, Oak Ridge, TN 37831 USA
[2] Oak Ridge Natl Lab, Oak Ridge, TN 37831 USA
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[4] New York Inst Technol, New York, NY 10023 USA
关键词
GPU; Swarm intelligence; Data clustering; CUDA;
D O I
10.1016/j.future.2012.07.009
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Analyzing and clustering large scale data set is a complex problem. One explored method of solving this problem borrows from nature, imitating the flocking behavior of birds. One limitation of this method of data clustering is its complexity O(n(2)). As the number of data and feature dimensions grows, it becomes increasingly difficult to generate results in a reasonable amount of time. In the last few years, the graphics processing unit (GPU) has received attention for its ability to solve highly-parallel and semi-parallel problems much faster than the traditional sequential processor. In this paper, we have conducted research to exploit this architecture and apply its strengths to the flocking based high dimension data clustering problem. Using the CUDA platform from NVIDIA, we developed a Multiple Species Data Flocking implementation to be run on the NVIDIA GPU. Performance gains ranged from 30 to 60 times improvement of the GPU over the 3GHz CPU implementation. (c) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:1736 / 1741
页数:6
相关论文
共 50 条
  • [21] An efficient implementation of 3D high-resolution imaging for large-scale seismic data with GPU/CPU heterogeneous parallel computing
    Xu, Jincheng
    Liu, Wei
    Wang, Jin
    Liu, Linong
    Zhang, Jianfeng
    [J]. COMPUTERS & GEOSCIENCES, 2018, 111 : 272 - 282
  • [22] A heterogeneous parallel implementation of the Markov clustering algorithm for large-scale biological networks on distributed CPU–GPU clusters
    You Fu
    Wei Zhou
    [J]. The Journal of Supercomputing, 2022, 78 : 9017 - 9037
  • [23] An efficient parallel clustering algorithm for large scale database
    School of Electronic Information, Wuhan University, Wuhan, Hubei, China
    不详
    不详
    [J]. J. Softw., 2009, 10 (1119-1126):
  • [24] Large-Scale Welding Process Simulation by GPU Parallelized Computing
    Huang, H.
    Chen, J.
    Feng, Z.
    Wang, H-P
    Cal, W.
    Carlson, B. E.
    [J]. WELDING JOURNAL, 2021, 100 (11) : 359S - 370S
  • [25] GPU and CPU cooperation parallel visualisation for large seismic data
    Xie, K.
    Wu, P.
    Yang, S.
    [J]. ELECTRONICS LETTERS, 2010, 46 (17) : 1196 - U46
  • [26] GPU-Based Large Seismic Data Parallel Compression
    Xie, Kai
    Yu, H. Q.
    Lu, G. Y.
    [J]. INTELLIGENCE COMPUTATION AND EVOLUTIONARY COMPUTATION, 2013, 180 : 339 - 345
  • [27] Data Parallel Large Sparse Deep Neural Network on GPU
    Sattar, Naw Safrin
    Arifuzzaman, Shaikh
    [J]. 2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 1006 - 1014
  • [28] A GPU parallel computing method for LPUSS
    Kim, Chyon Hae
    Sugano, Shigeki
    [J]. ADVANCED ROBOTICS, 2013, 27 (15) : 1199 - 1207
  • [29] LARGE-SCALE PARALLEL MULTIBODY DYNAMICS WITH FRICTIONAL CONTACT ON THE GPU
    Negrut, Dan
    Tasora, Alessandro
    Anitescu, Mihai
    [J]. PROCEEDINGS OF THE ASME DYNAMIC SYSTEMS AND CONTROL CONFERENCE 2008, PTS A AND B, 2009, : 347 - 354
  • [30] Efficient parallel spectral clustering algorithm design for large data sets under cloud computing environment
    Jin R.
    Kou C.
    Liu R.
    Li Y.
    [J]. Journal of Cloud Computing, 2013, 2 (01)