Parallel TNN spectral clustering algorithm in CPU-GPU heterogeneous computing environment

被引:0
|
作者
Zhang, Shuai [1 ]
Li, Tao [1 ]
Jiao, Xiaofan [1 ]
Wang, Yifeng [1 ]
Yang, Yulu [1 ]
机构
[1] Department of Computer Science and Information Security, College of Computer and Control Engineering, Nankai University, Tianjin,300071, China
关键词
D O I
10.7544/issn1000-1239.2015.20148151
中图分类号
学科分类号
摘要
Spectral clustering is one of the most popular clustering algorithms in the data mining field. However, this algorithm suffers from the storage and computational bottlenecks heavily when dealing with large-scale datasets. Current work focuses on improving the spectral clustering on both algorithm and implementation levels. But how to design an efficient spectral clustering algorithm, which can handle million scale datasets on a single node with multicore CPU and manycore accelerators, is still an unsolved problem. A parallel spectral clustering using T-nearest-neighbors (TNN) and its implementation for CPU-GPU heterogeneous computing environment, named parallel spectral clustering for hybrids (PSCH), is proposed in this paper. It breaks the GPU device memory limitation by partitioning the TNN similarity matrix into blocks, so the dataset scale only subjects to the size of the host memory. In PSCH, the 4-stage pipeline mechanism with dual rotating buffers is designed to compute the TNN similarity matrix using CUDA, which keeps all the CPU, GPU, and PCIe bus busy to achieve high performance gains while breaking the device memory limitation. The implicitly restarted Lanczos method (IRIM) on GPU is employed for the eigen-decomposition of the sparse TNN similarity matrix, alleviating the computational bottleneck of the eigensolver. The results show that PSCH is highly-efficient at exploring the GPU memory bandwidth and hybrid CPU-GPU computation power. PSCH is able to cluster million scale datasets on a single node equipped with one GTX 480 GPU and achieve 2.0~4.5 times performance gains compared with the MPI parallel spectral clustering implementation PSC using 16 processes for 4 datasets. © 2015, Science Press. All right reserved.
引用
收藏
页码:2555 / 2567
相关论文
共 50 条
  • [1] Parallel and accurate k-means algorithm on CPU-GPU architectures for spectral clustering
    He, Guanlin
    Vialle, Stephane
    Baboulin, Marc
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (14):
  • [2] Heterogeneous Computing (CPU-GPU) for Pollution Dispersion in an Urban Environment
    Fernandez, Gonzalo
    Mendina, Mariana
    Usera, Gabriel
    [J]. COMPUTATION, 2020, 8 (01)
  • [3] Parallel Smoothers in Multigrid Method for Heterogeneous CPU-GPU Environment
    Iyer, Neha
    Ganesan, Sashikumaar
    [J]. PARALLEL COMPUTING: TECHNOLOGY TRENDS, 2020, 36 : 114 - 123
  • [4] The Design and Implementation of Parallel Algorithm Accelerator Based on CPU-GPU Collaborative Computing Environment
    Yang Fan
    Shi Tongnian
    Chu Han
    Wang Kun
    [J]. OPTICAL, ELECTRONIC MATERIALS AND APPLICATIONS II, 2012, 529 : 408 - +
  • [5] Algorithm for Cooperative CPU-GPU Computing
    Aciu, Razvan-Mihai
    Ciocarlie, Horia
    [J]. 2013 15TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2013), 2014, : 352 - 358
  • [6] A Survey of CPU-GPU Heterogeneous Computing Techniques
    Mittal, Sparsh
    Vetter, Jeffrey S.
    [J]. ACM COMPUTING SURVEYS, 2015, 47 (04)
  • [7] Optimization of Parallel Algorithm for Kalman Filter on CPU-GPU Heterogeneous System
    Xu, Dandan
    Xiao, Zheng
    Li, Dapu
    Wu, Fan
    [J]. 2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 2165 - 2172
  • [8] Parabolic Radon transform parallel algorithm for CPU-GPU heterogeneous platform
    Zhang, Quan
    Lin, Baiyue
    Yang, Bo
    Peng, Bo
    Zhang, Wei
    Tu, Ran
    [J]. Shiyou Diqiu Wuli Kantan/Oil Geophysical Prospecting, 2020, 55 (06): : 1263 - 1270
  • [9] Development of a CPU-GPU heterogeneous platform based on a nonlinear parallel algorithm
    Ma, Haifeng
    [J]. NONLINEAR ENGINEERING - MODELING AND APPLICATION, 2022, 11 (01): : 215 - 222
  • [10] A Heterogeneous Parallel Computing Approach Optimizing SpTTM on CPU-GPU via GCN
    Wang, Haotian
    Yang, Wangdong
    Ouyang, Renqiu
    Hu, Rong
    Li, Kenli
    Li, Keqin
    [J]. ACM TRANSACTIONS ON PARALLEL COMPUTING, 2023, 10 (02)