An efficient parallel collaborative filtering algorithm on multi-GPU platform

被引:0
|
作者
Zhongya Wang
Ying Liu
Steve Chiu
机构
[1] University of Chinese Academy of Sciences,School of Computer and Control
[2] Chinese Academy of Sciences,Fictitious Economy and Data Science Research Center
[3] Idaho State University,College of Science and Engineering
来源
关键词
Parallel computing; CUDA; Collaborative filtering; Multi-GPU; Recommendation system;
D O I
暂无
中图分类号
学科分类号
摘要
Collaborative filtering (CF) is one of the essential algorithms in recommendation system. As the size of the data in real applications is huge, usually at the magnitude of Petabytes, parallel computing technique is required to accelerate the computation. Due to GPU’s tremendous computing capability, it has emerged as the co-processor of the CPU to achieve a high overall throughput. In this paper, we identify the computation kernel, similarity matrix calculation. Then, we present a CUDA multithread model, where the data elements are processed in a data-parallel fashion. A workload partitioning scheme is proposed to balance the workload distributed to different GPUs. In the experiments, our CUDA-enabled CF algorithm significantly outperforms the serial CF workstation, achieving up to 3,691×\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times $$\end{document} speedup using four Tesla K10 graphics cards. It also shows good scalability when varying the number of users, the number of items and the number of GPUs.
引用
下载
收藏
页码:2080 / 2094
页数:14
相关论文
共 50 条
  • [1] An efficient parallel collaborative filtering algorithm on multi-GPU platform
    Wang, Zhongya
    Liu, Ying
    Chiu, Steve
    JOURNAL OF SUPERCOMPUTING, 2016, 72 (06): : 2080 - 2094
  • [2] Efficient parallel A* search on multi-GPU system
    He, Xin
    Yao, Yapeng
    Chen, Zhiwen
    Sun, Jianhua
    Chen, Hao
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 123 : 35 - 47
  • [3] A Multi-GPU Parallel Algorithm in Hypersonic Flow Computations
    Lai, Jianqi
    Li, Hua
    Tian, Zhengyu
    Zhang, Ye
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2019, 2019
  • [4] Cardiac simulation on multi-GPU platform
    Nimmagadda, Venkata Krishna
    Akoglu, Ali
    Hariri, Salim
    Moukabary, Talal
    JOURNAL OF SUPERCOMPUTING, 2012, 59 (03): : 1360 - 1378
  • [5] Cardiac simulation on multi-GPU platform
    Venkata Krishna Nimmagadda
    Ali Akoglu
    Salim Hariri
    Talal Moukabary
    The Journal of Supercomputing, 2012, 59 : 1360 - 1378
  • [6] Multi-GPU Parallel Memetic Algorithm for Capacitated Vehicle Routing Problem
    Wodecki, Mieczyslaw
    Bozejko, Wojciech
    Karpinski, Michaffl
    Pacut, Maciej
    PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT II, 2014, 8385 : 207 - 214
  • [7] Multi-GPU Accelerated Parallel Algorithm of Wallis Transformation for Image Enhancement
    Xiao, Han
    Song, Yu-Pu
    Zhou, Qing-Lei
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2014, 7 (02): : 99 - 114
  • [8] A multi-GPU parallel optimization model for the preconditioned conjugate gradient algorithm
    Gao, Jiaquan
    Zhou, Yuanshen
    He, Guixia
    Xia, Yifei
    PARALLEL COMPUTING, 2017, 63 : 1 - 16
  • [9] Parallel Algorithm for Landform Attributes Representation on Multicore and Multi-GPU Systems
    Boratto, Murilo
    Alonso, Pedro
    Ramiro, Carla
    Barreto, Marcos
    Coelho, Leandro
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2012, PT I, 2012, 7333 : 29 - 43
  • [10] A Parallel Preconditioned Conjugate Gradient Solver for the Poisson Problem on a Multi-GPU Platform
    Ament, M.
    Knittel, G.
    Weiskopf, D.
    Strasser, W.
    PROCEEDINGS OF THE 18TH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, 2010, : 583 - 592