Scalable Implementation of a MapReduce-based Graph Processing Algorithm for Large-scale Heterogeneous Supercomputers

被引:5
|
作者
Shirahata, Koichi [1 ]
Sato, Hitoshi [1 ]
Suzumura, Toyotaro [1 ]
Matsuoka, Satoshi [1 ]
机构
[1] Tokyo Inst Technol, Dept Math & Comp Sci, Tokyo 152, Japan
关键词
Large-scale Graph Processing; GPGPU; MapReduce;
D O I
10.1109/CCGrid.2013.85
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Fast processing for extremely large-scale graph is becoming increasingly important in various domains such as health care, social networks, intelligence, system biology, and electric power grids. The GIM-V algorithm based on MapReduce programing model is designed as a general graph processing method for supporting petabyte-scale graph data. On the other hand, recent large-scale data-intensive computing systems tend to employ GPU accelerators to gain good peak performance and high memory bandwidth, however, the validity of acceleration, including optimization techniques, of the GIM-V algorithm using GPUs is an open problem. To address the problem, we implemented a multi-GPU-based GIM-V application with load balance optimization between GPU devices. Our implementation extends the existing MapReduce library for supporting multi-GPU-environments using the MPI library and optimizes load balance between GPU devices by employing task scheduling-based graph partitioning. We conducted our implementation on the TSUBAME2.0 supercomputer using 256 nodes (6144 hyper-threaded CPU cores, 768 GPUs). The results exhibit that our GPU-based implementation performed 87.04 ME/s on 2(30) (1.07 billion) vertices and 2(34) (17.2 billion) edges, and 1.52 times faster than the CPU-based naive implementation with 2(29) vertices and 2(33) edges. We also studied the performance characteristics of our implementation and load balance optimization technique.
引用
收藏
页码:277 / 284
页数:8
相关论文
共 50 条
  • [41] A survey of large-scale analytical query processing in MapReduce
    Christos Doulkeridis
    Kjetil Nørvåg
    [J]. The VLDB Journal, 2014, 23 : 355 - 380
  • [42] A survey of large-scale analytical query processing in MapReduce
    Doulkeridis, Christos
    Norvag, Kjetil
    [J]. VLDB JOURNAL, 2014, 23 (03): : 355 - 380
  • [43] The Family of MapReduce and Large-Scale Data Processing Systems
    Sakr, Sherif
    Liu, Anna
    Fayoumi, Ayman G.
    [J]. ACM COMPUTING SURVEYS, 2013, 46 (01)
  • [44] MapReduce-based fuzzy c-means clustering algorithm: implementation and scalability
    Ludwig, Simone A.
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2015, 6 (06) : 923 - 934
  • [45] MR-DBSCAN: a scalable MapReduce-based DBSCAN algorithm for heavily skewed data
    He, Yaobin
    Tan, Haoyu
    Luo, Wuman
    Feng, Shengzhong
    Fan, Jianping
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2014, 8 (01) : 83 - 99
  • [46] A Large-Scale Graph Learning Framework of Technological Gatekeepers by MapReduce
    Liu Tong
    Guo Wensheng
    [J]. 2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 1997 - 2003
  • [47] MapReduce-based fuzzy c-means clustering algorithm: implementation and scalability
    Simone A. Ludwig
    [J]. International Journal of Machine Learning and Cybernetics, 2015, 6 : 923 - 934
  • [48] Student Research Poster: A Scalable General Purpose System for Large-Scale Graph Processing
    Sun, Jiawen
    [J]. 2016 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURE AND COMPILATION TECHNIQUES (PACT), 2016, : 456 - 456
  • [49] Scalable Community Search over Large-scale Graphs based on Graph Transformer
    Wang, Yuxiang
    Gou, Xiaoxuan
    Xu, Xiaoliang
    Geng, Yuxia
    Ke, Xiangyu
    Wu, Tianxing
    Yu, Zhiyuan
    Chen, Runhuai
    Wu, Xiangying
    [J]. PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 1680 - 1690
  • [50] Large-scale graph processing systems: a survey
    Liu, Ning
    Li, Dong-sheng
    Zhang, Yi-ming
    Li, Xiong-lve
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (03) : 384 - 404