Scalable Implementation of a MapReduce-based Graph Processing Algorithm for Large-scale Heterogeneous Supercomputers

被引:5
|
作者
Shirahata, Koichi [1 ]
Sato, Hitoshi [1 ]
Suzumura, Toyotaro [1 ]
Matsuoka, Satoshi [1 ]
机构
[1] Tokyo Inst Technol, Dept Math & Comp Sci, Tokyo 152, Japan
关键词
Large-scale Graph Processing; GPGPU; MapReduce;
D O I
10.1109/CCGrid.2013.85
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Fast processing for extremely large-scale graph is becoming increasingly important in various domains such as health care, social networks, intelligence, system biology, and electric power grids. The GIM-V algorithm based on MapReduce programing model is designed as a general graph processing method for supporting petabyte-scale graph data. On the other hand, recent large-scale data-intensive computing systems tend to employ GPU accelerators to gain good peak performance and high memory bandwidth, however, the validity of acceleration, including optimization techniques, of the GIM-V algorithm using GPUs is an open problem. To address the problem, we implemented a multi-GPU-based GIM-V application with load balance optimization between GPU devices. Our implementation extends the existing MapReduce library for supporting multi-GPU-environments using the MPI library and optimizes load balance between GPU devices by employing task scheduling-based graph partitioning. We conducted our implementation on the TSUBAME2.0 supercomputer using 256 nodes (6144 hyper-threaded CPU cores, 768 GPUs). The results exhibit that our GPU-based implementation performed 87.04 ME/s on 2(30) (1.07 billion) vertices and 2(34) (17.2 billion) edges, and 1.52 times faster than the CPU-based naive implementation with 2(29) vertices and 2(33) edges. We also studied the performance characteristics of our implementation and load balance optimization technique.
引用
收藏
页码:277 / 284
页数:8
相关论文
共 50 条
  • [1] Out-of-core GPU Memory Management for MapReduce-based Large-scale Graph Processing
    Shirahata, Koichi
    Sato, Hitoshi
    Matsuoka, Satoshi
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2014, : 221 - 229
  • [2] MapReduce-based Dragonfly Algorithm for large-scale Data-Clustering
    Tripathi, Ashish Kumar
    Saxena, Pranav
    Gupta, Siddharth
    [J]. 2019 FIFTH INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP 2019), 2019, : 171 - 175
  • [3] Graph partitioning MapReduce-based algorithms for counting triangles in large-scale graphs
    Ahmed Sharafeldeen
    Mohammed Alrahmawy
    Samir Elmougy
    [J]. Scientific Reports, 13
  • [4] Graph partitioning MapReduce-based algorithms for counting triangles in large-scale graphs
    Sharafeldeen, Ahmed
    Alrahmawy, Mohammed
    Elmougy, Samir
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [5] A Large-Scale Implementation Using MapReduce-Based SVM for Tweets Sentiment Analysis
    Lijo, V. P.
    Seetha, Hari
    [J]. INTELLIGENT COMPUTING AND COMMUNICATION, ICICC 2019, 2020, 1034 : 541 - 549
  • [6] MELT: Mapreduce-based Efficient Large-scale Trajectory Anonymization
    Ward, Katrina
    Lin, Dan
    Madria, Sanjay
    [J]. SSDBM 2017: 29TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, 2017,
  • [7] ARLS: A MapReduce-based output analysis tool for large-scale simulations
    Lee, Kangsun
    Jung, Kwanghoon
    Park, Joonho
    Kwon, Dongseop
    [J]. ADVANCES IN ENGINEERING SOFTWARE, 2016, 95 : 28 - 37
  • [8] A MapReduce-based artificial bee colony for large-scale data clustering
    Banharnsakun, Anan
    [J]. PATTERN RECOGNITION LETTERS, 2017, 93 : 78 - 84
  • [9] A MapReduce-based approach for shortest path problem in large-scale networks
    Aridhi, Sabeur
    Lacomme, Philippe
    Ren, Libo
    Vincent, Benjamin
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 41 : 151 - 165
  • [10] Aeromancer: A Workflow Manager for Large-Scale MapReduce-Based Scientific Workflows
    Mohamed, Nabeel
    Maji, Nabanita
    Zhang, Jing
    Timoshevskaya, Nataliya
    Feng, Wu-Chun
    [J]. 2014 IEEE 13TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM), 2014, : 739 - 746