Design and Implementation of MapReduce using the PGAS Programming Model with UPC

被引:4
|
作者
Teijeiro, Carlos [1 ]
Taboada, Guillermo L. [1 ]
Tourino, Juan [1 ]
Doallo, Ramon [1 ]
机构
[1] Univ A Coruna, Dept Elect & Syst, Comp Architecture Grp, Fac Informat, La Coruna 15071, Spain
关键词
UPC; MapReduce; HPC; programmability; collective primitives;
D O I
10.1109/ICPADS.2011.162
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
MapReduce is a powerful tool for processing large data sets used by many applications running in distributed environments. However, despite the increasing number of computationally intensive problems that require low-latency communications, the adoption of MapReduce in High Performance Computing (HPC) is still emerging. Here languages based on the Partitioned Global Address Space (PGAS) programming model have shown to be a good choice for implementing parallel applications, in order to take advantage of the increasing number of cores per node and the programmability benefits achieved by their global memory view, such as the transparent access to remote data. This paper presents the first PGAS-based MapReduce implementation that uses the Unified Parallel C (UPC) language, which (1) obtains programmability benefits in parallel programming, (2) offers advanced configuration options to define a customized load distribution for different codes, and (3) overcomes performance penalties and bottlenecks that have traditionally prevented the deployment of MapReduce applications in HPC. The performance evaluation of representative applications on shared and distributed memory environments assesses the scalability of the presented MapReduce framework, confirming its suitability.
引用
收藏
页码:196 / 203
页数:8
相关论文
共 50 条
  • [1] PGAS Approach to Implement Mapreduce Framework Based on UPC Language
    Aday, Shomanov
    Darkhan, Akhmed-Zaki
    Madina, Mansurova
    [J]. PARALLEL COMPUTING TECHNOLOGIES (PACT 2017), 2017, 10421 : 342 - 350
  • [2] Performance Comparison of Graph BFS Implemented in MapReduce and PGAS Programming Models
    Ryczkowska, Magdalena
    Nowicki, Marek
    [J]. PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2017), PT II, 2018, 10778 : 328 - 337
  • [3] Phylogenetic Analysis using MapReduce Programming Model
    Siddesh, G. M.
    Srinivasa, K. G.
    Mishra, Ishank
    Anurag, Abhinav
    Uppal, Eklavya
    [J]. 2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, : 350 - 356
  • [4] Design of K-means clustering algorithm in PGAS based Mapreduce framework
    Shomanov, A. S.
    Mansurova, M. E.
    Nugumanova, A. B.
    [J]. 2018 IEEE 12TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT), 2018, : 158 - 160
  • [5] Design and Implementation of Parallelized LDA Topic Model Based on MapReduce
    Yan, Duan-wu
    Li, Tie-jun
    Yang, Xiong-fei
    Chen, Kun
    [J]. 2018 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND NETWORK TECHNOLOGY (CCNT 2018), 2018, 291 : 274 - 278
  • [6] Dataflow-like synchronization in a PGAS programming model
    Breitbart, Jens
    [J]. 2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 762 - 769
  • [7] On the Performance and Energy Efficiency of the PGAS Programming Model on Multicore Architectures
    Lagraviere, Jeremie
    Langguth, Johannes
    Sourouri, Mohammed
    Ha, Phuong H.
    Cai, Xing
    [J]. 2016 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS 2016), 2016, : 800 - 807
  • [8] DESIGN AND IMPLEMENTATION OF PARALLEL TERM CONTRIBUTION ALGORITHM BASED ON MAPREDUCE MODEL
    Chao, Peng
    Bin, Wu
    Chao, Deng
    [J]. PROCEEDINGS OF THE 2012 SEVENTH OPEN CIRRUS SUMMIT (OCS 2012), 2012, : 43 - 47
  • [9] A Scalable Approach for Improving Implementation of a Frequent Pattern Mining Algorithm using MapReduce Programming
    Hasan, Md Abed
    Hassan, Naima
    Hasibuzzaman, Md
    Huq, Mohammad Rezwanul
    [J]. 2019 5TH INTERNATIONAL CONFERENCE ON SCIENCE ININFORMATION TECHNOLOGY (ICSITECH): EMBRACING INDUSTRY 4.0 - TOWARDS INNOVATION IN CYBER PHYSICAL SYSTEM, 2019, : 106 - 111
  • [10] Optimizing Crawler4j using MapReduce Programming Model
    Siddesh G.M.
    Suresh K.
    Madhuri K.Y.
    Nijagal M.
    Rakshitha B.R.
    Srinivasa K.G.
    [J]. Siddesh, G.M. (siddeshgm14@gmail.com), 1600, Springer (98): : 329 - 336