Tuple MapReduce and Pangool: an associated implementation

被引:0
|
作者
Pedro Ferrera
Ivan De Prado
Eric Palacios
Jose Luis Fernandez-Marquez
Giovanna Di Marzo Serugendo
机构
[1] Datasalt Systems S.L.,CUI
[2] University of Geneva,undefined
来源
关键词
MapReduce; Hadoop; Big Data; Distributed systems; Scalability;
D O I
暂无
中图分类号
学科分类号
摘要
This paper presents Tuple MapReduce, a new foundational model extending MapReduce with the notion of tuples. Tuple MapReduce allows to bridge the gap between the low-level constructs provided by MapReduce and higher-level needs required by programmers, such as compound records, sorting, or joins. This paper shows as well Pangool, an open-source framework implementing Tuple MapReduce. Pangool eases the design and implementation of applications based on MapReduce and increases their flexibility, still maintaining Hadoop’s performance. Additionally, this paper shows: pseudo-codes for relational joins, rollup, and the PageRank algorithm; a Pangool’s code example; benchmark results comparing Pangool with existing approaches; reports from users of Pangool in industry; and the description of a distributed database exploiting Pangool. These results show that Tuple MapReduce can be used as a direct, better-suited replacement of the MapReduce model in current implementations without the need of modifying key system fundamentals.
引用
收藏
页码:531 / 557
页数:26
相关论文
共 50 条
  • [31] Research And Implementation of Iterative MapReduce Based On BP Algorithm
    Yang, Yu
    Zhang, Longjun
    [J]. PROCEEDINGS OF THE 2016 3RD INTERNATIONAL CONFERENCE ON MATERIALS ENGINEERING, MANUFACTURING TECHNOLOGY AND CONTROL, 2016, 67 : 507 - 510
  • [32] MRWorkflow: Design and Implementation of a Workflow System for MapReduce Applications
    Kim, Shin-gyu
    Han, Hyuck
    Yeom, Heon Y.
    [J]. INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2011, 14 (04): : 1363 - 1371
  • [33] MapReduce Implementation of a Multinomial and Mixed Naive Bayes Classifier
    Bagui, Sikha
    Devulapalli, Keerthi
    John, Sharon
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2020, 16 (02) : 1 - 23
  • [34] Design and implementation of algorithm Apriori based on iterative MapReduce
    [J]. Ji, G. (glji@njnu.edu.cn), 1600, Huazhong University of Science and Technology (40):
  • [35] A Parallel Implementation of Relief Algorithm Using Mapreduce Paradigm
    Yazidi, Jamila
    Bouaguel, Waad
    Essoussi, Nadia
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2016, PT II, 2016, 9876 : 418 - 425
  • [36] An Implementation Framework of MapReduce Email Social Network Analysis
    Gau, Rung-Hung
    Hsieh, Tzu-Chiang
    Tsai, Sheng-Wen
    Cheng, Ching-Pei
    [J]. WMUNEP 11: PROCEEDINGS OF THE SEVENTH ACM WORKSHOP ON WIRELESS MULTIMEDIA NETWORKING AND COMPUTING, 2011, : 67 - 69
  • [37] An ordinal random forest and its parallel implementation with MapReduce
    Wang, Shanshan
    Zhai, Junhai
    Zhang, Sufang
    Zhu, Hong
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 2170 - 2173
  • [38] SmartCache: An Optimized MapReduce Implementation of Frequent Itemset Mining
    Huang, Dachuan
    Song, Yang
    Routray, Ramani
    Qin, Feng
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E 2015), 2015, : 16 - 25
  • [39] Study and Implementation of a Solution to Security Management for Mobile Environments Based on Tuple
    Nguessan, Desire
    Colombo Martini, Jose Sidnei
    [J]. PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE FOR YOUNG COMPUTER SCIENTISTS, VOLS 1-5, 2008, : 2014 - 2020
  • [40] A P2P tuple space implementation for disconnected MANETs
    Abdulkader Benchi
    Pascale Launay
    Frédéric Guidec
    [J]. Peer-to-Peer Networking and Applications, 2015, 8 : 87 - 102