Tuple MapReduce and Pangool: an associated implementation

被引:0
|
作者
Pedro Ferrera
Ivan De Prado
Eric Palacios
Jose Luis Fernandez-Marquez
Giovanna Di Marzo Serugendo
机构
[1] Datasalt Systems S.L.,CUI
[2] University of Geneva,undefined
来源
关键词
MapReduce; Hadoop; Big Data; Distributed systems; Scalability;
D O I
暂无
中图分类号
学科分类号
摘要
This paper presents Tuple MapReduce, a new foundational model extending MapReduce with the notion of tuples. Tuple MapReduce allows to bridge the gap between the low-level constructs provided by MapReduce and higher-level needs required by programmers, such as compound records, sorting, or joins. This paper shows as well Pangool, an open-source framework implementing Tuple MapReduce. Pangool eases the design and implementation of applications based on MapReduce and increases their flexibility, still maintaining Hadoop’s performance. Additionally, this paper shows: pseudo-codes for relational joins, rollup, and the PageRank algorithm; a Pangool’s code example; benchmark results comparing Pangool with existing approaches; reports from users of Pangool in industry; and the description of a distributed database exploiting Pangool. These results show that Tuple MapReduce can be used as a direct, better-suited replacement of the MapReduce model in current implementations without the need of modifying key system fundamentals.
引用
收藏
页码:531 / 557
页数:26
相关论文
共 50 条
  • [1] Tuple MapReduce and Pangool: an associated implementation
    Ferrera, Pedro
    De Prado, Ivan
    Palacios, Eric
    Fernandez-Marquez, Jose Luis
    Serugendo, Giovanna Di Marzo
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (02) : 531 - 557
  • [2] Tuple MapReduce: Beyond classic MapReduce
    Ferrera, Pedro
    de Prado, Ivan
    Palacios, Eric
    Fernandez-Marquez, Jose Luis
    Serugendo, Giovanna Di Marzo
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 260 - 269
  • [3] MapReduce Solutions Classification by Their Implementation
    Orynbekova, Kamila
    Bogdanchikov, Andrey
    Cankurt, Selcuk
    Adamov, Abzatdin
    Kadyrov, Shirali
    [J]. INTERNATIONAL JOURNAL OF ENGINEERING PEDAGOGY, 2023, 13 (05): : 58 - 71
  • [4] Luminance Control with MapReduce Implementation
    He, Zen-long
    Lee, Shie-jue
    Wu, Chih-hung
    [J]. INTERNATIONAL CONFERENCE ON INFORMATICS, CONTROL AND AUTOMATION (ICA 2015), 2015, : 31 - 35
  • [5] Tenzing A SQL Implementation On The MapReduce Framework
    Chattopadhyay, Biswapesh
    Lin, Liang
    Liu, Weiran
    Mittal, Sagar
    Aragonada, Prathyusha
    Lychagina, Vera
    Kwon, Younghee
    Wong, Michael
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (12): : 1318 - 1327
  • [6] MapReduce Model Implementation on MPI Platform
    Guo Yucheng
    [J]. PROCEEDINGS OF THIRTEENTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE, (DCABES 2014), 2014, : 88 - 91
  • [7] MAPREDUCE IMPLEMENTATION WITH MULTI-GPU
    Chen, Yi
    Chen, Su
    Jiang, Hai
    [J]. INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE & TECHNOLOGY: PROCEEDINGS, 2012, : 21 - 25
  • [8] Colorful triangle counting and a MAPREDUCE implementation
    Pagh, Rasmus
    Tsourakakis, Charalampos E.
    [J]. INFORMATION PROCESSING LETTERS, 2012, 112 (07) : 277 - 281
  • [9] Phoenix: A MapReduce Implementation With New Enhancements
    Al-Badarneh, Amer
    Najadat, Hassan
    Al-Soud, Majd
    Mosaid, Rasha
    [J]. 2016 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSIT), 2016,
  • [10] TTSLI:: An implementation of Tree-Tuple Synchronized languages
    Lecland, B
    Réty, P
    [J]. REWRITING TECHNIQUES AND APPLICATIONS, 2002, 2378 : 376 - 379