A Scalable and Composable Map-Reduce System

被引:0
|
作者
Arif, Mahwish [1 ]
Vandierendonck, Hans [1 ]
Nikolopoulos, Dimitrios S. [1 ]
de Supinski, Bronis R. [1 ]
机构
[1] Queens Univ Belfast, Belfast, Antrim, North Ireland
基金
英国工程与自然科学研究理事会;
关键词
Map-reduce model; performance; composition; programmability; MAPREDUCE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel map-reduce runtime system that is designed for scalability and for composition with other parallel software. We use a modified programming interface that expresses reduction operations over data containers as opposed to key-value pairs. This design choice admits higher efficiency as the programmer can select appropriate data structures. Our runtime targets shared memory systems, which are increasingly capable of performing data analytics on terabyte-sized data sets stored in-memory. Our map-reduce runtime is built over the Cilk programming language and outperforms Phoenix++, by 1.5x-4x for 5 out of 7 map-reduce benchmarks on 48 threads. These results arise from a combination of factors: (i) the reduction of framework overheads, including the elimination of repeated (de-) serialization of key-value pairs; (ii) the use of more appropriate intermediate data structures that reductions over containers support.
引用
收藏
页码:2233 / 2242
页数:10
相关论文
共 50 条
  • [1] Scalable Process Discovery Using Map-Reduce
    Evermann, Joerg
    [J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2016, 9 (03) : 469 - 481
  • [2] Research and implementation of scalable parallel computing based on Map-Reduce
    阮青强
    沈文枫
    柴亚辉
    徐炜民
    [J]. Advances in Manufacturing, 2011, 15 (05) : 426 - 429
  • [3] Research and implementation of scalable parallel computing based on Map-Reduce
    阮青强
    沈文枫
    柴亚辉
    徐炜民
    [J]. Journal of Shanghai University(English Edition), 2011, 15 (05) : 426 - 429
  • [4] Implementation of Map-Reduce Based Distributed System
    Wang Yidan
    Liu Yi
    Gao Boqi
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL INDUSTRIAL INFORMATICS AND COMPUTER ENGINEERING CONFERENCE, 2015, : 1014 - 1017
  • [5] Scalable preference queries for high-dimensional data using map-reduce
    Guzun, Gheorghi
    Tosado, Joel E.
    Canahuate, Guadalupe
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 2243 - 2252
  • [6] A Fast and Scalable Workflow for SNPs Detection in Genome Sequences Using Hadoop Map-Reduce
    Tahir, Muhammad
    Sardaraz, Muhammad
    [J]. GENES, 2020, 11 (02)
  • [7] Article Recommendation System Based on Keyword using Map-Reduce
    Singh, Sanjana
    Ahuja, Nishtha
    [J]. 2015 THIRD INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP), 2015, : 548 - 550
  • [8] The Evaluation of Map-Reduce Join Algorithms
    Penar, Maciej
    Wilczek, Artur
    [J]. BEYOND DATABASES, ARCHITECTURES AND STRUCTURES, BDAS 2016, 2016, 613 : 192 - 203
  • [9] Incremental Map-Reduce on Repository History
    Haertel, Johannes
    Laemmel, Ralf
    [J]. PROCEEDINGS OF THE 2020 IEEE 27TH INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER '20), 2020, : 320 - 331
  • [10] VDBSCAN Clustering with Map-Reduce Technique
    Sharma, Ashish
    Upadhyay, Dhara
    [J]. RECENT FINDINGS IN INTELLIGENT COMPUTING TECHNIQUES, VOL 2, 2018, 708 : 305 - 314