AJIRA: a Lightweight Distributed Middleware for MapReduce and Stream Processing

被引:11
|
作者
Urbani, Jacopo [1 ]
Margara, Alessandro [1 ]
Jacobs, Ceriel [1 ]
Voulgaris, Spyros [1 ]
Bal, Henri [1 ]
机构
[1] Vrije Univ Amsterdam, Dept Comp Sci, Amsterdam, Netherlands
关键词
D O I
10.1109/ICDCS.2014.62
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Currently, MapReduce is the most popular programming model for large-scale data processing and this motivated the research community to improve its efficiency either with new extensions, algorithmic optimizations, or hardware. In this paper we address two main limitations of MapReduce: one relates to the model's limited expressiveness, which prevents the implementation of complex programs that require multiple steps or iterations. The other relates to the efficiency of its most popular implementations (e.g., Hadoop), which provide good resource utilization only for massive volumes of input, operating suboptimally for smaller or rapidly changing input. To address these limitations, we present AJIRA, a new middleware designed for efficient and generic data processing. At a conceptual level, AJIRA replaces the traditional map/reduce primitives by generic operators that can be dynamically allocated, allowing the execution of more complex batch and stream processing jobs. At a more technical level, AJIRA adopts a distributed, multi-threaded architecture that strives at minimizing overhead for non-critical functionality. These characteristics allow AJIRA to be used as a single programming model for both batch and stream processing. To this end, we evaluated its performance against Hadoop, Spark, Esper, and Storm, which are state of the art systems for both batch and stream processing. Our evaluation shows that AJIRA is competitive in a wide range of scenarios both in terms of processing time and scalability, making it an ideal choice where flexibility, extensibility, and the processing of both large and dynamic data with a single programming model are either desirable or even mandatory requirements.
引用
收藏
页码:545 / 554
页数:10
相关论文
共 50 条
  • [1] SMILE - Distributed middleware for event stream processing
    Strom, Rob
    Dorai, Chitra
    Buttner, Gerry
    Li, Ying
    [J]. PROCEEDINGS OF THE SIXTH INTERNATIONAL SYMPOSIUM ON INFORMATION PROCESSING IN SENSOR NETWORKS, 2007, : 553 - 554
  • [2] Stream Processing on GPUs Using Distributed Multimedia Middleware
    Repplinger, Michael
    Slusallek, Philipp
    [J]. PARALLEL PROCESSING AND APPLIED MATHEMATICS, PT I, 2010, 6067 : 429 - 438
  • [3] Stream processing on GPUs using distributed multimedia middleware
    Repplinger, Michael
    Slusallek, Philipp
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2011, 23 (07): : 669 - 680
  • [4] Biologically-Inspired Distributed Middleware Management for Stream Processing Systems
    Lakshmanan, Geetika T.
    Strom, Robert E.
    [J]. MIDDLEWARE 2008, PROCEEDINGS, 2008, 5346 : 223 - 242
  • [5] Moving Enterprise Integration Middleware toward the Distributed Stream Processing Architecture
    Bakulev, Alexander
    Bakuleva, Marina
    [J]. 2019 8TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2019, : 192 - 195
  • [6] Dragon: A Lightweight, High Performance Distributed Stream Processing Engine
    Harwood, Aaron
    Read, Maria Rodriguez
    Amarasinghe, Gayashan Niroshana
    [J]. 2020 IEEE 40TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2020, : 1344 - 1351
  • [7] A Lightweight Modeling Middleware for Corpus Processing
    Gaertner, Markus
    Kuhn, Jonas
    [J]. PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 1087 - 1095
  • [8] DSCAGENTS: A LIGHTWEIGHT MIDDLEWARE FOR DISTRIBUTED SMART CAMERAS
    Quaritsch, Markus
    Rinner, Bernhard
    [J]. 2008 SECOND ACM/IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED SMART CAMERAS, 2008, : 565 - 572
  • [9] A middleware for efficient stream processing in CUDA
    Nakagawa, Shinta
    Ino, Fumihiko
    Hagihara, Kenichi
    [J]. COMPUTER SCIENCE-RESEARCH AND DEVELOPMENT, 2010, 25 (1-2): : 41 - 49
  • [10] A Lightweight Middleware Monitor for Distributed Scientific Workflows
    Serra da Cruz, Sergio Manuel
    da Silva, Fabricio Nogueira
    Gadelha, Luiz M. R., Jr.
    Reis Cavalcanti, Maria Claudia
    Campos, Maria Luiza M.
    Mattoso, Marta
    [J]. CCGRID 2008: EIGHTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, VOLS 1 AND 2, PROCEEDINGS, 2008, : 693 - +