AJIRA: a Lightweight Distributed Middleware for MapReduce and Stream Processing

被引:11
|
作者
Urbani, Jacopo [1 ]
Margara, Alessandro [1 ]
Jacobs, Ceriel [1 ]
Voulgaris, Spyros [1 ]
Bal, Henri [1 ]
机构
[1] Vrije Univ Amsterdam, Dept Comp Sci, Amsterdam, Netherlands
关键词
D O I
10.1109/ICDCS.2014.62
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Currently, MapReduce is the most popular programming model for large-scale data processing and this motivated the research community to improve its efficiency either with new extensions, algorithmic optimizations, or hardware. In this paper we address two main limitations of MapReduce: one relates to the model's limited expressiveness, which prevents the implementation of complex programs that require multiple steps or iterations. The other relates to the efficiency of its most popular implementations (e.g., Hadoop), which provide good resource utilization only for massive volumes of input, operating suboptimally for smaller or rapidly changing input. To address these limitations, we present AJIRA, a new middleware designed for efficient and generic data processing. At a conceptual level, AJIRA replaces the traditional map/reduce primitives by generic operators that can be dynamically allocated, allowing the execution of more complex batch and stream processing jobs. At a more technical level, AJIRA adopts a distributed, multi-threaded architecture that strives at minimizing overhead for non-critical functionality. These characteristics allow AJIRA to be used as a single programming model for both batch and stream processing. To this end, we evaluated its performance against Hadoop, Spark, Esper, and Storm, which are state of the art systems for both batch and stream processing. Our evaluation shows that AJIRA is competitive in a wide range of scenarios both in terms of processing time and scalability, making it an ideal choice where flexibility, extensibility, and the processing of both large and dynamic data with a single programming model are either desirable or even mandatory requirements.
引用
收藏
页码:545 / 554
页数:10
相关论文
共 50 条
  • [21] A Lightweight Indexing Approach for Efficient Batch Similarity Processing with MapReduce
    Phan T.N.
    Dang T.K.
    [J]. SN Computer Science, 2020, 1 (1)
  • [22] Distributed Stream Processing with DUP
    Bader, Kai Christian
    Eissler, Tilo
    Evans, Nathan
    GauthierDickey, Chris
    Grothoff, Christian
    Grothoff, Krista
    Keene, Jeff
    Meier, Harald
    Ritzdorf, Craig
    Rutherford, Matthew J.
    [J]. NETWORK AND PARALLEL COMPUTING, 2010, 6289 : 232 - +
  • [23] Reliable stream data processing for elastic distributed stream processing systems
    Xiaohui Wei
    Yuan Zhuang
    Hongliang Li
    Zhiliang Liu
    [J]. Cluster Computing, 2020, 23 : 555 - 574
  • [24] Reliable stream data processing for elastic distributed stream processing systems
    Wei, Xiaohui
    Zhuang, Yuan
    Li, Hongliang
    Liu, Zhiliang
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, 23 (02): : 555 - 574
  • [25] Optimizing Cloud MapReduce for Processing Stream Data using Pipelining
    Karve, Rutvik
    Dahiphale, Devendra
    Chhajer, Amit
    [J]. UKSIM FIFTH EUROPEAN MODELLING SYMPOSIUM ON COMPUTER MODELLING AND SIMULATION (EMS 2011), 2011, : 344 - 349
  • [26] A lightweight and distributed middleware to provide presence awareness in mobile ubiquitous systems
    Rodriguez-Covili, Juan
    Ochoa, Sergio F.
    [J]. SCIENCE OF COMPUTER PROGRAMMING, 2013, 78 (10) : 2009 - 2025
  • [27] A flexible, lightweight middleware supporting the development of distributed applications across platforms
    Baloian, Nelson
    Zurita, Gustavo
    Antunez, Pedro
    Baytelman, Felipe
    [J]. PROCEEDINGS OF THE 2007 11TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, VOLS 1 AND 2, 2007, : 92 - +
  • [28] Lightweight morphing support for evolving middleware data exchanges in distributed applications
    Agarwala, S
    Eisenhauer, G
    Schwan, K
    [J]. 25TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 2005, : 697 - 706
  • [29] Efficient Processing Distributed Joins with Bloomfilter using MapReduce
    Zhang, Changchun
    Wu, Lei
    Li, Jing
    [J]. INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2013, 6 (03): : 43 - 57
  • [30] Distributed XML Twig Query Processing Using MapReduce
    Bi, Xin
    Wang, Guoren
    Zhao, Xiangguo
    Zhang, Zhen
    Chen, Shuang
    [J]. WEB TECHNOLOGIES AND APPLICATIONS (APWEB 2015), 2015, 9313 : 203 - 214