Big Stream Processing Systems: An Experimental Evaluation

被引:9
|
作者
Shahverdi, Elkhan [1 ]
Awad, Ahmed [1 ]
Sakr, Sherif [1 ]
机构
[1] Univ Taru, Tartu, Estonia
关键词
D O I
10.1109/ICDEW.2019.00-35
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As the world gets more instrumented and connected, we are witnessing a flood of digital data generated from various hardware (e.g., sensors) or software in the format of flowing streams of data. Real-time processing for such massive amounts of streaming data is a crucial requirement in several application domains including financial markets, surveillance systems, manufacturing, smart cities, and scalable monitoring infrastructure. In the last few years, several big stream processing engines have been introduced to tackle this challenge. In this article, we present an extensive experimental study of five popular systems in this domain, namely, Apache Storm, Apache Rink, Apache Spark, Kafka Streams and Hazelcast Jet. We report and analyze the performance characteristics of these systems. In addition, we report a set of insights and important lessons that we have learned from conducting our experiments.
引用
下载
收藏
页码:53 / 60
页数:8
相关论文
共 50 条
  • [1] Big SQL systems: an experimental evaluation
    Victor Aluko
    Sherif Sakr
    Cluster Computing, 2019, 22 : 1347 - 1377
  • [2] Big SQL systems: an experimental evaluation
    Aluko, Victor
    Sakr, Sherif
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (04): : 1347 - 1377
  • [3] Big data processing tools: An experimental performance evaluation
    Rodrigues, Mario
    Santos, Maribel Yasmina
    Bernardino, Jorge
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 9 (02)
  • [4] Evaluation and Development Perspectives of Stream Data Processing Systems
    Gorawski, Marcin
    Gorawska, Anna
    Pasterak, Krzysztof
    COMPUTER NETWORKS, CN 2013, 2013, 370 : 300 - 311
  • [5] Dagstuhl Seminar on Big Stream Processing
    Sakr, Sherif
    Rabl, Tilmann
    Hirzel, Martin
    Carbone, Paris
    Strohbach, Martin
    SIGMOD RECORD, 2018, 47 (03) : 36 - 39
  • [6] Dagstuhl seminar on big stream processing
    Sakr S.
    Rabl T.
    Hirzel M.
    Carbone P.
    Strohbach M.
    2018, Association for Computing Machinery, 2 Penn Plaza, Suite 701, New York, NY 10121-0701, United States (47) : 36 - 39
  • [7] An Efficient Approach for Storage of Big Data Streams in Distributed Stream Processing Systems
    Alshamrani, Sultan
    Waseem, Quadri
    Alharbi, Abdullah
    Alosaimi, Wael
    Turabieh, Hamza
    Alyami, Hashem
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 91 - 98
  • [8] Evaluation of Data Enrichment Methods for Distributed Stream Processing Systems
    Scheinert, Dominik
    Casares, Fabian
    Geldenhuys, Morgan K.
    Styp-Rekowski, Kevin
    Kao, Odej
    2023 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING, IC2E, 2023, : 202 - 211
  • [9] An Evaluation of Data Stream Processing Systems for Data Driven Applications
    Samosir, Jonathan
    Indrawan-Santiago, Maria
    Haghighi, Pari Delir
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 : 439 - 449
  • [10] Stream Processing Languages in the Big Data Era
    Hirzel, Martin
    Baudart, Guillaume
    Bonifati, Angela
    Della Valle, Emanuele
    Sakr, Sherif
    Vlachou, Akrivi
    SIGMOD RECORD, 2018, 47 (02) : 29 - 40