Benchmarking Streaming Computation Engines: Storm, Flink and Spark Streaming

被引:187
|
作者
Chintapalli, Sanket [1 ]
Dagit, Derek [1 ]
Evans, Bobby [1 ]
Farivar, Reza [1 ]
Graves, Thomas [1 ]
Holderbaugh, Mark [1 ]
Liu, Zhuo [1 ]
Nusbaum, Kyle [1 ]
Patil, Kishorkumar [1 ]
Peng, Boyang Jerry [1 ]
Poulosky, Paul [1 ]
机构
[1] Yahoo Inc, Sunnyvale, CA 94089 USA
关键词
Streaming processing; Benchmark; Storm; Spark; Flink; Low Latency;
D O I
10.1109/IPDPSW.2016.138
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Streaming data processing has been gaining attention due to its application into a wide range of scenarios. To serve the booming demands of streaming data processing, many computation engines have been developed. However, there is still a lack of real-world benchmarks that would be helpful when choosing the most appropriate platform for serving real-time streaming needs. In order to address this problem, we developed a streaming benchmark for three representative computation engines: Flink, Storm and Spark Streaming. Instead of testing speed-of-light event processing, we construct a full data pipeline using Kafka and Redis in order to more closely mimic the real-world production scenarios. Based on our experiments, we provide a performance comparison of the three data engines in terms of 99th percentile latency and throughput for various configurations.
引用
收藏
页码:1789 / 1792
页数:4
相关论文
共 50 条
  • [21] Computation of Streaming Current in Oil Pipes
    Wang, Jufen
    Meng, Haolong
    IEEE TRANSACTIONS ON DIELECTRICS AND ELECTRICAL INSULATION, 2009, 16 (02) : 299 - 304
  • [22] Streaming Complexity of Spanning Tree Computation
    Chang, Yi-Jun
    Farach-Colton, Martin
    Hsu, Tsan-Sheng
    Tsai, Meng-Tsung
    37TH INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE (STACS 2020), 2020, 154
  • [23] Acoustic streaming measurements in annular thermoacoustic engines
    Job, Stéphane
    Gusev, Vitalyi
    Lotton, Pierrick
    Bruneau, Michel
    Job, S. (stephane.job@univ-lemans.fr), 1892, Acoustical Society of America (113):
  • [24] Acoustic streaming measurements in annular thermoacoustic engines
    Job, S
    Gusev, V
    Lotton, P
    Bruneau, M
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2003, 113 (04): : 1892 - 1899
  • [25] Twitter Heron: Towards Extensible Streaming Engines
    Fu, Maosong
    Agrawal, Ashvin
    Floratou, Avrilia
    Graham, Bill
    Jorgensen, Andrew
    Li, Runhang
    Lu, Neng
    Ramasamy, Karthik
    Rao, Sriram
    Wang, Cong
    2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 1165 - 1172
  • [26] Streaming Linear Regression on Spark MLlib and MOA
    Akgun, Baris
    Oguducu, Sule Gunduz
    PROCEEDINGS OF THE 2015 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2015), 2015, : 1244 - 1247
  • [27] An Enforcement of Real Time Scheduling in Spark Streaming
    Liao, Xinyi
    Gao, Zhiwei
    Ji, Weixing
    Wang, Yizhuo
    2015 SIXTH INTERNATIONAL GREEN COMPUTING CONFERENCE AND SUSTAINABLE COMPUTING CONFERENCE (IGSC), 2015,
  • [28] Spark Streaming动态资源分配策略
    刘备
    谭新明
    曹文彬
    计算机应用, 2017, 37 (06) : 1574 - 1579
  • [29] StreamDM: Advanced Data Mining in Spark Streaming
    Bifet, Albert
    Maniu, Silviu
    Qian, Jianfeng
    Tian, Guangjian
    He, Cheng
    Fan, Wei
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2015, : 1608 - 1611
  • [30] Adaptive Scheduling of Parallel Jobs in Spark Streaming
    Cheng, Dazhao
    Chen, Yuan
    Zhou, Xiaobo
    Gmach, Daniel
    Milojicic, Dejan
    IEEE INFOCOM 2017 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2017,