Evaluation of Data Enrichment Methods for Distributed Stream Processing Systems

Cited by: 0
Authors
Scheinert, Dominik [1]
Casares, Fabian [1]
Geldenhuys, Morgan K. [1]
Styp-Rekowski, Kevin [1]
Kao, Odej [1]
Affiliations
[1] Tech Univ Berlin, Berlin, Germany
Keywords
Distributed Stream Processing; Data Enrichment; Data Analysis; Resource Management; Cloud Computing
DOI
10.1109/IC2E59103.2023.00030
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Stream processing has become a critical component in the architecture of modern applications. With the exponential growth of data generated by sources such as the Internet of Things, business intelligence, and telecommunications, real-time processing of unbounded data streams has become a necessity. Distributed stream processing (DSP) systems address this challenge by offering high horizontal scalability, fault-tolerant execution, and the ability to process data streams from multiple sources in a single DSP job. Often, however, data streams must be enriched with additional information before they can be processed correctly, which introduces further dependencies and potential bottlenecks. In this paper, we present an in-depth evaluation of data enrichment methods for DSP systems and identify the different use cases for stream processing in modern systems. Using a representative DSP system and conducting the evaluation in a realistic cloud environment, we found that outsourcing enrichment data to the DSP system can improve performance for specific use cases. However, the increased resource consumption that accompanies this approach highlights the need for stream processing solutions specifically designed for the performance-intensive workloads of cloud-based applications.
Pages: 202 - 211
Number of pages: 10
Related papers
50 in total
  • [21] Distributed Multilevel Secure Data Stream Processing
    Xie, Xing
    Ray, Indrakshi
    Ranasinghe, Waruna
    Gilbert, Philips A.
    Shashidhara, Pramod
    Yadav, Anoop
    2013 33RD IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS (ICDCSW 2013), 2013, : 368 - 373
  • [22] A Prediction Framework for Distributed Data Stream Processing
    He ZhiYong
    Du RongHua
    PROCEEDINGS OF THE 2009 PACIFIC-ASIA CONFERENCE ON CIRCUITS, COMMUNICATIONS AND SYSTEM, 2009, : 179 - 183
  • [23] Distributed resource allocation for stream data processing
    Tang, Ao
    Liu, Zhen
    Xia, Cathy
    Zhang, Li
    HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, PROCEEDINGS, 2006, 4208 : 91 - 100
  • [24] Accommodating Bursts in Distributed Stream Processing Systems
    Drougas, Yannis
    Kalogeraki, Vana
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 362 - 372
  • [25] Rethinking the design of distributed stream processing systems
    Zhou, Yongluan
    Aberer, Karl
    Salehi, Ali
    Tan, Kian-Lee
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, VOLS 1 AND 2, 2008, : 182 - +
  • [26] Distributed resource allocation in stream processing systems
    Xia, Cathy H.
    Broberg, James A.
    Liu, Zhen
    Zhang, Li
    Distributed Computing, Proceedings, 2006, 4167 : 489 - 504
  • [27] Processing Partially Ordered Requests in Distributed Stream Processing Systems
    Cai, Rijun
    Wu, Weigang
    Huang, Ning
    Wu, Lihui
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2016, 2016, 10048 : 211 - 219
  • [28] Quantitative Impact Evaluation of an Abstraction Layer for Data Stream Processing Systems
    Hesse, Guenter
    Matthies, Christoph
    Glass, Kelvin
    Huegle, Johannes
    Uflacker, Matthias
    2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 1381 - 1392
  • [29] TDAG: A Tunable Distributed Data Processing Model for Data Stream
    Tang, Jintao
    Lin, Xuelian
    Shen, Yang
    Wo, Tianyu
    2017 15TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS AND 2017 16TH IEEE INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS (ISPA/IUCC 2017), 2017, : 433 - 437
  • [30] Load Adaptive Distributed Stream Processing System for Explosive Stream Data
    Lee, Myungcheol
    Lee, Miyoung
    Hur, Sung Jin
    Kim, Ikkyun
    2015 17TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT), 2015, : 753 - 757