Evaluation of Load Prediction Techniques for Distributed Stream Processing

被引:4
|
作者
Gontarska, Kordian [1 ,2 ]
Geldenhuys, Morgan [2 ]
Scheinert, Dominik [2 ]
Wiesner, Philipp [2 ]
Polze, Andreas [1 ]
Thamsen, Lauritz [2 ]
机构
[1] Univ Potsdam, Hasso Plattner Inst, Potsdam, Germany
[2] Tech Univ Berlin, Berlin, Germany
关键词
Distributed Stream Processing; Resource Management and Optimization; Load Prediction; Time Series Forecasting; Machine Learning;
D O I
10.1109/IC2E52221.2021.00023
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Distributed Stream Processing (DSP) systems enable processing large streams of continuous data to produce results in near to real time. They are an essential part of many data-intensive applications and analytics platforms. The rate at which events arrive at DSP systems can vary considerably over time, which may be due to trends, cyclic, and seasonal patterns within the data streams. A priori knowledge of incoming workloads enables proactive approaches to resource management and optimization tasks such as dynamic scaling, live migration of resources, and the tuning of configuration parameters during run-times, thus leading to a potentially better Quality of Service. In this paper we conduct a comprehensive evaluation of different load prediction techniques for DSP jobs. We identify three use-cases and formulate requirements for making load predictions specific to DSP jobs. Automatically optimized classical and Deep Learning methods are being evaluated on nine different datasets from typical DSP domains, i.e. the IoT, Web 2.0, and cluster monitoring. We compare model performance with respect to overall accuracy and training duration. Our results show that the Deep Learning methods provide the most accurate load predictions for the majority of the evaluated datasets.
引用
收藏
页码:91 / 98
页数:8
相关论文
共 50 条
  • [1] Load balancing techniques for distributed stream processing applications in overlay environments
    Drougas, Yannis
    Repantis, Thomas
    Kalogeraki, Vana
    NINTH IEEE INTERNATIONAL SYMPOSIUM ON OBJECT AND COMPONENT-ORIENTED REAL-TIME DISTRIBUTED COMPUTING, PROCEEDINGS, 2006, : 33 - 40
  • [2] Load distribution for distributed stream processing
    Xing, Y
    CURRENT TRENDS IN DATABASE TECHNOLOGY - EDBT 2004 WORKSHOPS, PROCEEDINGS, 2004, 3268 : 112 - 120
  • [3] Load distribution for distributed stream processing
    Xing, Ying
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2004, 3268 : 112 - 120
  • [4] A Prediction Framework for Distributed Data Stream Processing
    He ZhiYong
    Du RongHua
    PROCEEDINGS OF THE 2009 PACIFIC-ASIA CONFERENCE ON CIRCUITS, COMMUNICATIONS AND SYSTEM, 2009, : 179 - 183
  • [5] Load Adaptive Distributed Stream Processing System for Explosive Stream Data
    Lee, Myungcheol
    Lee, Miyoung
    Hur, Sung Jin
    Kim, Ikkyun
    2015 17TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT), 2015, : 753 - 757
  • [6] Optimization of Load Adaptive Distributed Stream Processing Services
    Wu, Xing
    Liu, Yan
    2014 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (SCC 2014), 2014, : 504 - 511
  • [7] Load shedding and distributed resource control of stream processing networks
    Feng, Hanhua
    Liu, Zhen
    Xia, Cathy H.
    Zhang, Li
    PERFORMANCE EVALUATION, 2007, 64 (9-12) : 1102 - 1120
  • [8] A Dynamic Load Balancing Technique of Distributed Stream Processing System
    Ouyang Lin
    Guo Qing-ping
    2008 SECOND INTERNATIONAL CONFERENCE ON FUTURE GENERATION COMMUNICATION AND NETWORKING SYMPOSIA, VOLS 1-5, PROCEEDINGS, 2008, : 555 - +
  • [9] Enabling A Load Adaptive Distributed Stream Processing Platform on Synchronized Clusters
    Wu, Xing
    Liu, Yan
    2014 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E), 2014, : 627 - 630
  • [10] Adaptive component composition and load balancing for distributed stream processing applications
    Repantis, Thomas
    Drougas, Yannis
    Kalogeraki, Vana
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2009, 2 (01) : 60 - 74