SLA-Based Adaptation Schemes in Distributed Stream Processing Engines

被引：3

作者：

Hanif, Muhammad ^{[1
]}

Kim, Eunsam ^{[2
]}

Helal, Sumi ^{[3
]}

Lee, Choonhwa ^{[1
]}

机构：

[1] Hanyang Univ, Div Comp Sci & Engn, Seoul 133791, South Korea

[2] Hongik Univ, Dept Comp Engn, Seoul 121791, South Korea

[3] Univ Lancaster, Sch Comp & Commun, Lancaster, England

来源：

APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 06期

基金：

新加坡国家研究基金会;

关键词：

big data; distributed computing; modern stream processing engine; SLA; watermarking; cloud computing; MODEL;

D O I：

10.3390/app9061045

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

With the upswing in the volume of data, information online, and magnanimous cloud applications, big data analytics becomes mainstream in the research communities in the industry as well as in the scholarly world. This prompted the emergence and development of real-time distributed stream processing frameworks, such as Flink, Storm, Spark, and Samza. These frameworks endorse complex queries on streaming data to be distributed across multiple worker nodes in a cluster. Few of these stream processing frameworks provides fundamental support for controlling the latency and throughput of the system as well as the correctness of the results. However, none has the ability to handle them on the fly at runtime. We present a well-informed and efficient adaptive watermarking and dynamic buffering timeout mechanism for the distributed streaming frameworks. It is designed to increase the overall throughput of the system by making the watermarks adaptive towards the stream of incoming workload, and scale the buffering timeout dynamically for each task tracker on the fly while maintaining the Service Level Agreement (SLA)-based end-to-end latency of the system. This work focuses on tuning the parameters of the system (such as window correctness, buffering timeout, and so on) based on the prediction of incoming workloads and assesses whether a given workload will breach an SLA using output metrics including latency, throughput, and correctness of both intermediate and final results. We used Apache Flink as our testbed distributed processing engine for this work. However, the proposed mechanism can be applied to other streaming frameworks as well. Our results on the testbed model indicate that the proposed system outperforms the status quo of stream processing. With the inclusion of learning models like naive Bayes, multilayer perceptron (MLP), and sequential minimal optimization (SMO)., the system shows more progress in terms of keeping the SLA intact as well as quality of service (QoS).

引用

页数：21

共 50 条

[31] Mapping heavy communication SLA-based workflows onto Grid resources with parallel processing technology
Quan, Dang Minh
Altmann, Joern
Yang, Laurence T.
HPCC 2008: 10TH IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, PROCEEDINGS, 2008, : 274 - +
[32] Mapping light communication SLA-based workflows onto Grid resources with parallel processing technology\
Quan, Dang Minh
Altmann, Joern
Yang, Laurence T.
PROCEEDINGS OF THE 2008 INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS, 2008, : 191 - +
[33] Towards SLA-Based optimal workload distribution in SANs
Gencay, Eray
Sinz, Carsten
Kuechlin, Wolfgang
2008 IEEE NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, VOLS 1 AND 2, 2008, : 755 - 758
[34] A Brief Review on SLA-Based Approaches to the Teaching of Preposition “on”
王小平
海外英语, 2010, (06) : 283 - 285+288
[35] Theodolite: Scalability Benchmarking of Distributed Stream Processing Engines in Microservice Architectures
Henning, Soeren
Hasselbring, Wilhelm
BIG DATA RESEARCH, 2021, 25
[36] Risk Assessment in SLA-Based WDM Backbone Networks
Xia, Ming
Choi, Joon Ho
Wang, Ting
OFC: 2009 CONFERENCE ON OPTICAL FIBER COMMUNICATION, VOLS 1-5, 2009, : 1680 - +
[37] A framework for dynamic SLA-based QoS control for UMTS
Chakravorty, R
Pratt, I
Crowcroft, J
D'Arienzo, M
IEEE WIRELESS COMMUNICATIONS, 2003, 10 (05) : 30 - 37
[38] Dynamic Optimization of SLA-Based Services Scaling Rules
Antonescu, Alexandru-Florian
Oprescu, Ana-Maria
Demchenko, Yuri
de Laat, Cees
Braun, Torsten
2013 IEEE FIFTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), VOL 1, 2013, : 282 - 289
[39] Query-Centric Failure Recovery for Distributed Stream Processing Engines
Su, Li
Zhou, Yongluan
2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1276 - 1279
[40] Research on the SLA-based service modeling and parameter mapping
Chen, Gang
Zhou, Wen-An
Song, Jun-De
Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2007, 30 (04): : 28 - 32

← 1 2 3 4 5 →