Elastic Scaling of Stateful Operators Over Fluctuating Data Streams

被引:0
|
作者
Wu, Minghui [1 ]
Sun, Dawei [1 ]
Gao, Shang [2 ]
Li, Keqin [3 ]
Buyya, Rajkumar [4 ]
机构
[1] China Univ Geosci, Sch Informat Engn, Beijing 100083, Peoples R China
[2] Deakin Univ, Sch Informat Technol, Waurn Ponds, Vic 3216, Australia
[3] SUNY Coll New Paltz, Dept Comp Sci, New Paltz, NY 12561 USA
[4] Univ Melbourne, Sch Comp & Informat Syst, Cloud Comp & Distributed Syst CLOUDS Lab, Melbourne, Vic 3010, Australia
基金
中国国家自然科学基金;
关键词
Streams; Parallel processing; Topology; Resource management; Data models; Computational modeling; System performance; Distributed stream computing; operator parallelism; resource scaling; state management; stateful operator;
D O I
10.1109/TSC.2024.3436596
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Elastic scaling of parallel operators has emerged as a powerful approach to reduce response time in stream applications with fluctuating inputs. Many state-of-the-art works focus on stateless operators and change the operator parallelism from one aspect. They often lack efficient management of operator states and overlook the costs associated with resource over-provisioning. To overcome these limitations, we introduce Es-Stream for elastic scaling of stateful operators over fluctuating data streams, which includes: 1) We observe that under-provisioning of operator parallelism leads to data pile-up, resulting in longer system latency, while over-provisioning of operator parallelism causes idle instances and additional resource consumption. 2) The Es-Stream system scales in two dimensions: the parallelism of operators and the number of resources. It dynamically adjusts operators to an optimal parallelism while scaling the resources used by the stream application. 3) When the parallelism of stateful operators changes, upstream operators backup downstream operators' state and cache the emitted data tuples at dynamic time intervals, ensuring the operator parallelism is adjusted in a low-overhead way. 4) Experimental results demonstrate that Es-Stream provides promising performance improvements, reducing the maximum system latency by 3x and saving the maximum state recovery time by 2x, compared to existing state-of-the-art works.
引用
收藏
页码:3555 / 3568
页数:14
相关论文
共 50 条
  • [1] Stateful Adaptive Streams with Approximate Computing and Elastic Scaling
    Francisco, Joao
    Coimbra, Miguel E.
    Neto, Pedro F.
    Freitag, Felix
    Veiga, Luis
    38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 174 - 183
  • [2] Elastic Scaling of Stateful Network Functions
    Woo, Shinae
    Sherry, Justine
    Han, Sangjin
    Moon, Sue
    Ratnasamy, Sylvia
    Shenker, Scott
    PROCEEDINGS OF THE 15TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION (NSDI'18), 2018, : 299 - 312
  • [3] Parallel Patterns for Window-Based Stateful Operators on Data Streams: An Algorithmic Skeleton Approach
    De Matteis, Tiziano
    Mencagli, Gabriele
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 45 (02) : 382 - 401
  • [4] Parallel Patterns for Window-Based Stateful Operators on Data Streams: An Algorithmic Skeleton Approach
    Tiziano De Matteis
    Gabriele Mencagli
    International Journal of Parallel Programming, 2017, 45 : 382 - 401
  • [5] Elastic Scaling of Data Parallel Operators in Stream Processing
    Schneider, Scott
    Andrade, Henrique
    Gedik, Bugra
    Biem, Alain
    Wu, Kun-Lung
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 603 - +
  • [6] Frequency operators for condensative queries over data streams
    Ma, LS
    Nutt, W
    ICEBE 2005: IEEE INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING, PROCEEDINGS, 2005, : 518 - 525
  • [7] Elastic Scaling for Distributed Latency-sensitive Data Stream Operators
    De Matteis, Tiziano
    Mencagli, Gabriele
    2017 25TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2017), 2017, : 61 - 68
  • [8] Semantics of data streams and operators
    Maier, D
    Li, J
    Tucker, P
    Tufte, K
    Papadimos, V
    DATABASE THEORY - ICDT 2005, PROCEEDINGS, 2005, 3363 : 37 - 52
  • [9] Efficient State Management for Scaling Out Stateful Operators in Stream Processing Systems
    Mudassar, Muhammad
    Zhai, Yanlong
    Liao, Lejian
    BIG DATA, 2019, 7 (03) : 192 - 206
  • [10] A Round-Robin Bloom Filter for Stateful Control over Event Streams
    Szabo, Robert
    2013 IEEE 4TH INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2013, : 497 - 501