Hierarchical Auto-scaling Policies for Data Stream Processing on Heterogeneous Resources

被引:1
|
作者
Russo, Gabriele Russo [1 ]
Cardellini, Valeria [1 ]
Lo Presti, Francesco [1 ]
机构
[1] Univ Roma Tor Vergata, Dept Civil Engn & Comp Sci Engn, Via Politecn 1, I-00133 Rome, Italy
关键词
Auto-scaling; Data Stream Processing; resource management; reinforcement learning; MODEL;
D O I
10.1145/3597435
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data Stream Processing (DSP) applications analyze data flows in near real-time by means of operators, which process and transform incoming data. Operators handle high data rates running parallel replicas across multiple processors and hosts. To guarantee consistent performance without wasting resources in the face of variable workloads, auto-scaling techniques have been studied to adapt operator parallelism at run-time. However, most of the effort has been spent under the assumption of homogeneous computing infrastructures, neglecting the complexity of modern environments. We consider the problem of deciding both how many operator replicas should be executed and which types of computing nodes should be acquired. We devise heterogeneity-aware policies by means of a two-layered hierarchy of controllers. While application-level components steer the adaptation process for whole applications, aiming to guarantee user-specified requirements, lower-layer components control auto-scaling of single operators. We tackle the fundamental challenge of performance andworkload uncertainty, exploiting Bayesian optimization (BO) and reinforcement learning (RL) to devise policies. The evaluation shows that our approach is able to meet users' requirements in terms of response time and adaptation overhead, while minimizing the cost due to resource usage, outperforming state-of-the-art baselines. We also demonstrate how partial model information is exploited to reduce training time for learning-based controllers.
引用
收藏
页数:44
相关论文
共 50 条
  • [1] Auto-scaling Techniques for Elastic Data Stream Processing
    Heinze, Thomas
    Pappalardo, Valerio
    Jerzak, Zbigniew
    Fetzer, Christof
    2014 IEEE 30TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2014, : 296 - 302
  • [2] MEAD: Model-Based Vertical Auto-Scaling for Data Stream Processing
    Russo, Gabriele Russo
    Cardellini, Valeria
    Casale, Giuliano
    Lo Presti, Francesco
    21ST IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2021), 2021, : 314 - 323
  • [3] Auto-scaling Compute and Network Resources in a Data-center
    Biswas, Anshuman
    Nandy, Biswajit
    Majumdar, Shikharesh
    El-haraki, Ali
    2020 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2020, : 696 - 703
  • [4] Categorization of Intercloud users and auto-scaling of resources
    Tamanna Jena
    J. R. Mohanty
    Suresh Chandra Satapathy
    Evolutionary Intelligence, 2021, 14 : 369 - 379
  • [5] Categorization of Intercloud users and auto-scaling of resources
    Jena, Tamanna
    Mohanty, J. R.
    Satapathy, Suresh Chandra
    EVOLUTIONARY INTELLIGENCE, 2021, 14 (02) : 369 - 379
  • [6] An Auto-Scaling Framework for Heterogeneous Hadoop Systems
    Bibal, J. V. Benifa
    Dejey, D.
    INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 2017, 26 (04)
  • [7] Model-based Stream Processing Auto-scaling in Geo-Distributed Environments
    Arkian, HamidReza
    Pierre, Guillaume
    Tordsson, Johan
    Elmroth, Erik
    30TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN 2021), 2021,
  • [8] A Cost-Efficient Auto-Scaling Algorithm for Large-Scale Graph Processing in Cloud Environments with Heterogeneous Resources
    Heidari, Safiollah
    Buyya, Rajkumar
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2021, 47 (08) : 1729 - 1741
  • [9] An Auto-scaling Framework for Controlling Enterprise Resources on Clouds
    Biswas, Anshuman
    Majumdar, Shikharesh
    Nandy, Biswajit
    El-Haraki, Ali
    2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 971 - 980
  • [10] DRS: Auto-Scaling for Real-Time Stream Analytics
    Fu, Tom Z. J.
    Ding, Jianbing
    Ma, Richard T. B.
    Winslett, Marianne
    Yang, Yin
    Zhang, Zhenjie
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2017, 25 (06) : 3338 - 3352