Infra: SLO Aware Elastic Auto Scaling in the Cloud for Cost Reduction

被引:1
|
作者
Sidhanta, Subhajit [1 ]
Mukhopadhyay, Supratik [1 ]
机构
[1] Louisiana State Univ, Baton Rouge, LA 70803 USA
来源
2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016 | 2016年
关键词
D O I
10.1109/BigDataCongress.2016.25
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Enterprises often host applications and services on clusters of virtual machine instances provided by cloud service providers, like Amazon, Rackspace, Microsoft, etc. Users pay a cloud usage cost on the basis of the hourly usage [1] of virtual machine instances composing the cluster. A cluster composition refers to the number of virtual machine instances of each type (from a predefined list of types) comprising a cluster. We present Infra, a cloud provisioning framework that can predict an (epsilon, delta)-minimum cluster composition required to run a given application workload on a cloud under an SLO (i.e., Service Level Objective) deadline. This paper does not present a new approximation algorithm, instead we provide a tool that applies existing machine learning techniques to predict an (epsilon, delta)-minimum cluster composition. An (epsilon, delta)-minimum cluster composition specifies a cluster composition whose cost approximates that of the minimum cluster composition (i.e., the cluster composition that incurs the minimum cloud usage cost that must be incurred in executing a given application under an SLO deadline); the approximation bounds the error to a predefined threshold epsilon with a degree of confidence 100 * (1 - delta)%. The degree of confidence 100 * (1 - delta)% specifies that the probability of failure in achieving the error threshold epsilon for the above approximation is at most delta. For epsilon = 0.1 and delta = 0.02, we experimentally demonstrate that an (epsilon, delta)-minimum cluster composition predicted by Infra successfully approximates the minimum cluster composition, i.e., the accuracy of prediction of minimum cluster composition ranges from 93.1% to 97.99% (the error is bound by the error threshold of 0.1) with a 98% degree of confidence, since 100* (1 - delta) = 98%. Auto scaling refers to the process of automatically adding cloud instances to a cluster to adapt to an increase in application workload (increased request rate), and deleting instances from a cluster when there is a decrease in workload (reduced request rate). However, state-of-the-art auto scaling techniques have the following disadvantages: A) they require explicit policy definition for changing the cluster configuration and therefore lack the ability to automatically adapt a cluster with respect to changing workload, B) they do not compute the appropriate size of resources required, and therefore do not result in an "optimal" cluster composition. Infra provides an auto scaler that automatically adapts a cloud infrastructure to changing application workload, scaling the cluster up/down based on predictions from the Infra provisioning tool.
引用
收藏
页码:141 / 148
页数:8
相关论文
共 50 条
  • [21] Elastic Provisioning of Cloud Caches: A Cost-Aware TTL Approach
    Carra, Damiano
    Neglia, Giovanni
    Michiardi, Pietro
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2020, 28 (03) : 1283 - 1296
  • [22] Cost-Aware Multidimensional Auto-Scaling of Service- and Cloud-Based Dynamic Routing to Prevent System Overload
    Amiri, Amirali
    Zdun, Uwe
    van Hoorn, Andre
    Dustdar, Schahram
    2022 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (IEEE ICWS 2022), 2022, : 379 - 384
  • [23] Elastic Provisioning of Cloud Caches: a Cost-aware TTL Approach
    Carra, Damiano
    Neglia, Giovanni
    Michiardi, Pietro
    PROCEEDINGS OF THE 2018 ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '18), 2018, : 526 - 526
  • [24] MArk: Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving
    Zhang, Chengliang
    Yu, Minchen
    Wang, Wei
    Yan, Feng
    PROCEEDINGS OF THE 2019 USENIX ANNUAL TECHNICAL CONFERENCE, 2019, : 1049 - 1062
  • [25] Cost-Aware and SLO-Fulfilling Software as a Service
    Niehoerster, Oliver
    Brinkmann, Andre
    Keller, Axel
    Kleineweber, Christoph
    Krueger, Jens
    Simon, Jens
    JOURNAL OF GRID COMPUTING, 2012, 10 (03) : 553 - 577
  • [26] REPO: A Microservices Elastic Management System for Cost Reduction in the Cloud
    MaQalhaes, Alex
    Rech, Luciana
    Moraes, Ricardo
    Vasques, Francisco
    2018 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2018, : 333 - 338
  • [27] Cost-efficient auto-scaling of container-based elastic processes
    Sheganaku, Gerta
    Schulte, Stefan
    Waibel, Philipp
    Weber, Ingo
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 138 : 296 - 312
  • [28] Cost-Aware and SLO-Fulfilling Software as a Service
    Oliver Niehörster
    André Brinkmann
    Axel Keller
    Christoph Kleineweber
    Jens Krüger
    Jens Simon
    Journal of Grid Computing, 2012, 10 : 553 - 577
  • [29] Reinforcement Learning-Based Auto-scaling Algorithm for Elastic Cloud Workflow Service
    Lu, Jian-bin
    Yu, Yang
    Pan, Mao-lin
    PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PDCAT 2021, 2022, 13148 : 303 - 310
  • [30] A cost-aware auto-scaling approach using the workload prediction in service clouds
    Jingqi Yang
    Chuanchang Liu
    Yanlei Shang
    Bo Cheng
    Zexiang Mao
    Chunhong Liu
    Lisha Niu
    Junliang Chen
    Information Systems Frontiers, 2014, 16 : 7 - 18