Infra: SLO Aware Elastic Auto Scaling in the Cloud for Cost Reduction

Cited by: 1
Authors
Sidhanta, Subhajit [1]
Mukhopadhyay, Supratik [1]
Affiliations
[1] Louisiana State Univ, Baton Rouge, LA 70803 USA
Source
2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016 | 2016
DOI
10.1109/BigDataCongress.2016.25
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Enterprises often host applications and services on clusters of virtual machine instances provided by cloud service providers such as Amazon, Rackspace, and Microsoft. Users pay a cloud usage cost based on the hourly usage [1] of the virtual machine instances composing the cluster. A cluster composition is the number of virtual machine instances of each type (from a predefined list of types) that make up a cluster. We present Infra, a cloud provisioning framework that predicts an (epsilon, delta)-minimum cluster composition required to run a given application workload on a cloud under an SLO (Service Level Objective) deadline. This paper does not present a new approximation algorithm; instead, we provide a tool that applies existing machine learning techniques to predict an (epsilon, delta)-minimum cluster composition. An (epsilon, delta)-minimum cluster composition is a cluster composition whose cost approximates that of the minimum cluster composition (i.e., the composition that incurs the minimum cloud usage cost needed to execute a given application under an SLO deadline); the approximation error is bounded by a predefined threshold epsilon with a degree of confidence of 100 * (1 - delta)%. That is, the probability of failing to achieve the error threshold epsilon is at most delta. For epsilon = 0.1 and delta = 0.02, we experimentally demonstrate that the (epsilon, delta)-minimum cluster composition predicted by Infra successfully approximates the minimum cluster composition: the accuracy of predicting the minimum cluster composition ranges from 93.1% to 97.99% (the error is bounded by the error threshold of 0.1) with a 98% degree of confidence, since 100 * (1 - delta) = 98%.
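One way to read the (epsilon, delta) guarantee described above is as a probabilistic bound on the cost error. The formula below is only a sketch: the symbols C (cloud usage cost of a composition), \hat{c} (the predicted composition), and c^{*} (the minimum composition) are introduced here for illustration, and the use of relative cost error is an assumption, since the abstract does not say how the error is measured.

```latex
% Assumed notation: C(.) = cloud usage cost, \hat{c} = predicted composition,
% c^{*} = minimum cluster composition. Relative error is an assumption.
\[
  \Pr\!\left[ \frac{\lvert C(\hat{c}) - C(c^{*}) \rvert}{C(c^{*})} > \epsilon \right] \le \delta
\]
% Instantiated with the values used in the experiments:
\[
  \epsilon = 0.1,\ \delta = 0.02
  \quad\Longrightarrow\quad
  \Pr\!\left[ \frac{\lvert C(\hat{c}) - C(c^{*}) \rvert}{C(c^{*})} \le 0.1 \right] \ge 0.98
\]
```

Under the further assumption that the reported accuracy equals one minus this error, the accuracy range of 93.1% to 97.99% corresponds to errors between 0.0201 and 0.069, both below the threshold of 0.1.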
Auto scaling is the process of automatically adding cloud instances to a cluster to adapt to an increase in application workload (increased request rate) and removing instances from the cluster when the workload decreases (reduced request rate). However, state-of-the-art auto scaling techniques have the following disadvantages: A) they require explicit policy definitions for changing the cluster configuration and therefore cannot automatically adapt a cluster to a changing workload; B) they do not compute the appropriate amount of resources required and therefore do not result in an "optimal" cluster composition. Infra provides an auto scaler that automatically adapts a cloud infrastructure to a changing application workload, scaling the cluster up or down based on predictions from the Infra provisioning tool.
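The ideas of a cluster composition, its hourly usage cost, and prediction-driven scaling can be illustrated with a minimal Python sketch. This is not Infra's implementation: the function names, the instance type names, the prices (in cents per hour), and the rule of billing every started hour in full are all assumptions made for illustration. The predicted composition is simply passed in as input; the prediction step itself is not modeled.

```python
from math import ceil


def hourly_cost(composition, hourly_price_cents):
    """Cost in cents of running the whole cluster for one hour.

    composition maps an instance type to the number of instances of that type.
    """
    return sum(count * hourly_price_cents[itype]
               for itype, count in composition.items())


def usage_cost(composition, hourly_price_cents, runtime_hours):
    """Total cost in cents, assuming every started hour is billed in full."""
    return hourly_cost(composition, hourly_price_cents) * ceil(runtime_hours)


def scaling_plan(current, predicted):
    """Per-type instance changes needed to move from current to predicted.

    Positive values mean instances to add, negative values mean instances
    to remove. Types whose count does not change are omitted.
    """
    plan = {}
    for itype in sorted(set(current) | set(predicted)):
        change = predicted.get(itype, 0) - current.get(itype, 0)
        if change != 0:
            plan[itype] = change
    return plan


# Hypothetical instance types and prices (cents per hour).
PRICES = {"small": 10, "large": 40}

current = {"small": 2, "large": 1}
predicted = {"small": 4}  # stand-in for a composition predicted by a model

plan = scaling_plan(current, predicted)
cost_before = usage_cost(current, PRICES, runtime_hours=1.5)
cost_after = usage_cost(predicted, PRICES, runtime_hours=1.5)
```

In this sketch the scaler needs no hand-written threshold policy: it only compares the running composition with the predicted one and applies the difference.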
Pages: 141-148
Number of pages: 8