Infra: SLO Aware Elastic Auto Scaling in the Cloud for Cost Reduction

被引:1
|
作者
Sidhanta, Subhajit [1 ]
Mukhopadhyay, Supratik [1 ]
机构
[1] Louisiana State Univ, Baton Rouge, LA 70803 USA
关键词
D O I
10.1109/BigDataCongress.2016.25
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Enterprises often host applications and services on clusters of virtual machine instances provided by cloud service providers, like Amazon, Rackspace, Microsoft, etc. Users pay a cloud usage cost on the basis of the hourly usage [1] of virtual machine instances composing the cluster. A cluster composition refers to the number of virtual machine instances of each type (from a predefined list of types) comprising a cluster. We present Infra, a cloud provisioning framework that can predict an (epsilon, delta)-minimum cluster composition required to run a given application workload on a cloud under an SLO (i.e., Service Level Objective) deadline. This paper does not present a new approximation algorithm, instead we provide a tool that applies existing machine learning techniques to predict an (epsilon, delta)-minimum cluster composition. An (epsilon, delta)-minimum cluster composition specifies a cluster composition whose cost approximates that of the minimum cluster composition (i.e., the cluster composition that incurs the minimum cloud usage cost that must be incurred in executing a given application under an SLO deadline); the approximation bounds the error to a predefined threshold epsilon with a degree of confidence 100 * (1 - delta)%. The degree of confidence 100 * (1 - delta)% specifies that the probability of failure in achieving the error threshold epsilon for the above approximation is at most delta. For epsilon = 0.1 and delta = 0.02, we experimentally demonstrate that an (epsilon, delta)-minimum cluster composition predicted by Infra successfully approximates the minimum cluster composition, i.e., the accuracy of prediction of minimum cluster composition ranges from 93.1% to 97.99% (the error is bound by the error threshold of 0.1) with a 98% degree of confidence, since 100* (1 - delta) = 98%. Auto scaling refers to the process of automatically adding cloud instances to a cluster to adapt to an increase in application workload (increased request rate), and deleting instances from a cluster when there is a decrease in workload (reduced request rate). However, state-of-the-art auto scaling techniques have the following disadvantages: A) they require explicit policy definition for changing the cluster configuration and therefore lack the ability to automatically adapt a cluster with respect to changing workload, B) they do not compute the appropriate size of resources required, and therefore do not result in an "optimal" cluster composition. Infra provides an auto scaler that automatically adapts a cloud infrastructure to changing application workload, scaling the cluster up/down based on predictions from the Infra provisioning tool.
引用
收藏
页码:141 / 148
页数:8
相关论文
共 50 条
  • [1] SLO-Power: SLO and Power-aware Elastic Scaling for Web Services
    Savasci, Mehmet
    Souza, Abel
    Li Wu
    Irwin, David
    Ali-Eldin, Ahmed
    Shenoy, Prashant
    2024 IEEE 24TH INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING, CCGRID 2024, 2024, : 136 - 147
  • [2] Interference aware prediction mechanism for auto scaling in cloud
    Babu, K. R. Remesh
    Samuel, Philip
    COMPUTERS & ELECTRICAL ENGINEERING, 2018, 69 : 351 - 363
  • [3] Elastic Auto-Scaling Architecture in Telco Cloud
    Cao, Dang Sao
    Nguyen, Dinh Tam
    Nguyen, Xuan Chinh
    Tran, Van Thuyet
    Nguyen, Hai Binh
    Lang, Khac Thuan
    Nguyen, Van Tuan
    Dao, Ngoc Lam
    Pham, Thanh Tu
    Cao, Ngoc Son
    Chu, Dinh Hung
    Nguyen, Phi Hung
    Pham, Cong Dan
    Nguyen, Duc Hai
    2023 25TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY, ICACT, 2023, : 401 - 406
  • [4] Spock: Exploiting Serverless Functions for SLO and Cost Aware Resource Procurement in Public Cloud
    Gunasekaran, Jashwant Raj
    Thinakaran, Prashanth
    Kandemir, Mahmut Taylan
    Urgaonkar, Bhuvan
    Kesidis, George
    Das, Chita R.
    2019 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2019), 2019, : 199 - 208
  • [5] A cost-AWARE approach based ON learning automata FOR resource auto-scaling IN cloud computing environment
    Mogoui, Khosro
    Arani, Mostafa Ghobaei
    International Journal of Hybrid Information Technology, 2015, 8 (07): : 389 - 398
  • [6] SIRM: Cost efficient and SLO aware ML prediction on Fog-Cloud Network
    Phalak, Chetan
    Chahal, Dheeraj
    Singhal, Rekha
    2023 15TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS, COMSNETS, 2023,
  • [7] Cost-Availability Aware Scaling: Towards Optimal Scaling of Cloud Services
    Bento, Andre
    Araujo, Filipe
    Barbosa, Raul
    JOURNAL OF GRID COMPUTING, 2023, 21 (04)
  • [8] Cost-Availability Aware Scaling: Towards Optimal Scaling of Cloud Services
    Andre Bento
    Filipe Araujo
    Raul Barbosa
    Journal of Grid Computing, 2023, 21
  • [9] ALVEC: Auto-scaling by Lotka Volterra elastic cloud: A QoS aware non linear dynamical allocation model
    Goswami , Bidisha
    Sarkar, Jyotirmoy
    Saha, Snehanshu
    Kar, Saibal
    Sarkar, Poulami
    SIMULATION MODELLING PRACTICE AND THEORY, 2019, 93 : 262 - 292
  • [10] Petri net model for resource scheduling with auto scaling in elastic cloud
    Remesh Babu K.R.
    Samuel P.
    Remesh Babu, K.R. (remeshbabu@yahoo.com), 1600, Inderscience Publishers, 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (22): : 462 - 477