Infra: SLO Aware Elastic Auto Scaling in the Cloud for Cost Reduction

Cited by: 1
Authors
Sidhanta, Subhajit [1]
Mukhopadhyay, Supratik [1]
Affiliations
[1] Louisiana State Univ, Baton Rouge, LA 70803 USA
Source
2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016 | 2016
DOI
10.1109/BigDataCongress.2016.25
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
Enterprises often host applications and services on clusters of virtual machine instances provided by cloud service providers such as Amazon, Rackspace, and Microsoft. Users pay a cloud usage cost on the basis of the hourly usage [1] of the virtual machine instances composing the cluster. A cluster composition refers to the number of virtual machine instances of each type (from a predefined list of types) comprising a cluster. We present Infra, a cloud provisioning framework that can predict an (epsilon, delta)-minimum cluster composition required to run a given application workload on a cloud under an SLO (i.e., Service Level Objective) deadline. This paper does not present a new approximation algorithm; instead, we provide a tool that applies existing machine learning techniques to predict an (epsilon, delta)-minimum cluster composition. An (epsilon, delta)-minimum cluster composition is a cluster composition whose cost approximates that of the minimum cluster composition (i.e., the cluster composition that incurs the minimum cloud usage cost needed to execute a given application under an SLO deadline); the approximation bounds the error to a predefined threshold epsilon with a degree of confidence of 100 * (1 - delta)%. This degree of confidence means that the probability of failing to achieve the error threshold epsilon is at most delta. For epsilon = 0.1 and delta = 0.02, we experimentally demonstrate that an (epsilon, delta)-minimum cluster composition predicted by Infra successfully approximates the minimum cluster composition, i.e., the accuracy of prediction of the minimum cluster composition ranges from 93.1% to 97.99% (the error is bounded by the error threshold of 0.1) with a 98% degree of confidence, since 100 * (1 - delta) = 98%.
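To make the (epsilon, delta) definition concrete, the following is a minimal Python sketch, not Infra's implementation. It assumes that the error is the relative difference between the cost of the predicted composition and the cost of the minimum composition (the abstract does not state the exact error metric), and it checks the guarantee empirically over a set of trials. The instance type names, the hourly prices, and all function names are hypothetical.

```python
from typing import Dict, Sequence

# Hypothetical hourly prices per instance type (illustrative values only).
HOURLY_PRICE: Dict[str, float] = {"small": 0.05, "medium": 0.10, "large": 0.20}


def composition_cost(composition: Dict[str, int], hours: int,
                     prices: Dict[str, float] = HOURLY_PRICE) -> float:
    """Cloud usage cost of a cluster composition billed per instance-hour."""
    return hours * sum(prices[t] * n for t, n in composition.items())


def relative_error(predicted_cost: float, minimum_cost: float) -> float:
    """Assumed error metric: relative deviation from the minimum cost."""
    return abs(predicted_cost - minimum_cost) / minimum_cost


def satisfies_epsilon_delta(errors: Sequence[float],
                            epsilon: float, delta: float) -> bool:
    """Empirical check: the fraction of trials whose error exceeds
    epsilon must be at most delta."""
    failures = sum(1 for e in errors if e > epsilon)
    return failures / len(errors) <= delta
```

Under this assumed metric, a prediction accuracy of 93.1% corresponds to an error of 0.069, which is within epsilon = 0.1; with delta = 0.02, at most 2 out of 100 trials may exceed the threshold.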
Auto scaling refers to the process of automatically adding cloud instances to a cluster to adapt to an increase in application workload (increased request rate), and removing instances from a cluster when the workload decreases (reduced request rate). However, state-of-the-art auto scaling techniques have the following disadvantages: (a) they require explicit policy definitions for changing the cluster configuration and therefore cannot automatically adapt a cluster to a changing workload; (b) they do not compute the appropriate amount of resources required, and therefore do not result in an "optimal" cluster composition. Infra provides an auto scaler that automatically adapts a cloud infrastructure to changing application workload, scaling the cluster up or down based on predictions from the Infra provisioning tool.
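The prediction-driven scaling described above can be sketched as follows. This is an illustrative Python sketch under stated assumptions, not Infra's auto scaler: the predictor is passed in as a hypothetical callable that maps a request rate to a target cluster composition, and the function names are invented for this example.

```python
from typing import Callable, Dict

Composition = Dict[str, int]  # instance type -> number of instances


def scaling_plan(current: Composition, target: Composition) -> Composition:
    """Per-type instance deltas: positive means add, negative means remove.
    Types whose count does not change are omitted."""
    plan: Composition = {}
    for instance_type in sorted(set(current) | set(target)):
        diff = target.get(instance_type, 0) - current.get(instance_type, 0)
        if diff != 0:
            plan[instance_type] = diff
    return plan


def autoscale_step(current: Composition, request_rate: float,
                   predict: Callable[[float], Composition]) -> Composition:
    """One scaling decision: ask the (hypothetical) predictor for the
    composition needed at this request rate, then diff it against the
    running cluster."""
    return scaling_plan(current, predict(request_rate))
```

For example, moving from `{"small": 2, "large": 1}` to a predicted `{"small": 4, "medium": 1}` yields a plan that adds two small instances and one medium instance and removes the large one. No explicit scaling policy is written by the user; the plan follows entirely from the predicted composition.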
Pages: 141-148
Page count: 8