Performance Interference-Aware Vertical Elasticity for Cloud-hosted Latency-Sensitive Applications

被引:30
|
作者
Shekhar, Shashank [1 ]
Abdel-Aziz, Hamzah [1 ]
Bhattacharjee, Anirban [1 ]
Gokhale, Aniruddha [1 ]
Koutsoukos, Xenofon [1 ]
机构
[1] Vanderbilt Univ, Dept EECS, 221 Kirkland Hall, Nashville, TN 37235 USA
基金
美国国家科学基金会;
关键词
Cloud computing; Data center; Multi-tenancy; Workload variability; Latency sensitive; Performance interference; Vertical elasticity; Virtualization; Linux containers; Docker; Online predictive models; Gaussian processes;
D O I
10.1109/CLOUD.2018.00018
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Elastic auto-scaling in cloud platforms has primarily used horizontal scaling by assigning application instances to distributed resources. Owing to rapid advances in hardware, cloud providers are now seeking vertical elasticity before attempting horizontal scaling to provide elastic auto-scaling for applications. Vertical elasticity solutions must, however, be cognizant of performance interference that stems from multi-tenant collocated applications since interference significantly impacts application quality-of-service (QoS) properties, such as latency. The problem becomes more pronounced for latency-sensitive applications that demand strict QoS properties. Further exacerbating the problem are variations in workloads, which make it hard to determine the right kinds of timely resource adaptations for latency-sensitive applications. To address these challenges and overcome limitations in existing offline approaches, we present an online, data-driven approach which utilizes Gaussian Processes-based machine learning techniques to build runtime predictive models of the performance of the system under different levels of interference. The predictive online models are then used in dynamically adapting to the workload variability by vertically auto-scaling co-located applications such that performance interference is minimized and QoS properties of latency-sensitive applications are met.
引用
收藏
页码:82 / 89
页数:8
相关论文
共 29 条
  • [1] Understanding Performance Interference Benchmarking and Application Profiling Techniques for Cloud-hosted Latency-Sensitive Applications
    Shekhar, Shashank
    Barve, Yogesh
    Gokhale, Aniruddha
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC' 17), 2017, : 187 - 188
  • [2] IADA: A dynamic interference-aware cloud scheduling architecture for latency-sensitive workloads
    Meyer, Vinicius
    da Silva, Matheus L.
    Kirchoff, Dionatra F.
    De Rose, Cesar A. F.
    JOURNAL OF SYSTEMS AND SOFTWARE, 2022, 194
  • [3] Network performance isolation for latency-sensitive cloud applications
    Cheng, Luwei
    Wang, Cho-Li
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2013, 29 (04): : 1073 - 1084
  • [4] Cloud Support for Latency-Sensitive Telephony Applications
    Kim, Jong Yul
    Schulzrinne, Henning
    2013 IEEE FIFTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), VOL 1, 2013, : 421 - 426
  • [5] Towards Resource and Contract Heterogeneity Aware Rescaling for Cloud-hosted Applications
    Chhetri, Mohan Baruwal
    Quoc Bao Vo
    Kowalczyk, Ryszard
    Nepal, Surya
    2018 18TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2018, : 153 - 162
  • [6] INDICES: Exploiting Edge Resources for Performance-aware Cloud-hosted Services
    Shekhar, Shashank
    Chhokra, Ajay Dev
    Bhattacharjee, Anirban
    Aupy, Guillaume
    Gokhale, Aniruddha
    2017 IEEE 1ST INTERNATIONAL CONFERENCE ON FOG AND EDGE COMPUTING (ICFEC), 2017, : 75 - 80
  • [7] Performance Monitoring and Root Cause Analysis for Cloud-hosted Web Applications
    Jayathilaka, Hiranya
    Krintz, Chandra
    Wolski, Rich
    PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17), 2017, : 469 - 478
  • [8] Interference-aware Component Scheduling for Reducing Tail Latency in Cloud Interactive Services
    Han, Rui
    Wang, Junwei
    Huang, Siguang
    Shao, Chenrong
    Zhan, Shulin
    Zhan, Jianfeng
    Luis Vazquez-Poletti, Jose
    2015 IEEE 35TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, 2015, : 744 - 745
  • [9] UPSARA: A Model-driven Approach for Performance Analysis of Cloud-hosted Applications
    Barve, Yogesh D.
    Shekhar, Shashank
    Khare, Shweta
    Bhattacharjee, Anirban
    Gokhale, Aniruddha
    2018 IEEE/ACM 11TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2018, : 1 - 10
  • [10] Nomad: An Efficient Consensus Approach for Latency-Sensitive Edge-Cloud Applications
    Hao, Zijiang
    Yi, Shanhe
    Li, Qun
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2019), 2019, : 2539 - 2547