Auto-Scaling Containerized Applications in Geo-Distributed Clouds

被引:3
|
作者
Shi, Tao [1 ]
Ma, Hui [2 ,3 ]
Chen, Gang [2 ,3 ]
Hartmann, Sven [4 ]
机构
[1] Qingdao Agr Univ, Sci & Informat Coll, Qingdao 266109, Shandong, Peoples R China
[2] Victoria Univ Wellington, Ctr Data Sci & Artificial Intelligence, Wellington 6011, New Zealand
[3] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington 6012, New Zealand
[4] Tech Univ Clausthal, Dept Informat, D-38678 Clausthal Zellerfeld, Germany
关键词
Auto-scaling; containerized application; geo-distributed clouds; workload management; safe reinforcement learning; time series analysis; SERVICE DEPLOYMENT; AWARE; MICROSERVICE; HYBRID; MODEL;
D O I
10.1109/TSC.2023.3317262
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As a lightweight and flexible infrastructure solution, containers have increasingly been used for application deployment on a global scale. By rapidly scaling containers at different locations, the deployed applications can handle dynamic workloads from the worldwide user community. Existing studies usually focus on the (dynamic) container scaling within a single data center or the (static) container deployment across geo-distributed data centers. This article studies an increasingly important container scaling problem for application deployment in geo-distributed clouds. Reinforcement learning (RL) has been widely used in container scaling due to its high adaptability and robustness. To handle high-dimensional state spaces in geo-distributed clouds, we propose a deep RL algorithm, named DeepScale, to auto-scale containerized applications. DeepScale innovatively utilizes multi-step predicted future workloads to train a holistic scaling policy. It features several newly designed algorithmic components, including a domain-tailored state constructor and a heuristic-based action executor. These new algorithmic components are essential to meet the requirements of low deployment costs and achieve desirable application performance. We conduct extensive simulation studies using real-world datasets. The results show that DeepScale can significantly outperform an industry-leading scaling strategy and two state-of-the-art baselines in terms of both cost-effectiveness and constraint satisfaction.
引用
收藏
页码:4261 / 4274
页数:14
相关论文
共 50 条
  • [21] A hybrid auto-scaling technique for clouds processing applications with service level agreements
    Anshuman Biswas
    Shikharesh Majumdar
    Biswajit Nandy
    Ali El-Haraki
    Journal of Cloud Computing, 6
  • [22] Resource Pricing Game in Geo-distributed Clouds
    Roh, Heejun
    Jung, Cheoulhoon
    Lee, Wonjun
    Du, Ding-Zhu
    2013 PROCEEDINGS IEEE INFOCOM, 2013, : 1519 - 1527
  • [23] A hybrid auto-scaling technique for clouds processing applications with service level agreements
    Biswas, Anshuman
    Majumdar, Shikharesh
    Nandy, Biswajit
    El-Haraki, Ali
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2017, 6
  • [24] Efficient evolutionary optimization using predictive auto-scaling in containerized environment
    Ivanovic, Milos
    Simic, Visnja
    APPLIED SOFT COMPUTING, 2022, 129
  • [25] An Auto-scaling Framework for Controlling Enterprise Resources on Clouds
    Biswas, Anshuman
    Majumdar, Shikharesh
    Nandy, Biswajit
    El-Haraki, Ali
    2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 971 - 980
  • [26] Reducing the expenses of geo-distributed data centers with portable containerized modules
    Brocanelli, Marco
    Zheng, Wenli
    Wang, Xiaorui
    PERFORMANCE EVALUATION, 2014, 79 : 104 - 119
  • [27] Proactive Auto-Scaling for Delay-Sensitive IoT Applications Over Edge Clouds
    Wang, Weimeng
    Liu, Lei
    Yan, Zhongmin
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (06) : 9536 - 9546
  • [28] Towards Maximal Service Profit in Geo-Distributed Clouds
    Yang, Zhenjie
    Cui, Yong
    Wang, Xin
    Liu, Yadong
    Li, Minming
    Zhang, Zhixing
    2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 442 - 452
  • [29] Model Driven Deployment of Auto-scaling Services on Multiple Clouds
    Alipour, Hanieh
    Liu, Yan
    2018 IEEE 15TH INTERNATIONAL CONFERENCE ON SOFTWARE ARCHITECTURE COMPANION (ICSA-C 2018), 2018, : 93 - 96
  • [30] PASCAL: An architecture for proactive auto-scaling of distributed services
    Lombardi, Federico
    Muti, Andrea
    Aniello, Leonardo
    Baldoni, Roberto
    Bonomi, Silvia
    Querzoni, Leonardo
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 98 : 342 - 361