The Power of Prediction: Microservice Auto Scaling via Workload Learning

Cited by: 31
Authors
Luo, Shutian [1, 2, 3, 5]
Xu, Huanle [3, 5]
Ye, Kejiang [1, 5]
Xu, Guoyao [4]
Zhang, Liping [4]
Yang, Guodong [4]
Xu, Chengzhong [3, 5]
Affiliations
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
[2] Univ CAS, Beijing, Peoples R China
[3] Univ Macau, Zhuhai, Peoples R China
[4] Alibaba Grp, Hangzhou, Peoples R China
[5] Guangdong Hong Kong Macao Joint Lab Human Machine, Shenzhen, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Microservices; Proactive Auto-scaler; Workload Uncertainty Learning
DOI
10.1145/3542929.3563477
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
When deploying microservices in production clusters, it is critical to automatically scale containers to improve cluster utilization and ensure service level agreements (SLA). Although reactive scaling approaches work well for monolithic architectures, they are not necessarily suitable for microservice frameworks due to the long delay caused by complex microservice call chains. In contrast, existing proactive approaches leverage end-to-end performance prediction for scaling, but cannot effectively handle microservice multiplexing and dynamic microservice dependencies. In this paper, we present Madu, a proactive microservice auto-scaler that scales containers based on predictions for individual microservices. Madu learns workload uncertainty to handle the highly dynamic dependency between microservices. Additionally, Madu adopts OS-level metrics to optimize resource usage while maintaining good control over scaling overhead. Experiments on large-scale deployments of microservices in Alibaba clusters show that the overall prediction accuracy of Madu can reach as high as 92.3% on average, which is 13% higher than the state-of-the-art approaches. Furthermore, experiments running real-world microservice benchmarks in a local cluster of 20 servers show that Madu can reduce the overall resource usage by 1.7x compared to reactive solutions, while reducing end-to-end service latency by 50%.
Pages: 355 - 369
Page count: 15
Related papers
50 in total
  • [21] Resource Provision Method of Cloud Platform for Power Distribution System via Workload Prediction
    Sun, Yuyan
    Hu, Chunchao
    Guo, Caishan
    Zhang, Ziyi
    Ma, Guolong
    Wang, Haizhu
    Cai, Zexiang
    4TH IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD 2019) / 3RD INTERNATIONAL SYMPOSIUM ON REINFORCEMENT LEARNING (ISRL 2019), 2019, : 7 - 12
  • [22] Learning to make auto-scaling decisions with heterogeneous spot and on-demand instances via reinforcement learning
    Lin, Liduo
    Pan, Li
    Liu, Shijun
    INFORMATION SCIENCES, 2022, 614 : 480 - 496
  • [23] Interference aware prediction mechanism for auto scaling in cloud
    Babu, K. R. Remesh
    Samuel, Philip
    COMPUTERS & ELECTRICAL ENGINEERING, 2018, 69 : 351 - 363
  • [24] Towards Workload Trend Time Series Probabilistic Prediction via Probabilistic Deep Learning
    Ruan, Li
    Guo, Heng
    Xue, Yunzhi
    Ruan, Tao
    Ji, Yuetiansi
    Xiao, Limin
    PROCEEDINGS OF 2023 18TH INTERNATIONAL SYMPOSIUM ON SPATIAL AND TEMPORAL DATA, SSTD 2023, 2023, : 41 - 50
  • [25] Hilbert Transform-Based Workload Prediction and Dynamic Frequency Scaling for Power-Efficient Video Encoding
    Jin, Xin
    Goto, Satoshi
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2012, 31 (05) : 649 - 661
  • [26] Introducing an adaptive model for auto-scaling cloud computing based on workload classification
    Alanagh, Yoosef Alidoost
    Firouzi, Mojtaba
    Kenari, Abdolreza Rasouli
    Shamsi, Mahboubeh
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (22):
  • [27] Cloud Workload Turning Points Prediction via Cloud Feature-Enhanced Deep Learning
    Ruan, Li
    Bai, Yu
    Li, Shaoning
    Lv, Jiaxun
    Zhang, Tianyuan
    Xiao, Limin
    Fang, Haiguang
    Wang, Chunhao
    Xue, Yunzhi
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2023, 11 (02) : 1719 - 1732
  • [28] Microscaler: Cost-Effective Scaling for Microservice Applications in the Cloud With an Online Learning Approach
    Yu, Guangba
    Chen, Pengfei
    Zheng, Zibin
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (02) : 1100 - 1116
  • [29] Workload Patterns for Quality-driven Dynamic Cloud Service Configuration and Auto-Scaling
    Zhang, Li
    Zhang, Yichuan
    Jamshidi, Pooyan
    Xu, Lei
    Pahl, Claus
    2014 IEEE/ACM 7TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2014, : 156 - 165
  • [30] Ensemble Learning for Large-Scale Workload Prediction
    Singh, Nidhi
    Rao, Shrisha
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2014, 2 (02) : 149 - 165