The Power of Prediction: Microservice Auto Scaling via Workload Learning

被引:31
|
作者
Luo, Shutian [1 ,2 ,3 ,5 ]
Xu, Huanle [3 ,5 ]
Ye, Kejiang [1 ,5 ]
Xu, Guoyao [4 ]
Zhang, Liping [4 ]
Yang, Guodong [4 ]
Xu, Chengzhong [3 ,5 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
[2] Univ CAS, Beijing, Peoples R China
[3] Univ Macau, Zhuhai, Peoples R China
[4] Alibaba Grp, Hangzhou, Peoples R China
[5] Guangdong Hong Kong Macao Joint Lab Human Machine, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Microservices; Proactive Auto-scaler; Workload Uncertainty Learning;
D O I
10.1145/3542929.3563477
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When deploying microservices in production clusters, it is critical to automatically scale containers to improve cluster utilization and ensure service level agreements (SLA). Although reactive scaling approaches work well for monolithic architectures, they are not necessarily suitable for microservice frameworks due to the long delay caused by complex microservice call chains. In contrast, existing proactive approaches leverage end-to-end performance prediction for scaling, but cannot effectively handle microservice multiplexing and dynamic microservice dependencies. In this paper, we present Madu, a proactive microservice auto-scaler that scales containers based on predictions for individual microservices. Madu learns workload uncertainty to handle the highly dynamic dependency between microservices. Additionally, Madu adopts OS-level metrics to optimize resource usage while maintaining good control over scaling overhead. Experiments on large-scale deployments of microservices in Alibaba clusters show that the overall prediction accuracy of Madu can reach as high as 92.3% on average, which is 13% higher than the state-of-the-art approaches. Furthermore, experiments running real-world microservice benchmarks in a local cluster of 20 servers show that Madu can reduce the overall resource usage by 1.7x compared to reactive solutions, while reducing end-to-end service latency by 50%.
引用
收藏
页码:355 / 369
页数:15
相关论文
共 50 条
  • [41] Dynamic power and temperature management based on accurate workload prediction
    Zhao, Guo-Xing
    Yan, Jia-Qi
    Luo, Zu-Ying
    Jisuanji Xuebao/Chinese Journal of Computers, 2014, 37 (11): : 2298 - 2308
  • [42] HephaestusForge: Optimal microservice deployment across the Compute Continuum via Reinforcement Learning
    Santos, Jose
    Zaccarini, Mattia
    Poltronieri, Filippo
    Tortonesi, Mauro
    Stefanelli, Cesare
    Di Cicco, Nicola
    De Turck, Filip
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 166
  • [43] x Online Microservice Orchestration for IoT via Multiobjective Deep Reinforcement Learning
    Yu, Yinbo
    Liu, Jiajia
    Fang, Jing
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (18) : 17513 - 17525
  • [44] PredictDDL: Reusable Workload Performance Prediction for Distributed Deep Learning
    Assogba, Kevin
    Lima, Eduardo
    Rafique, M. Mustafa
    Kwon, Minseok
    2023 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, CLUSTER, 2023, : 13 - 24
  • [45] Deep Learning Approach for Workload Prediction and Balancing in Cloud Computing
    Karimunnisa, Syed
    Pachipala, Yellamma
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (04) : 754 - 763
  • [46] Deep Reinforcement Learning for Workload Prediction in Federated Cloud Environments
    Ahamed, Zaakki
    Khemakhem, Maher
    Eassa, Fathy
    Alsolami, Fawaz
    Basuhail, Abdullah
    Jambi, Kamal
    SENSORS, 2023, 23 (15)
  • [47] Association Learning based Hybrid Model for Cloud Workload Prediction
    Kumar, Siddhant
    Muthiyan, Neha
    Gupta, Shaifu
    Dileep, A. D.
    Nigam, Aditya
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [48] Multivariate Deep Learning Model For Workload Prediction In Cloud Computing
    Dang-Quang, Nhat-Minh
    Yoo, Myungsik
    12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 858 - 862
  • [49] Performance prediction of a clean coal power plant via machine learning and deep learning techniques
    Haddadin, Mariana
    Mohamed, Omar
    Abu Elhaija, Wejdan
    Matar, Mustafa
    ENERGY & ENVIRONMENT, 2023, 35 (07) : 3575 - 3599
  • [50] Machine learning-based auto-scaling for containerized applications
    Mahmoud Imdoukh
    Imtiaz Ahmad
    Mohammad Gh. Alfailakawi
    Neural Computing and Applications, 2020, 32 : 9745 - 9760