The Power of Prediction: Microservice Auto Scaling via Workload Learning

被引:31
|
作者
Luo, Shutian [1 ,2 ,3 ,5 ]
Xu, Huanle [3 ,5 ]
Ye, Kejiang [1 ,5 ]
Xu, Guoyao [4 ]
Zhang, Liping [4 ]
Yang, Guodong [4 ]
Xu, Chengzhong [3 ,5 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
[2] Univ CAS, Beijing, Peoples R China
[3] Univ Macau, Zhuhai, Peoples R China
[4] Alibaba Grp, Hangzhou, Peoples R China
[5] Guangdong Hong Kong Macao Joint Lab Human Machine, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Microservices; Proactive Auto-scaler; Workload Uncertainty Learning;
D O I
10.1145/3542929.3563477
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When deploying microservices in production clusters, it is critical to automatically scale containers to improve cluster utilization and ensure service level agreements (SLA). Although reactive scaling approaches work well for monolithic architectures, they are not necessarily suitable for microservice frameworks due to the long delay caused by complex microservice call chains. In contrast, existing proactive approaches leverage end-to-end performance prediction for scaling, but cannot effectively handle microservice multiplexing and dynamic microservice dependencies. In this paper, we present Madu, a proactive microservice auto-scaler that scales containers based on predictions for individual microservices. Madu learns workload uncertainty to handle the highly dynamic dependency between microservices. Additionally, Madu adopts OS-level metrics to optimize resource usage while maintaining good control over scaling overhead. Experiments on large-scale deployments of microservices in Alibaba clusters show that the overall prediction accuracy of Madu can reach as high as 92.3% on average, which is 13% higher than the state-of-the-art approaches. Furthermore, experiments running real-world microservice benchmarks in a local cluster of 20 servers show that Madu can reduce the overall resource usage by 1.7x compared to reactive solutions, while reducing end-to-end service latency by 50%.
引用
收藏
页码:355 / 369
页数:15
相关论文
共 50 条
  • [1] Microservice Auto-Scaling Algorithm Based on Workload Prediction in Cloud-Edge Collaboration Environment
    Peng, Zijun
    Tang, Bing
    Xu, Wei
    Yang, Qing
    Hussaini, Ehsanullah
    Xiao, Yuqiang
    Li, Haiyan
    2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS, 2024, : 608 - 615
  • [2] Workload Prediction for Adaptive Power Scaling Using Deep Learning
    Tarsa, Stephen J.
    Kumar, Amit P.
    Kung, H. T.
    2014 IEEE INTERNATIONAL CONFERENCE ON IC DESIGN & TECHNOLOGY (ICICDT), 2014,
  • [3] Microservice-Oriented Workload Prediction Using Deep Learning
    Stefan, Sebastian
    Niculescu, Virginia
    E-INFORMATICA SOFTWARE ENGINEERING JOURNAL, 2022, 16 (01)
  • [4] Adaptive Workload Prediction for Proactive Auto Scaling in PaaS Systems
    Shariffdeen, R. S.
    Munasinghe, D. T. S. P.
    Bhathiya, H. S.
    Bandara, U. K. J. U.
    Bandara, H. M. N. Dilum
    2016 2ND INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGIES AND APPLICATIONS (CLOUDTECH), 2016, : 22 - 29
  • [5] Auto-Scaling Provision Basing on Workload Prediction in the Virtualized Data Center
    Feng, Danqing
    Wu, Zhibo
    Zuo, Decheng
    Zhang, Zhan
    INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2020, 12 (01) : 53 - 69
  • [6] Dynamic workload patterns prediction for proactive auto-scaling of web applications
    Iqbal, Waheed
    Erradi, Abdelkarim
    Mahmood, Arif
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2018, 124 : 94 - 107
  • [7] Improved Q Network Auto-Scaling in Microservice Architecture
    Kim, Yeonggwang
    Park, Jaehyung
    Yoon, Junchurl
    Kim, Jinsul
    APPLIED SCIENCES-BASEL, 2022, 12 (03):
  • [8] A cost-aware auto-scaling approach using the workload prediction in service clouds
    Jingqi Yang
    Chuanchang Liu
    Yanlei Shang
    Bo Cheng
    Zexiang Mao
    Chunhong Liu
    Lisha Niu
    Junliang Chen
    Information Systems Frontiers, 2014, 16 : 7 - 18
  • [9] A cost-aware auto-scaling approach using the workload prediction in service clouds
    Yang, Jingqi
    Liu, Chuanchang
    Shang, Yanlei
    Cheng, Bo
    Mao, Zexiang
    Liu, Chunhong
    Niu, Lisha
    Chen, Junliang
    INFORMATION SYSTEMS FRONTIERS, 2014, 16 (01) : 7 - 18
  • [10] Predictive Auto-scaling: LSTM-Based Multi-step Cloud Workload Prediction
    Suleiman, Basem
    Alibasa, Muhammad Johan
    Chang, Ya-Yuan
    Anaissi, Ali
    SERVICE-ORIENTED COMPUTING - ICSOC 2023 WORKSHOPS, 2024, 14518 : 5 - 16