Noah: Reinforcement-Learning-Based Rate Limiter for Microservices in Large-Scale E-Commerce Services

被引:2
|
作者
Li, Zhao [1 ]
Sun, Haifeng [2 ]
Xiong, Zheng [1 ]
Huang, Qun [2 ,3 ]
Hu, Zehong [1 ]
Li, Ding [1 ]
Ruan, Shasha [1 ]
Hong, Hai [1 ]
Gui, Jie [2 ]
He, Jintao [2 ]
Xu, Zebin [1 ]
Fang, Yang [1 ]
机构
[1] Alibaba Grp, Hangzhou 311121, Peoples R China
[2] Peking Univ, Dept Comp & Sci, Beijing 100871, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Containers; Microservice architectures; Production; Electronic commerce; Monitoring; Measurement; Training; Deep reinforcement learning (DRL); deployment experience; e-commerce; microservice; rate limit;
D O I
10.1109/TNNLS.2023.3264038
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modern large-scale online service providers typically deploy microservices into containers to achieve flexible service management. One critical problem in such container-based microservice architectures is to control the arrival rate of requests in the containers to avoid containers from being overloaded. In this article, we present our experience of rate limit for the containers in, one of the largest e-commerce services in the world. Given the highly diverse characteristics of containers in, we point out that the existing rate limit mechanisms cannot meet our demand. Thus, we design, a dynamic rate limiter that can automatically adapt to the specific characteristic of each container without human efforts. The key idea of is to use deep reinforcement learning (DRL) that automatically infers the most suitable configuration for each container. To fully embrace the advantages of DRL in our context, addresses two technical challenges. First, uses a lightweight system monitoring mechanism to collect container status. In this way, it minimizes the monitoring overhead while ensuring a timely reaction to system load changes. Second, injects synthetic extreme data when training its models. Thus, its model gains knowledge on unseen special events and hence remains highly available in extreme scenarios. To guarantee model convergence with the injected training data, adopts task-specific curriculum learning to train the model from normal data to extreme data gradually. has been deployed in the production of for two years, serving more than 50 000 containers and around 300 types of microservice applications. Experimental results show that can well adapt to three common scenarios in the production environment. It effectively achieves better system availability and shorter request response time compared with four state-of-the-art rate limiters.
引用
收藏
页码:5403 / 5417
页数:15
相关论文
共 50 条
  • [1] CFSF: On Cloud-Based Recommendation for Large-Scale E-commerce
    Hu, Long
    Lin, Kai
    Hassan, Mohammad Mehedi
    Alamri, Atif
    Alelaiwi, Abdulhameed
    MOBILE NETWORKS & APPLICATIONS, 2015, 20 (03): : 380 - 390
  • [2] CFSF: On Cloud-Based Recommendation for Large-Scale E-commerce
    Long Hu
    Kai Lin
    Mohammad Mehedi Hassan
    Atif Alamri
    Abdulhameed Alelaiwi
    Mobile Networks and Applications, 2015, 20 : 380 - 390
  • [3] Large-scale Visual Search and Similarity for E-Commerce
    Anand, Gaurav
    Wang, Siyun
    Ni, Karl
    APPLICATIONS OF MACHINE LEARNING 2021, 2021, 11843
  • [4] Ontology management for large-scale e-commerce applications
    Lee, J
    Goodwin, R
    DEEC 2005: International Workshop on Data Engineering Issues in E-Commerce, Proceedings, 2005, : 7 - 15
  • [5] e-CLIP: Large-Scale Vision-Language Representation Learning in E-commerce
    Shin, Wonyoung
    Park, Jonghun
    Woo, Taekang
    Cho, Yongwoo
    Oh, Kwangjin
    Song, Hwanjun
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3484 - 3494
  • [6] DAliM: Machine Learning Based Intelligent Lucky Money Determination for Large-Scale E-Commerce Businesses
    Fu, Min
    Wong, Chi Man
    Zhu, Hai
    Huang, Yanjun
    Li, Yuanping
    Zheng, Xi
    Wu, Jia
    Yang, Jian
    Vong, Chi Man
    SERVICE-ORIENTED COMPUTING (ICSOC 2018), 2018, 11236 : 740 - 755
  • [7] Building Large-Scale Deep Learning System for Entity Recognition in E-Commerce Search
    Wen, Musen
    Vasthimal, Deepak Kumar
    Lu, Alan
    Wang, Tian
    Guo, Aimin
    BDCAT'19: PROCEEDINGS OF THE 6TH IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES, 2019, : 149 - 154
  • [8] Online E-Commerce Fraud: A Large-scale Detection and Analysis
    Weng, Haiqin
    Li, Zhao
    Ji, Shouling
    Chu, Chen
    Lu, Haifeng
    Du, Tianyu
    He, Qinming
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1435 - 1440
  • [9] Machine Learning Clustering for Collaborative Filtering Recommendation of Large-Scale E-commerce in Cloud Computing
    Han, Ling-Mei
    Gao, Yan-Ping
    Liu, Jian-Guo
    Journal of Network Intelligence, 2023, 8 (04): : 1321 - 1337
  • [10] Heterogeneous Embedding Propagation for Large-scale E-Commerce User Alignment
    Zheng, Vincent W.
    Sha, Mo
    Li, Yuchen
    Yang, Hongxia
    Fang, Yuan
    Zhang, Zhenjie
    Tan, Kian-Lee
    Chang, Kevin Chen-Chuan
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 1434 - 1439