Noah: Reinforcement-Learning-Based Rate Limiter for Microservices in Large-Scale E-Commerce Services

被引:2
|
作者
Li, Zhao [1 ]
Sun, Haifeng [2 ]
Xiong, Zheng [1 ]
Huang, Qun [2 ,3 ]
Hu, Zehong [1 ]
Li, Ding [1 ]
Ruan, Shasha [1 ]
Hong, Hai [1 ]
Gui, Jie [2 ]
He, Jintao [2 ]
Xu, Zebin [1 ]
Fang, Yang [1 ]
机构
[1] Alibaba Grp, Hangzhou 311121, Peoples R China
[2] Peking Univ, Dept Comp & Sci, Beijing 100871, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Containers; Microservice architectures; Production; Electronic commerce; Monitoring; Measurement; Training; Deep reinforcement learning (DRL); deployment experience; e-commerce; microservice; rate limit;
D O I
10.1109/TNNLS.2023.3264038
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modern large-scale online service providers typically deploy microservices into containers to achieve flexible service management. One critical problem in such container-based microservice architectures is to control the arrival rate of requests in the containers to avoid containers from being overloaded. In this article, we present our experience of rate limit for the containers in, one of the largest e-commerce services in the world. Given the highly diverse characteristics of containers in, we point out that the existing rate limit mechanisms cannot meet our demand. Thus, we design, a dynamic rate limiter that can automatically adapt to the specific characteristic of each container without human efforts. The key idea of is to use deep reinforcement learning (DRL) that automatically infers the most suitable configuration for each container. To fully embrace the advantages of DRL in our context, addresses two technical challenges. First, uses a lightweight system monitoring mechanism to collect container status. In this way, it minimizes the monitoring overhead while ensuring a timely reaction to system load changes. Second, injects synthetic extreme data when training its models. Thus, its model gains knowledge on unseen special events and hence remains highly available in extreme scenarios. To guarantee model convergence with the injected training data, adopts task-specific curriculum learning to train the model from normal data to extreme data gradually. has been deployed in the production of for two years, serving more than 50 000 containers and around 300 types of microservice applications. Experimental results show that can well adapt to three common scenarios in the production environment. It effectively achieves better system availability and shorter request response time compared with four state-of-the-art rate limiters.
引用
收藏
页码:5403 / 5417
页数:15
相关论文
共 50 条
  • [41] A Reinforcement Learning Based Large-Scale Refinery Production Scheduling Algorithm
    Chen, Yuandong
    Ding, Jinliang
    Chen, Qingda
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 6041 - 6055
  • [42] Large-scale and adaptive service composition based on deep reinforcement learning
    Liu, Jiang-Wen
    Hu, Li-Qiang
    Cai, Zhao-Quan
    Xing, Li-Ning
    Tan, Xu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 65
  • [43] Deep Reinforcement Learning-Based Large-Scale Robot Exploration
    Cao, Yuhong
    Zhao, Rui
    Wang, Yizhuo
    Xiang, Bairan
    Sartoretti, Guillaume
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05) : 4631 - 4638
  • [44] Adaptive and large-scale service composition based on deep reinforcement learning
    Wang, Hongbing
    Gu, Mingzhu
    Yu, Qi
    Tao, Yong
    Li, Jiajie
    Fei, Huanhuan
    Yan, Jia
    Zhao, Wei
    Hong, Tianjing
    KNOWLEDGE-BASED SYSTEMS, 2019, 180 : 75 - 90
  • [45] Semi-Supervised Learning with Auxiliary Evaluation Component for Large Scale e-Commerce Text Classification
    Liu, Mingkuan
    Wen, Musen
    Kopru, Selcuk
    Liu, Xianjing
    Lu, Alan
    DEEP LEARNING APPROACHES FOR LOW-RESOURCE NATURAL LANGUAGE PROCESSING (DEEPLO), 2018, : 68 - 76
  • [46] InfDetect: a Large Scale Graph-based Fraud Detection System for E-Commerce Insurance
    Chen, Cen
    Liang, Chen
    Lin, Jianbin
    Wang, Li
    Liu, Ziqi
    Yang, Xinxing
    Zhou, Jun
    Shuang, Yang
    Qi, Yuan
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 1765 - 1773
  • [47] On-line dynamic reinforcement learning and its application in multi-agent based e-commerce
    Li Jian
    Jing Bo
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE INFORMATION COMPUTING AND AUTOMATION, VOLS 1-3, 2008, : 1359 - +
  • [48] Solving the green reverse logistics problem in e-commerce using a reinforcement learning based genetic algorithm
    Li, Min-Yang
    Shih, Fang-Yu
    ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2024, 68
  • [49] FLOPPIES: A Framework for Large-Scale Ontology Population of Product Information from Tabular Data in E-commerce Stores
    Nederstigt, Lennart J.
    Aanen, Steven S.
    Vandic, Damir
    Frasincar, Flavius
    DECISION SUPPORT SYSTEMS, 2014, 59 : 296 - 311
  • [50] The JDDC Corpus: A Large-Scale Multi-Turn Chinese Dialogue Dataset for E-commerce Customer Service
    Chen, Meng
    Liu, Ruixue
    Shen, Lei
    Yuan, Shaozu
    Zhou, Jingyan
    Wu, Youzheng
    He, Xiaodong
    Zhou, Bowen
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 459 - 466