Noah: Reinforcement-Learning-Based Rate Limiter for Microservices in Large-Scale E-Commerce Services

被引：2

作者：

Li, Zhao ^{[1
]}

Sun, Haifeng ^{[2
]}

Xiong, Zheng ^{[1
]}

Huang, Qun ^{[2
,3
]}

Hu, Zehong ^{[1
]}

Li, Ding ^{[1
]}

Ruan, Shasha ^{[1
]}

Hong, Hai ^{[1
]}

Gui, Jie ^{[2
]}

He, Jintao ^{[2
]}

Xu, Zebin ^{[1
]}

Fang, Yang ^{[1
]}

机构：

[1] Alibaba Grp, Hangzhou 311121, Peoples R China

[2] Peking Univ, Dept Comp & Sci, Beijing 100871, Peoples R China

[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2023年 / 34卷 / 09期

基金：

中国国家自然科学基金;

关键词：

Containers; Microservice architectures; Production; Electronic commerce; Monitoring; Measurement; Training; Deep reinforcement learning (DRL); deployment experience; e-commerce; microservice; rate limit;

D O I：

10.1109/TNNLS.2023.3264038

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Modern large-scale online service providers typically deploy microservices into containers to achieve flexible service management. One critical problem in such container-based microservice architectures is to control the arrival rate of requests in the containers to avoid containers from being overloaded. In this article, we present our experience of rate limit for the containers in, one of the largest e-commerce services in the world. Given the highly diverse characteristics of containers in, we point out that the existing rate limit mechanisms cannot meet our demand. Thus, we design, a dynamic rate limiter that can automatically adapt to the specific characteristic of each container without human efforts. The key idea of is to use deep reinforcement learning (DRL) that automatically infers the most suitable configuration for each container. To fully embrace the advantages of DRL in our context, addresses two technical challenges. First, uses a lightweight system monitoring mechanism to collect container status. In this way, it minimizes the monitoring overhead while ensuring a timely reaction to system load changes. Second, injects synthetic extreme data when training its models. Thus, its model gains knowledge on unseen special events and hence remains highly available in extreme scenarios. To guarantee model convergence with the injected training data, adopts task-specific curriculum learning to train the model from normal data to extreme data gradually. has been deployed in the production of for two years, serving more than 50 000 containers and around 300 types of microservice applications. Experimental results show that can well adapt to three common scenarios in the production environment. It effectively achieves better system availability and shorter request response time compared with four state-of-the-art rate limiters.

引用

页码：5403 / 5417

页数：15

共 50 条

[41] A Reinforcement Learning Based Large-Scale Refinery Production Scheduling Algorithm
Chen, Yuandong
Ding, Jinliang
Chen, Qingda
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 6041 - 6055
[42] Large-scale and adaptive service composition based on deep reinforcement learning
Liu, Jiang-Wen
Hu, Li-Qiang
Cai, Zhao-Quan
Xing, Li-Ning
Tan, Xu
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 65
[43] Deep Reinforcement Learning-Based Large-Scale Robot Exploration
Cao, Yuhong
Zhao, Rui
Wang, Yizhuo
Xiang, Bairan
Sartoretti, Guillaume
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05) : 4631 - 4638
[44] Adaptive and large-scale service composition based on deep reinforcement learning
Wang, Hongbing
Gu, Mingzhu
Yu, Qi
Tao, Yong
Li, Jiajie
Fei, Huanhuan
Yan, Jia
Zhao, Wei
Hong, Tianjing
KNOWLEDGE-BASED SYSTEMS, 2019, 180 : 75 - 90
[45] Semi-Supervised Learning with Auxiliary Evaluation Component for Large Scale e-Commerce Text Classification
Liu, Mingkuan
Wen, Musen
Kopru, Selcuk
Liu, Xianjing
Lu, Alan
DEEP LEARNING APPROACHES FOR LOW-RESOURCE NATURAL LANGUAGE PROCESSING (DEEPLO), 2018, : 68 - 76
[46] InfDetect: a Large Scale Graph-based Fraud Detection System for E-Commerce Insurance
Chen, Cen
Liang, Chen
Lin, Jianbin
Wang, Li
Liu, Ziqi
Yang, Xinxing
Zhou, Jun
Shuang, Yang
Qi, Yuan
2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 1765 - 1773
[47] On-line dynamic reinforcement learning and its application in multi-agent based e-commerce
Li Jian
Jing Bo
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE INFORMATION COMPUTING AND AUTOMATION, VOLS 1-3, 2008, : 1359 - +
[48] Solving the green reverse logistics problem in e-commerce using a reinforcement learning based genetic algorithm
Li, Min-Yang
Shih, Fang-Yu
ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2024, 68
[49] FLOPPIES: A Framework for Large-Scale Ontology Population of Product Information from Tabular Data in E-commerce Stores
Nederstigt, Lennart J.
Aanen, Steven S.
Vandic, Damir
Frasincar, Flavius
DECISION SUPPORT SYSTEMS, 2014, 59 : 296 - 311
[50] The JDDC Corpus: A Large-Scale Multi-Turn Chinese Dialogue Dataset for E-commerce Customer Service
Chen, Meng
Liu, Ruixue
Shen, Lei
Yuan, Shaozu
Zhou, Jingyan
Wu, Youzheng
He, Xiaodong
Zhou, Bowen
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 459 - 466

← 1 2 3 4 5 →