Real-Time Rideshare Driver Supply Values Using Online Reinforcement Learning

Cited by: 3

Authors:

Han, Benjamin [1,4]

Lee, Hyungjun [2,4]

Martin, Sebastien [3,4]

Affiliations:

[1] OpenSea, San Francisco, CA 94110 USA

[2] Snap Inc, San Francisco, CA USA

[3] Northwestern Univ, Evanston, IL USA

[4] Lyft Inc, San Francisco, CA USA

Source:

PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022 | 2022

Keywords:

Multi-Agent Reinforcement Learning; Online Learning; On-Policy Control; Temporal Difference; Streaming; Real-Time; Adaptive; Dispatch; Matching; Rideshare; Transportation

DOI:

10.1145/3534678.3539141

Chinese Library Classification:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

In this paper, we present Online Supply Values (OSV), a system for estimating the return of available rideshare drivers to match drivers to ride requests at Lyft. Because a future driver state can be accurately predicted from a request destination, it is possible to estimate the expected action value of assigning a ride request to an available driver as a Markov Decision Process using the Bellman Equation. These estimates are updated using temporal-difference learning and are shown to adapt to changing marketplace conditions in real time. While reinforcement learning has been studied for rideshare dispatch, fully online approaches without offline priors or other guardrails had never been evaluated in the real world. This work presents the algorithmic changes needed to bridge this gap. OSV is now deployed globally as a core component of Lyft's dispatch matching system. Our A/B user experiments in major US cities measure a +(0.96 +/- 0.53)% increase in the request fulfillment rate and a +(0.73 +/- 0.22)% increase in profit per passenger session over the previous algorithm.
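
The update described in the abstract can be illustrated with a minimal tabular TD(0) sketch. This is not the OSV implementation from the paper: the class name `OnlineValueTable`, the string state labels, the learning rate `alpha`, the discount `gamma`, and the choice to discount by trip duration are all assumptions made for illustration. It only shows the general idea that, because the destination determines the driver's next state, the action value of a match can be written as the trip reward plus the discounted value of the destination state, and the origin state's value can be nudged toward that target as each match is observed.

```python
from collections import defaultdict


class OnlineValueTable:
    """Hypothetical tabular driver-state values updated online with TD(0)."""

    def __init__(self, alpha=0.1, gamma=0.99):
        self.alpha = alpha  # learning rate (assumed value)
        self.gamma = gamma  # per-time-step discount (assumed value)
        self.values = defaultdict(float)  # unseen states start at 0.0

    def action_value(self, reward, duration, dest_state):
        # Bellman-style estimate: trip reward plus the discounted value
        # of the state the driver will be in after the drop-off.
        return reward + (self.gamma ** duration) * self.values[dest_state]

    def td_update(self, state, reward, duration, dest_state):
        # Move the origin state's value toward the observed one-step target.
        target = self.action_value(reward, duration, dest_state)
        td_error = target - self.values[state]
        self.values[state] += self.alpha * td_error
        return td_error
```

For example, with `alpha=0.5` and `gamma=1.0`, a single observed trip from a state labeled `"downtown"` with reward 10.0 moves that state's value from 0.0 halfway toward the target of 10.0. A dispatcher could then compare `action_value(...)` across candidate driver and request pairs when choosing matches, though how OSV actually feeds these values into matching is not described in this record.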

Pages: 2968-2976

Number of pages: 9
