Real-Time Rideshare Driver Supply Values Using Online Reinforcement Learning

Cited by: 3
Authors
Han, Benjamin [1, 4]
Lee, Hyungjun [2, 4]
Martin, Sebastien [3, 4]
Affiliations
[1] OpenSea, San Francisco, CA 94110 USA
[2] Snap Inc, San Francisco, CA USA
[3] Northwestern Univ, Evanston, IL USA
[4] Lyft Inc, San Francisco, CA USA
Keywords
Multi-Agent Reinforcement Learning; Online Learning; On-Policy Control; Temporal Difference; Streaming; Real-Time; Adaptive; Dispatch; Matching; Rideshare; Transportation
DOI
10.1145/3534678.3539141
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In this paper, we present Online Supply Values (OSV), a system for estimating the return of available rideshare drivers to match drivers to ride requests at Lyft. Because a future driver state can be accurately predicted from a request destination, it is possible to estimate the expected action value of assigning a ride request to an available driver as a Markov Decision Process using the Bellman Equation. These estimates are updated using temporal difference and are shown to adapt to changing marketplace conditions in real-time. While reinforcement learning has been studied for rideshare dispatch, fully-online approaches without offline priors or other guardrails had never been evaluated in the real world. This work presents the algorithmic changes needed to bridge this gap. OSV is now deployed globally as a core component of Lyft's dispatch matching system. Our A/B user experiments in major US cities measure a +(0.96 +/- 0.53)% increase in the request fulfillment rate and a +(0.73 +/- 0.22)% increase to profit per passenger session over the previous algorithm.
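The temporal-difference idea described in the abstract can be illustrated with a minimal tabular sketch. This is not the system deployed at Lyft: the region-keyed state, the per-minute discount, the step size, and every name below (`SupplyValueTable`, `action_value`, `update`) are assumptions made for illustration only. The sketch shows how the value of an available driver in one region might be nudged toward the trip reward plus the discounted value at the request destination.

```python
from collections import defaultdict


class SupplyValueTable:
    """Hypothetical tabular TD(0) estimate of the value of an available driver per region."""

    def __init__(self, step_size=0.1, discount_per_minute=0.99):
        self.step_size = step_size                    # assumed learning rate
        self.discount_per_minute = discount_per_minute  # assumed time discount
        self.values = defaultdict(float)              # region id -> estimated value

    def action_value(self, reward, destination, trip_minutes):
        """Bellman-style estimate: trip reward plus discounted value at the destination."""
        discount = self.discount_per_minute ** trip_minutes
        return reward + discount * self.values[destination]

    def update(self, origin, reward, destination, trip_minutes):
        """Move the origin's value toward the TD target; return the TD error."""
        target = self.action_value(reward, destination, trip_minutes)
        td_error = target - self.values[origin]
        self.values[origin] += self.step_size * td_error
        return td_error


# Illustrative usage with made-up regions and rewards.
table = SupplyValueTable(step_size=0.5, discount_per_minute=1.0)
table.update("downtown", reward=10.0, destination="airport", trip_minutes=20)
table.update("airport", reward=4.0, destination="downtown", trip_minutes=20)
```

Because each observed trip updates the table immediately, estimates of this kind can track changing conditions without an offline training phase, which is the property the abstract emphasizes.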
Pages: 2968 - 2976
Page count: 9
Related papers
50 in total
  • [1] Real-Time IDS Using Reinforcement Learning
    Sagha, Hesam
    Shouraki, Saeed Bagheri
    Khasteh, Hosein
    Dehghani, Mahdi
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL II, PROCEEDINGS, 2008, : 593 - +
  • [2] Real-time optimization using reinforcement learning
    Powell, Kody M.
    Machalek, Derek
    Quah, Titus
    COMPUTERS & CHEMICAL ENGINEERING, 2020, 143 (143)
  • [3] EXPERIMENTS WITH ONLINE REINFORCEMENT LEARNING IN REAL-TIME STRATEGY GAMES
    Andersen, Kresten Toftgaard
    Zeng, Yifeng
    Christensen, Dennis Dahl
    Tran, Dung
    APPLIED ARTIFICIAL INTELLIGENCE, 2009, 23 (09) : 855 - 871
  • [4] Real-Time Reinforcement Learning
    Ramstedt, Simon
    Pal, Christopher
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [5] Real-time Driver Drowsiness Detection using Deep Learning
    Dipu M.T.A.
    Hossain S.S.
    Arafat Y.
    Rafiq F.B.
    Dipu, Md. Tanvir Ahammed, 1600, Science and Information Organization (12): : 844 - 850
  • [6] Real-time Driver Drowsiness Detection using Deep Learning
    Dipu, Md Tanvir Ahammed
    Hossain, Syeda Sumbul
    Arafat, Yeasir
    Rafiq, Fatama Binta
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (07) : 844 - 850
  • [7] Benchmarking Real-Time Reinforcement Learning
    Thodoroff, Pierre
    Li, Wenyu
    Lawrence, Neil D.
    NEURIPS 2021 WORKSHOP ON PRE-REGISTRATION IN MACHINE LEARNING, VOL 181, 2021, 181 : 26 - 41
  • [8] Reinforcement Learning-based Real-time Fair Online Resource Matching
    Mishra, Pankaj
    Moustafa, Ahmed
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 1, 2022, : 34 - 41
  • [9] Reinforcement Learning for Online Dispatching Policy in Real-Time Train Timetable Rescheduling
    Yue P.
    Jin Y.
    Dai X.
    Feng Z.
    Cui D.
    IEEE Transactions on Intelligent Transportation Systems, 2024, 25 (01) : 478 - 490
  • [10] Real-time Energy Management of Microgrid Using Reinforcement Learning
    Bi, Wenzheng
    Shu, Yuankai
    Dong, Wei
    Yang, Qiang
    2020 19TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS FOR BUSINESS ENGINEERING AND SCIENCE (DCABES 2020), 2020, : 38 - 41