Real-world ride-hailing vehicle repositioning using deep reinforcement learning

被引：32

作者：

Jiao, Yan ^{[1
]}

Tang, Xiaocheng ^{[1
]}

Qin, Zhiwei ^{[1
]}

Li, Shuaiji ^{[1
]}

Zhang, Fan ^{[2
]}

Zhu, Hongtu ^{[2
]}

Ye, Jieping ^{[3
]}

机构：

[1] DiDi Labs, Mountain View, CA 94043 USA

[2] Didi Chuxing, Beijing, Peoples R China

[3] Univ Michigan, Ann Arbor, MI 48109 USA

来源：

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES | 2021年 / 130卷

关键词：

Ridesharing; Vehicle repositioning; Deep reinforcement learning; ALGORITHM;

D O I：

10.1016/j.trc.2021.103289

中图分类号：

U [交通运输];

学科分类号：

08 ; 0823 ;

摘要：

We present a new practical framework based on deep reinforcement learning and decision-time planning for real-world vehicle repositioning on ride-hailing (a type of mobility-on-demand, MoD) platforms. Our approach learns the spatiotemporal state-value function using a batch training algorithm with deep value networks. The optimal repositioning action is generated ondemand through value-based policy search, which combines planning and bootstrapping with the value networks. For the large-fleet problems, we develop several algorithmic features that we incorporate into our framework and that we demonstrate to induce coordination among the algorithmically-guided vehicles. We benchmark our algorithm with baselines in a ride-hailing simulation environment to demonstrate its superiority in improving income efficiency measured by income-per-hour. We have also designed and run a real-world experiment program with regular drivers on a major ride-hailing platform. We have observed significantly positive results on key metrics comparing our method with experienced drivers who performed idle-time repositioning based on their own expertise.

引用

页数：25

共 50 条

[1] Real-world ride-hailing vehicle repositioning using deep reinforcement learning
Jiao, Yan
Tang, Xiaocheng
Qin, Zhiwei
Li, Shuaiji
Zhang, Fan
Zhu, Hongtu
Ye, Jieping
[J]. Transportation Research Part C: Emerging Technologies, 2021, 130
[2] Scalable Deep Reinforcement Learning for Ride-Hailing
Feng, Jiekun
Gluzman, Mark
Dai, J. G.
[J]. 2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 3743 - 3748
[3] Scalable Deep Reinforcement Learning for Ride-Hailing
Feng, Jiekun
Gluzman, Mark
Dai, J. G.
[J]. IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (06): : 2060 - 2065
[4] Equilibrium Inverse Reinforcement Learning for Ride-hailing Vehicle Network
Oda, Takuma
[J]. PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 2281 - 2290
[5] Deep dispatching: A deep reinforcement learning approach for vehicle dispatching on online ride-hailing platform
Liu, Yang
Wu, Fanyou
Lyu, Cheng
Li, Shen
Ye, Jieping
Qu, Xiaobo
[J]. TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2022, 161
[6] A Reinforcement Learning and Prediction-Based Lookahead Policy for Vehicle Repositioning in Online Ride-Hailing Systems
Wei, Honghao
Yang, Zixian
Liu, Xin
Qin, Zhiwei
Tang, Xiaocheng
Ying, Lei
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (02) : 1846 - 1856
[7] Operating Electric Vehicle Fleet for Ride-Hailing Services With Reinforcement Learning
Shi, Jie
Gao, Yuanqi
Wang, Wei
Yu, Nanpeng
Ioannou, Petros A.
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (11) : 4822 - 4834
[8] Reinforcement Learning from Optimization Proxy for Ride-Hailing Vehicle Relocation
Yuan, Enpeng
Chen, Wenbo
Van Hentenryck, Pascal
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 75 : 985 - 1002
[9] Reinforcement Learning from Optimization Proxy for Ride-Hailing Vehicle Relocation
Yuan, Enpeng
Chen, Wenbo
Van Hentenryck, Pascal
[J]. Journal of Artificial Intelligence Research, 2022, 75 : 985 - 1002
[10] Dynamic Ride-Hailing Route Planning Based on Deep Reinforcement Learning
Zheng, Bolong
Ming, Lingfeng
Hu, Qi
Fang, Yixiang
Zheng, Kai
Li, Guohui
[J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (02): : 329 - 341

← 1 2 3 4 5 →