Real-world ride-hailing vehicle repositioning using deep reinforcement learning

被引:32
|
作者
Jiao, Yan [1 ]
Tang, Xiaocheng [1 ]
Qin, Zhiwei [1 ]
Li, Shuaiji [1 ]
Zhang, Fan [2 ]
Zhu, Hongtu [2 ]
Ye, Jieping [3 ]
机构
[1] DiDi Labs, Mountain View, CA 94043 USA
[2] Didi Chuxing, Beijing, Peoples R China
[3] Univ Michigan, Ann Arbor, MI 48109 USA
关键词
Ridesharing; Vehicle repositioning; Deep reinforcement learning; ALGORITHM;
D O I
10.1016/j.trc.2021.103289
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
We present a new practical framework based on deep reinforcement learning and decision-time planning for real-world vehicle repositioning on ride-hailing (a type of mobility-on-demand, MoD) platforms. Our approach learns the spatiotemporal state-value function using a batch training algorithm with deep value networks. The optimal repositioning action is generated ondemand through value-based policy search, which combines planning and bootstrapping with the value networks. For the large-fleet problems, we develop several algorithmic features that we incorporate into our framework and that we demonstrate to induce coordination among the algorithmically-guided vehicles. We benchmark our algorithm with baselines in a ride-hailing simulation environment to demonstrate its superiority in improving income efficiency measured by income-per-hour. We have also designed and run a real-world experiment program with regular drivers on a major ride-hailing platform. We have observed significantly positive results on key metrics comparing our method with experienced drivers who performed idle-time repositioning based on their own expertise.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Real-world ride-hailing vehicle repositioning using deep reinforcement learning
    Jiao, Yan
    Tang, Xiaocheng
    Qin, Zhiwei
    Li, Shuaiji
    Zhang, Fan
    Zhu, Hongtu
    Ye, Jieping
    [J]. Transportation Research Part C: Emerging Technologies, 2021, 130
  • [2] Scalable Deep Reinforcement Learning for Ride-Hailing
    Feng, Jiekun
    Gluzman, Mark
    Dai, J. G.
    [J]. 2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 3743 - 3748
  • [3] Scalable Deep Reinforcement Learning for Ride-Hailing
    Feng, Jiekun
    Gluzman, Mark
    Dai, J. G.
    [J]. IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (06): : 2060 - 2065
  • [4] Equilibrium Inverse Reinforcement Learning for Ride-hailing Vehicle Network
    Oda, Takuma
    [J]. PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 2281 - 2290
  • [5] Deep dispatching: A deep reinforcement learning approach for vehicle dispatching on online ride-hailing platform
    Liu, Yang
    Wu, Fanyou
    Lyu, Cheng
    Li, Shen
    Ye, Jieping
    Qu, Xiaobo
    [J]. TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2022, 161
  • [6] A Reinforcement Learning and Prediction-Based Lookahead Policy for Vehicle Repositioning in Online Ride-Hailing Systems
    Wei, Honghao
    Yang, Zixian
    Liu, Xin
    Qin, Zhiwei
    Tang, Xiaocheng
    Ying, Lei
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (02) : 1846 - 1856
  • [7] Operating Electric Vehicle Fleet for Ride-Hailing Services With Reinforcement Learning
    Shi, Jie
    Gao, Yuanqi
    Wang, Wei
    Yu, Nanpeng
    Ioannou, Petros A.
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (11) : 4822 - 4834
  • [8] Reinforcement Learning from Optimization Proxy for Ride-Hailing Vehicle Relocation
    Yuan, Enpeng
    Chen, Wenbo
    Van Hentenryck, Pascal
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 75 : 985 - 1002
  • [9] Reinforcement Learning from Optimization Proxy for Ride-Hailing Vehicle Relocation
    Yuan, Enpeng
    Chen, Wenbo
    Van Hentenryck, Pascal
    [J]. Journal of Artificial Intelligence Research, 2022, 75 : 985 - 1002
  • [10] Dynamic Ride-Hailing Route Planning Based on Deep Reinforcement Learning
    Zheng, Bolong
    Ming, Lingfeng
    Hu, Qi
    Fang, Yixiang
    Zheng, Kai
    Li, Guohui
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (02): : 329 - 341