Optimizing Long-Term Efficiency and Fairness in Ride-Hailing via Joint Order Dispatching and Driver Repositioning

Cited by: 11
Authors
Sun, Jiahui [1]
Jin, Haiming [1]
Yang, Zhaoxing [1]
Su, Lu [2]
Wang, Xinbing [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Purdue Univ, W Lafayette, IN 47907 USA
Keywords
Ride-Hailing; Long-Term Efficiency and Fairness; Joint Order Dispatching and Driver Repositioning
DOI
10.1145/3534678.3539060
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The ride-hailing service offered by mobility-on-demand platforms, such as Uber and Didi Chuxing, has greatly facilitated people's traveling and commuting, and become increasingly popular in recent years. Efficiency (e.g., gross merchandise volume) has always been an important metric for such platforms. However, only focusing on the efficiency inevitably ignores the fairness of driver incomes, which could impair the sustainability of the overall ride-hailing system in the long run. To optimize the aforementioned two essential metrics, order dispatching and driver repositioning play an important role, as they impact not only the immediate, but also the future order-serving outcomes of drivers. Thus, in this paper, we aim to exploit joint order dispatching and driver repositioning to optimize both the long-term efficiency and fairness for ride-hailing platforms. To address this problem, we propose a novel multi-agent reinforcement learning framework, referred to as JDRL, to help drivers make distributed order selection and repositioning decisions. Specifically, to cope with the variable action space, JDRL segments the action space into a fixed number of action groups, and fixes the policy output dimension for order selection as the number of action groups. In terms of the fairness criterion, JDRL adopts the max-min fairness, and augments the vanilla policy gradient to an iterative training algorithm that alternates between a minimization step and a policy improvement step to maximize both the worst and the overall performance of agents. In addition, we provide the theoretical convergence guarantee of our JDRL training algorithm even under non-convex policy networks and stochastic gradient updating. Extensive experiments are conducted with three public real-world ride-hailing order datasets, including over 2 million orders in Haikou, China, over 5 million orders in Chengdu, China, and over 6 million orders in New York City, USA. 
Experimental results show that JDRL demonstrates a consistent advantage compared to state-of-the-art baselines in terms of both efficiency and fairness. To the best of our knowledge, this is the first work that exploits joint order dispatching and driver repositioning to optimize both the long-term efficiency and fairness in a ride-hailing system.
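The alternation described in the abstract, a minimization step followed by a policy improvement step, can be illustrated with a toy max-min problem. The sketch below is not the JDRL algorithm: the two return functions, the scalar parameter `theta`, the step size, and the function names are all hypothetical, chosen only so that the max-min optimum (`theta = 0.5`) is easy to verify by hand. It also covers only the worst-case part of the objective and omits the overall-performance term.

```python
def agent_returns(theta):
    """Hypothetical returns of two agents sharing one policy parameter."""
    return [2 * theta - theta ** 2, 1 - theta ** 2]


def agent_gradients(theta):
    """Derivatives of the two toy returns with respect to theta."""
    return [2 - 2 * theta, -2 * theta]


def train_max_min(theta=0.1, lr=0.01, steps=500):
    """Alternate a minimization step and a policy improvement step."""
    for _ in range(steps):
        # Minimization step: find the agent with the lowest return.
        returns = agent_returns(theta)
        worst = min(range(len(returns)), key=returns.__getitem__)
        # Policy improvement step: gradient ascent on that agent's return.
        theta += lr * agent_gradients(theta)[worst]
        # Keep the toy parameter inside its valid range [0, 1].
        theta = min(1.0, max(0.0, theta))
    return theta


if __name__ == "__main__":
    theta = train_max_min()
    print(theta, agent_returns(theta))
```

With a constant step size, the parameter settles into a small oscillation around the point where both toy returns are equal, which is the usual behavior of subgradient ascent on a minimum of smooth functions.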
Pages: 3950 - 3960
Number of pages: 11
Related papers
4 items in total
  • [1] Optimizing Long-Term Efficiency and Fairness in Ride-Hailing Under Budget Constraint via Joint Order Dispatching and Driver Repositioning
    Sun, Jiahui
    Jin, Haiming
    Yang, Zhaoxing
    Su, Lu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (07) : 3348 - 3362
  • [2] Ride-Hailing Order Dispatching at DiDi via Reinforcement Learning
    Qin, Zhiwei
    Tang, Xiaocheng
    Jiao, Yan
    Zhang, Fan
    Xu, Zhe
    Zhu, Hongtu
    Ye, Jieping
    INFORMS JOURNAL ON APPLIED ANALYTICS, 2020, 50 (05) : 272 - 286
  • [3] Joint Optimization of Pricing, Dispatching and Repositioning in Ride-Hailing With Multiple Models Interplayed Reinforcement Learning
    Zhang, Zhongyun
    Yang, Lei
    Yao, Jiajun
    Ma, Chao
    Wang, Jianguo
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (12) : 8593 - 8606
  • [4] CoRide: Joint Order Dispatching and Fleet Management for Multi-Scale Ride-Hailing Platforms
    Jin, Jiarui
    Zhou, Ming
    Zhang, Weinan
    Li, Minne
    Guo, Zilong
    Qin, Zhiwei
    Jiao, Yan
    Tang, Xiaocheng
    Wang, Chenxi
    Wang, Jun
    Wu, Guobin
    Ye, Jieping
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019 : 1983 - 1992