Optimizing Long-Term Efficiency and Fairness in Ride-Hailing via Joint Order Dispatching and Driver Repositioning

被引：11

作者：

Sun, Jiahui ^{[1
]}

Jin, Haiming ^{[1
]}

Yang, Zhaoxing ^{[1
]}

Su, Lu ^{[2
]}

Wang, Xinbing ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China

[2] Purdue Univ, W Lafayette, IN 47907 USA

来源：

PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022 | 2022年

关键词：

Ride-Hailing; Long-Term Efficiency and Fairness; Joint Order Dispatching and Driver Repositioning;

D O I：

10.1145/3534678.3539060

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The ride-hailing service offered by mobility-on-demand platforms, such as Uber and Didi Chuxing, has greatly facilitated people's traveling and commuting, and become increasingly popular in recent years. Efficiency (e.g., gross merchandise volume) has always been an important metric for such platforms. However, only focusing on the efficiency inevitably ignores the fairness of driver incomes, which could impair the sustainability of the overall ride-hailing system in the long run. To optimize the aforementioned two essential metrics, order dispatching and driver repositioning play an important role, as they impact not only the immediate, but also the future order-serving outcomes of drivers. Thus, in this paper, we aim to exploit joint order dispatching and driver repositioning to optimize both the long-term efficiency and fairness for ride-hailing platforms. To address this problem, we propose a novel multi-agent reinforcement learning framework, referred to as JDRL, to help drivers make distributed order selection and repositioning decisions. Specifically, to cope with the variable action space, JDRL segments the action space into a fixed number of action groups, and fixes the policy output dimension for order selection as the number of action groups. In terms of the fairness criterion, JDRL adopts the max-min fairness, and augments the vanilla policy gradient to an iterative training algorithm that alternates between a minimization step and a policy improvement step to maximize both the worst and the overall performance of agents. In addition, we provide the theoretical convergence guarantee of our JDRL training algorithm even under non-convex policy networks and stochastic gradient updating. Extensive experiments are conducted with three public real-world ride-hailing order datasets, including over 2 million orders in Haikou, China, over 5 million orders in Chengdu, China, and over 6 million orders in New York City, USA. Experimental results show that JDRL demonstrates a consistent advantage compared to state-of-the-art baselines in terms of both efficiency and fairness. To the best of our knowledge, this is the first work that exploits joint order dispatching and driver repositioning to optimize both the long-term efficiency and fairness in a ride-hailing system.

引用

页码：3950 / 3960

页数：11

共 4 条

[1] Optimizing Long-Term Efficiency and Fairness in Ride-Hailing Under Budget Constraint via Joint Order Dispatching and Driver Repositioning
Sun, Jiahui
Jin, Haiming
Yang, Zhaoxing
Su, Lu
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (07) : 3348 - 3362
[2] Ride-Hailing Order Dispatching at DiDi via Reinforcement Learning
Qin, Zhiwei
Tang, Xiaocheng
Jiao, Yan
Zhang, Fan
Xu, Zhe
Zhu, Hongtu
Ye, Jieping
INFORMS JOURNAL ON APPLIED ANALYTICS, 2020, 50 (05): : 272 - 286
[3] Joint Optimization of Pricing, Dispatching and Repositioning in Ride-Hailing With Multiple Models Interplayed Reinforcement Learning
Zhang, Zhongyun
Yang, Lei
Yao, Jiajun
Ma, Chao
Wang, Jianguo
IEEE Transactions on Knowledge and Data Engineering, 2024, 36 (12) : 8593 - 8606
[4] CoRide: Joint Order Dispatching and Fleet Management for Multi-Scale Ride-Hailing Platforms
Jin, Jiarui
Zhou, Ming
Zhang, Weinan
Li, Minne
Guo, Zilong
Qin, Zhiwei
Jiao, Yan
Tang, Xiaocheng
Wang, Chenxi
Wang, Jun
Wu, Guobin
Ye, Jieping
PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 1983 - 1992

← 1 →