Repositioning Strategy for Ride-Hailing Vehicles Based on Geometric Road Network Structure and Reinforcement Learning

被引：0

作者：

Xu L. ^{[1
]}

Yu J. ^{[1
]}

Pei M. ^{[1
]}

Wu P. ^{[2
]}

Li P. ^{[3
]}

机构：

[1] School of Civil Engineering and Transportation, South China University of Technology, Guangdong, Guangzhou

[2] School of Transportation Engineering, Chongqing Jiaotong University, Chongqing

[3] School of Automobile and Transportation, Shenzhen Polytechnic, Guangdong, Shenzhen

来源：

Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science) | 2023年 / 51卷 / 10期

关键词：

graph neural network; Markov decision process; multi-agent reinforcement learning; repositioning strategy of ride-hailing; urban traffic;

D O I：

10.12141/j.issn.1000-565X.230148

中图分类号：

学科分类号：

摘要：

The inefficient and inaccurate bidirectional search by both ride-hailing drivers and passengers leads to a mismatch between supply and demand. Ride-hailing vehicle repositioning strategy can pre-dispatch vehicles to areas with future demand, improving supply-demand matching. However, existing research mostly uses network grids to represent the urban road environment, lacking geometric topological information and reducing the dispatch accuracy. To address this issue, a ride-hailing vehicle relocation algorithm called GA2C was proposed based on Graph Neural Networks (GNN) and Actor-Critic reinforcement learning algorithm. This algorithm has a smoother learning process and can perform high-dimensional sampling, and it is suitable for learning the best relocation strategy for a large number of ride-hailing vehicles as multi-agent systems. Moreover, the geometric network structure was used to represent the urban road environment by using a GNN as a function approximator to learn the geometric information of the road network. Additionally, an action sampling strategy based on action value function was introduced to increase the randomness of action selection, effectively preventing competition. A ride-hailing vehicle relocation simulation experiment was conducted using Python, and the results are as follows: (i) the order response rate of the GA2C algorithm is 84. 2%, significantly higher than all the comparative experimental results; (ii) in the order distribution comparative experiment, GA2C’s relative improvements in uniform distribution, central distribution layout, block distribution layout, and checkerboard distribution layout are 1. 17%, 6. 02%, 13. 12%, and 14. 55%, respectively. The above experimental results demonstrate that the GA2C algorithm can effectively relocate ride-hailing vehicles. When the order distribution presents significant differences, and the distance between different demand areas is relatively close, it can better learn dynamic demand changes, and achieve maximum order response rate by relocating ride-hailing vehicles. © 2023 South China University of Technology. All rights reserved.

引用

页码：88 / 109

页数：21

共 36 条

[1] CHAO X L, A generalized fluid model of ride-hailing systems, Transportation Research Part B：Methodological, 150, pp. 587-605, (2021)
[2] CHENG S F, RAJENDRAM R．, Taxis strike back：a field trial of the driver guidance system, Proceedings of the 17th International Conference on Autonomous Agents and Multi-Agent Systems, pp. 577-584, (2018)
[3] Order-dispatching strategy induced by optimal transport plan for an online ride-hailing system ［J］, Transportation Research Record, 2676, pp. 156-169, (2022)
[4] CARLEE J W．, MOVI：a model-free approach to dynamic fleet management, Proceedings of the IEEE INFOCOM 2018-IEEE Conference on Computer Communications, pp. 2708-2716, (2018)
[5] WEN Huiying, LIN Yifeng, WU Haoshu, Extended co-evolutionary algorithm for path planning based on the urban traffic environment evolution ［J］, Journal of South China University of Technology（Natural Science Edition）, 49, 10, pp. 1-10, (2021)
[6] A taxi dispatch system based on prediction of demand and destination ［J］, Journal of Parallel and Distributed Computing, 157, 11, pp. 269-279, (2021)
[7] ZHANG L Y, A taxi order dispatch model based on combinatorial optimization, Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 2151-2159, (2017)
[8] LUO Ruifa, HAO Huijun, XU Taorang, Fundamental diagram model of mixed traffic flow of connected vehicles considering time delay ［J］, Journal of South China University of Technology（Natural Science Edition）, 51, 1, pp. 106-113, (2023)
[9] Data-driven robust taxi dispatch under demand uncertainties ［J］, IEEE Transactions on Control Systems Technology, 27, 1, pp. 175-191, (2019)
[10] SUTTON R，, BARTO A．, Reinforcement learning： an introduction ［J］．, IEEE Transactions on Neural Networks, 16, pp. 285-286, (2015)

← 1 2 3 4 →