Exploring Deep Reinforcement Learning for Task Dispatching in Autonomous On-Demand Services

Cited by: 4

Authors:

Yang, Lei [1]

Yu, Xi [1]

Cao, Jiannong [2]

Liu, Xuxun [3]

Zhou, Pan [4]

Affiliations:

[1] South China Univ Technol, Sch Software Engn, 382 Waihuandong Rd, Guangzhou 510006, Guangdong, Peoples R China

[2] Hong Kong Polytech Univ, Dept Comp, Kowloon, 11 Yucai Rd, Hong Kong, Peoples R China

[3] South China Univ Technol, Sch Elect & Informat Engn, 382 Waihuandong Rd, Guangzhou 510006, Guangdong, Peoples R China

[4] Huazhong Univ Sci & Technol, Hubei Engn Res Ctr Big Data Secur, Sch Cyber Sci & Engn, 1037 Luoyu Rd, Wuhan 430074, Hubei, Peoples R China

Source:

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA | 2021 / Vol. 15 / Issue 03

Funding:

National Natural Science Foundation of China;

Keywords:

Demand dispatching; on-demand services; deep reinforcement learning; ASSIGNMENT;

DOI:

10.1145/3442343

Chinese Library Classification:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Autonomous on-demand services, such as GOGOX (formerly GoGoVan) in Hong Kong, provide a platform for users to request services and for suppliers to meet such demands. On such a platform, suppliers are free to accept or reject the demands dispatched to them, which makes online matching between demands and suppliers challenging. Existing methods dispatch demands in rounds. They base each dispatching decision on the predicted response patterns of suppliers to demands in the current round, but fail to consider the impact of future demands and suppliers on that decision. This can lead to a dispatching decision that is suboptimal from a long-term perspective. To solve this problem, we propose a novel demand dispatching model using deep reinforcement learning. In this model, we treat each demand as an agent. The action of each agent, i.e., the dispatching decision for each demand, is determined by a centralized algorithm in a coordinated way. The model works in two steps. (1) It learns each demand's expected value in each spatiotemporal state from historical transition data. (2) Based on the learned values, it conducts many-to-many dispatching using a combinatorial optimization algorithm that considers both immediate rewards and the expected values of demands in the next round. To obtain a higher total reward, demands with a high expected future value (short response time) may be delayed to the next round. Conversely, demands with a low expected future value (long response time) are dispatched immediately. Through extensive experiments on real-world datasets, we show that the proposed model outperforms existing models in terms of cancellation rate and average response time.
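The two steps in the abstract can be illustrated with a toy sketch. It is not the authors' implementation: it swaps the deep value network for a tabular TD(0) estimate and replaces the many-to-many combinatorial optimization with an exhaustive one-to-one search in which each demand is either matched to a distinct supplier or delayed. All names (`td0_values`, `dispatch_round`, `immediate_reward`, the `(zone, time_slot)` state encoding) and the values of `alpha` and `gamma` are assumptions made for illustration.

```python
from itertools import product

DELAY = None  # hypothetical marker: hold the demand until the next round


def td0_values(transitions, alpha=0.1, gamma=0.9, epochs=50):
    """Step 1 (simplified): tabular TD(0) over historical transitions.

    Each transition is (state, reward, next_state); next_state is None
    when the demand's episode ends (e.g. it was served or cancelled).
    """
    values = {}
    for _ in range(epochs):
        for state, reward, next_state in transitions:
            v = values.get(state, 0.0)
            v_next = 0.0 if next_state is None else values.get(next_state, 0.0)
            # TD(0) update: move V(s) toward r + gamma * V(s')
            values[state] = v + alpha * (reward + gamma * v_next - v)
    return values


def dispatch_round(demands, suppliers, immediate_reward, values, gamma=0.9):
    """Step 2 (simplified): brute-force search over one round.

    demands: list of (demand_id, next_round_state).
    Every demand gets a distinct supplier or DELAY. A dispatched demand
    contributes its immediate reward; a delayed one contributes the
    discounted learned value of its next-round state.
    """
    options = list(suppliers) + [DELAY]
    best_total, best_plan = float("-inf"), None
    for choice in product(options, repeat=len(demands)):
        used = [s for s in choice if s is not DELAY]
        if len(used) != len(set(used)):
            continue  # a supplier may serve at most one demand here
        total = 0.0
        for (demand_id, next_state), supplier in zip(demands, choice):
            if supplier is DELAY:
                total += gamma * values.get(next_state, 0.0)
            else:
                total += immediate_reward[(demand_id, supplier)]
        if total > best_total:
            best_total = total
            best_plan = {d: s for (d, _), s in zip(demands, choice)}
    return best_plan, best_total


# Made-up data: zone A historically yields high reward, zone B low reward.
history = [(("zoneA", 0), 1.0, None), (("zoneB", 0), 0.2, None)]
values = td0_values(history)
demands = [("d1", ("zoneA", 0)), ("d2", ("zoneB", 0))]
rewards = {("d1", "s1"): 0.5, ("d2", "s1"): 0.5}
plan, total = dispatch_round(demands, ["s1"], rewards, values)
print(plan)
```

With this made-up data the single supplier goes to `d2`, whose future value is low, while `d1` is held for the next round, which mirrors the delay behaviour the abstract describes. The exhaustive search grows exponentially with the number of demands, so a real system would need a proper assignment or matching solver.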


Pages: 23

