Exploring Deep Reinforcement Learning for Task Dispatching in Autonomous On-Demand Services

被引:4
|
作者
Yang, Lei [1 ]
Yu, Xi [1 ]
Cao, Jiannong [2 ]
Liu, Xuxun [3 ]
Zhou, Pan [4 ]
机构
[1] South China Univ Technol, Sch Software Engn, 382 Waihuandong Rd, Guangzhou 510006, Guangdong, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Kowloon, 11 Yucai Rd, Hong Kong, Peoples R China
[3] South China Univ Technol, Sch Elect & Informat Engn, 382 Waihuandong Rd, Guangzhou 510006, Guangdong, Peoples R China
[4] Huazhong Univ Sci & Technol, Hubei Engn Res Ctr Big Data Secur, Sch Cyber Sci & Engn, 1037 Luoyu Rd, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Demand dispatching; on-demand services; deep reinforcement learning; ASSIGNMENT;
D O I
10.1145/3442343
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Autonomous on-demand services, such as GOGOX (formerly GoGoVan) in Hong Kong, provide a platform for users to request services and for suppliers to meet such demands. In such a platform, the suppliers have autonomy to accept or reject the demands to be dispatched to him/her, so it is challenging to make an online matching between demands and suppliers. Existing methods use round-based approaches to dispatch demands. In these works, the dispatching decision is based on the predicted response patterns of suppliers to demands in the current round, but they all fail to consider the impact of future demands and suppliers on the current dispatching decision. This could lead to taking a suboptimal dispatching decision from the future perspective. To solve this problem, we propose a novel demand dispatching model using deep reinforcement learning. In this model, we make each demand as an agent. The action of each agent, i.e., the dispatching decision of each demand, is determined by a centralized algorithm in a coordinated way. The model works in the following two steps. (1) It learns the demand's expected value in each spatiotemporal state using historical transition data. (2) Based on the learned values, it conducts a Many-To-Many dispatching using a combinatorial optimization algorithm by considering both immediate rewards and expected values of demands in the next round. In order to get a higher total reward, the demands with a high expected value (short response time) in the future may be delayed to the next round. On the contrary, the demands with a low expected value (long response time) in the future would be dispatched immediately. Through extensive experiments using real-world datasets, we show that the proposed model outperforms the existing models in terms of Cancellation Rate and Average Response Time.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] A Novel Demand Dispatching Model for Autonomous On-Demand Services
    Yang, Lei
    Yu, Xi
    Cao, Jiannong
    Li, Wengen
    Wang, Yuqi
    Szczecinski, Michal
    [J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (01) : 322 - 333
  • [2] A Deep Reinforcement Learning Approach to Ride-Sharing Vehicle Dispatching in Autonomous Mobility-on-Demand Systems
    Guo, Ge
    Xu, Yangguang
    [J]. IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2022, 14 (01) : 128 - 140
  • [3] Optimization of On-Demand Shared Autonomous Vehicle Deployments Utilizing Reinforcement Learning
    Meneses-Cime, Karina
    Guvenc, Bilin Aksun
    Guvenc, Levent
    [J]. SENSORS, 2022, 22 (21)
  • [4] Deep Reinforcement Learning for On-demand Intelligent Routing in Deterministic Networks
    Liu, Ying
    Yin, Jianhui
    Zhang, Weiting
    Xie, Shanghan
    [J]. IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 1932 - 1937
  • [5] Exploring Deep Reinforcement Learning for Autonomous Powerline Tracking
    Pienroj, Panin
    Schonborn, Sandro
    Birke, Robert
    [J]. IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM 2019 WKSHPS), 2019, : 496 - 501
  • [6] A Matching Algorithm with Reinforcement Learning and Decoupling Strategy for Order Dispatching in On-Demand Food Delivery
    Chen, Jingfang
    Wang, Ling
    Pan, Zixiao
    Wu, Yuting
    Zheng, Jie
    Ding, Xuetao
    [J]. TSINGHUA SCIENCE AND TECHNOLOGY, 2024, 29 (02) : 386 - 399
  • [7] Asynchronous Deep Reinforcement Learning for Collaborative Task Computing and On-Demand Resource Allocation in Vehicular Edge Computing
    Liu, Lei
    Feng, Jie
    Mu, Xuanyu
    Pei, Qingqi
    Lan, Dapeng
    Xiao, Ming
    [J]. IEEE Transactions on Intelligent Transportation Systems, 2023, 24 (12) : 15513 - 15526
  • [8] Dynamic matching radius decision model for on-demand ride services: A deep multi-task learning approach
    Chen, Taijie
    Shen, Zijian
    Feng, Siyuan
    Yang, Linchuan
    Ke, Jintao
    [J]. Transportation Research Part E: Logistics and Transportation Review, 2025, 193
  • [9] A robust deep reinforcement learning approach to driverless taxi dispatching under uncertain demand
    Zhou, Xiaoting
    Wu, Lubin
    Zhang, Yu
    Chen, Zhen-Song
    Jiang, Shancheng
    [J]. INFORMATION SCIENCES, 2023, 646
  • [10] Multi-task dispatch of shared autonomous electric vehicles for Mobility-on-Demand services - combination of deep reinforcement learning and combinatorial optimization method
    Wang, Ning
    Guo, Jiahui
    [J]. HELIYON, 2022, 8 (11)