Learning to Optimize Industry-Scale Dynamic Pickup and Delivery Problems

Cited by: 18

Authors:

Li, Xijun [1,2]

Luo, Weilin [2,4]

Yuan, Mingxuan [2]

Wang, Jun [2,3]

Lu, Jiawen [2]

Wang, Jie [1]

Lu, Jinhu [4]

Zeng, Jia [2]

Affiliations:

[1] USTC, MIRA Lab, Hefei, People's Republic of China

[2] Huawei, Noah's Ark Lab, Beijing, People's Republic of China

[3] UCL, London, England

[4] Beihang University, Beijing, People's Republic of China

Source:

2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021) | 2021

Keywords:

ALGORITHMS; STRATEGIES;

DOI:

10.1109/ICDE51399.2021.00283

Chinese Library Classification number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

The Dynamic Pickup and Delivery Problem (DPDP) is aimed at dynamically scheduling vehicles among multiple sites in order to minimize the cost when delivery orders are not known a priori. Although DPDP plays an important role in modern logistics and supply chain management, state-of-the-art DPDP algorithms are still limited on their solution quality and efficiency. In practice, they fail to provide a scalable solution as the numbers of vehicles and sites become large. In this paper, we propose a data-driven approach, Spatial-Temporal Aided Double Deep Graph Network (ST-DDGN), to solve industry-scale DPDP. In our method, the delivery demands are first forecast using spatial-temporal prediction method, which guides the neural network to perceive spatial-temporal distribution of delivery demand when dispatching vehicles. Besides, the relationships of individuals such as vehicles are modelled by establishing a graph-based value function. ST-DDGN incorporates attention-based graph embedding with Double DQN (DDQN). As such, it can make the inference across vehicles more efficiently compared with traditional methods. Our method is entirely data driven and thus adaptive, i.e., the relational representation of adjacent vehicles can be learned and corrected by ST-DDGN from data periodically. We have conducted extensive experiments over real-world data to evaluate our solution. The results show that ST-DDGN reduces 11.27% number of the used vehicles and decreases 13.12% total transportation cost on average over the strong baselines, including the heuristic algorithm deployed in our UAT (User Acceptance Test) environment and a variety of vanilla DRL methods. We are due to fully deploy our solution into our online logistics system and it is estimated that millions of USD logistics cost can be saved per year.
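
The abstract names Double DQN (DDQN) as the learning component but does not spell out its update. As a minimal sketch of the standard Double DQN target only (not the authors' ST-DDGN implementation, and without the graph embedding or demand forecast), the online network picks the next action and the target network scores it. The function name, argument names, and numbers below are hypothetical and chosen for illustration.

```python
from typing import Sequence


def double_dqn_target(
    reward: float,
    next_q_online: Sequence[float],
    next_q_target: Sequence[float],
    gamma: float = 0.99,
    done: bool = False,
) -> float:
    """Standard Double DQN target for a single transition.

    next_q_online / next_q_target are the Q-values of every action in the
    next state, as estimated by the online and target networks.
    """
    if done:
        # No bootstrapping from a terminal state.
        return reward
    # Action selection uses the online network ...
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    # ... while action evaluation uses the target network.
    return reward + gamma * next_q_target[best_action]


# Illustrative values only (hypothetical, not taken from the paper).
y = double_dqn_target(
    reward=1.0,
    next_q_online=[0.2, 0.9, 0.5],
    next_q_target=[0.4, 0.3, 0.8],
    gamma=0.5,
)
```

Here the online network prefers action 1, so the target bootstraps from the target network's value for that action (0.3) rather than from the target network's own maximum (0.8). Decoupling selection from evaluation in this way is what distinguishes Double DQN from vanilla DQN.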


Pages: 2511-2522

Number of pages: 12

