Learning to Optimize Industry-Scale Dynamic Pickup and Delivery Problems

Cited by: 18
Authors
Li, Xijun [1, 2]
Luo, Weilin [2, 4]
Yuan, Mingxuan [2]
Wang, Jun [2, 3]
Lu, Jiawen [2]
Wang, Jie [1]
Lu, Jinhu [4]
Zeng, Jia [2]
Affiliations
[1] USTC, MIRA Lab, Hefei, Peoples R China
[2] Huawei, Noahs Ark Lab, Beijing, Peoples R China
[3] UCL, London, England
[4] Beihang Univ, Beijing, Peoples R China
Keywords
ALGORITHMS; STRATEGIES;
DOI
10.1109/ICDE51399.2021.00283
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The Dynamic Pickup and Delivery Problem (DPDP) aims to dynamically schedule vehicles among multiple sites so as to minimize cost when delivery orders are not known a priori. Although DPDP plays an important role in modern logistics and supply chain management, state-of-the-art DPDP algorithms are still limited in solution quality and efficiency. In practice, they fail to provide a scalable solution as the numbers of vehicles and sites become large. In this paper, we propose a data-driven approach, Spatial-Temporal Aided Double Deep Graph Network (ST-DDGN), to solve industry-scale DPDP. In our method, the delivery demands are first forecast using a spatial-temporal prediction method, which guides the neural network to perceive the spatial-temporal distribution of delivery demand when dispatching vehicles. In addition, the relationships among individuals such as vehicles are modelled by establishing a graph-based value function. ST-DDGN combines attention-based graph embedding with Double DQN (DDQN). As such, it can perform inference across vehicles more efficiently than traditional methods. Our method is entirely data driven and thus adaptive, i.e., the relational representation of adjacent vehicles can be learned and corrected by ST-DDGN from data periodically. We have conducted extensive experiments on real-world data to evaluate our solution. The results show that, on average, ST-DDGN reduces the number of vehicles used by 11.27% and the total transportation cost by 13.12% compared with strong baselines, including the heuristic algorithm deployed in our UAT (User Acceptance Test) environment and a variety of vanilla DRL methods. We plan to fully deploy our solution in our online logistics system, and it is estimated that millions of USD in logistics cost can be saved per year.
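The abstract names Double DQN (DDQN) as the learning rule behind ST-DDGN but does not spell it out. As a minimal sketch of the standard Double DQN target only, and not of the paper's graph-based value function or attention embedding, the snippet below computes the bootstrap target for a single transition. The function name `double_dqn_target`, the example Q-values, and the reading of each action as a candidate vehicle are assumptions made for illustration.

```python
from typing import Sequence


def double_dqn_target(
    reward: float,
    next_q_online: Sequence[float],
    next_q_target: Sequence[float],
    gamma: float = 0.99,
    done: bool = False,
) -> float:
    """Standard Double DQN target for one transition (illustrative sketch)."""
    if done:
        # Terminal transition: nothing to bootstrap from.
        return reward
    # The online network selects the greedy action at the next state...
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    # ...and the target network evaluates that action. Decoupling selection
    # from evaluation is what distinguishes Double DQN from vanilla DQN.
    return reward + gamma * next_q_target[best_action]


# Hypothetical next-state Q-values for three candidate vehicles (assumed data).
q_online = [1.0, 3.0, 2.0]
q_target = [5.0, 0.5, 4.0]
target = double_dqn_target(
    reward=1.0, next_q_online=q_online, next_q_target=q_target, gamma=0.9
)
print(target)
```

With these assumed values the online network picks action 1, so the target bootstraps from the target network's estimate for that action rather than from the target network's own maximum (action 0), which is how Double DQN tempers the overestimation of vanilla DQN.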
Pages: 2511-2522
Page count: 12