Solving the Vehicle Routing Problem with Stochastic Travel Cost Using Deep Reinforcement Learning

被引：0

作者：

Cai, Hao ^{[1
]}

Xu, Peng ^{[1
]}

Tang, Xifeng ^{[1
]}

Lin, Gan ^{[1
]}

机构：

[1] Hohai Univ, Coll Civil & Transportat Engn, Xikang Rd, Nanjing 210024, Peoples R China

来源：

ELECTRONICS | 2024年 / 13卷 / 16期

关键词：

VRP-STC; graph attention networks; multi-head attention mechanism; deep reinforcement learning; GO; SHOGI; CHESS; GAME;

D O I：

10.3390/electronics13163242

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The Vehicle Routing Problem (VRP) is a classic combinatorial optimization problem commonly encountered in the fields of transportation and logistics. This paper focuses on a variant of the VRP, namely the Vehicle Routing Problem with Stochastic Travel Cost (VRP-STC). In VRP-STC, the introduction of stochastic travel costs increases the complexity of the problem, rendering traditional algorithms unsuitable for solving it. In this paper, the GAT-AM model combining Graph Attention Networks (GAT) and multi-head Attention Mechanism (AM) is employed. The GAT-AM model uses an encoder-decoder architecture and employs a deep reinforcement learning algorithm. The GAT in the encoder learns feature representations of nodes in different subspaces, while the decoder uses multi-head AM to construct policies through both greedy and sampling decoding methods. This increases solution diversity, thereby finding high-quality solutions. The REINFORCE with Rollout Baseline algorithm is used to train the learnable parameters within the neural network. Test results show that the advantages of GAT-AM become greater as problem complexity increases, with the optimal solution generally unattainable through traditional algorithms within an acceptable timeframe.

引用

页数：19

共 50 条

[31] Dynamic stochastic electric vehicle routing with safe reinforcement learning
Basso, Rafael
Kulcsar, Balazs
Sanchez-Diaz, Ivan
Qu, Xiaobo
[J]. TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2022, 157
[32] Solving vehicle routing problems with stochastic and correlated travel times and makespan objectives
Bakach, Iurii
Campbell, Ann Melissa
Ehmke, Jan Fabian
Urban, Timothy L.
[J]. EURO JOURNAL ON TRANSPORTATION AND LOGISTICS, 2021, 10
[33] Solving vehicle routing problems with stochastic and correlated travel times and makespan objectives
Bakach, Iurii
Campbell, Ann Melissa
Ehmke, Jan Fabian
Urban, Timothy L.
[J]. EURO Journal on Transportation and Logistics, 2021, 10
[34] Reliable vehicle routing problem in stochastic networks with correlated travel times
Mojtaba Rajabi-Bahaabadi
Afshin Shariat-Mohaymany
Mohsen Babaei
Daniele Vigo
[J]. Operational Research, 2021, 21 : 299 - 330
[35] Reliable vehicle routing problem in stochastic networks with correlated travel times
Rajabi-Bahaabadi, Mojtaba
Shariat-Mohaymany, Afshin
Babaei, Mohsen
Vigo, Daniele
[J]. OPERATIONAL RESEARCH, 2021, 21 (01) : 299 - 330
[36] Applying genetic algorithm to vehicle routing problem with stochastic travel times
Xie, Binglei
An, Shi
[J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES A-MATHEMATICAL ANALYSIS, 2006, 13 : 693 - 697
[37] A Robust Optimization Approach for the Vehicle Routing Problem with Uncertain Travel Cost
Solano-Charris, Elyn L.
Prins, Christian
Santos, Andrea Cynthia
[J]. 2014 INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT), 2014, : 98 - 103
[38] Deep Reinforcement Learning Algorithm for Fast Solutions to Vehicle Routing Problem with Time-Windows
Gupta, Abhinav
Ghosh, Supratim
Dhara, Anulekha
[J]. PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022, : 236 - 240
[39] Solving a Vehicle Routing Problem with Ant Colony Optimisation and Stochastic Ranking
Haemmerle, Alexander
Ankerl, Martin
[J]. COMPUTER AIDED SYSTEMS THEORY, PT 1, 2013, 8111 : 259 - 266
[40] Solving Permutation Flowshop Problem with Deep Reinforcement Learning
Pan, Ruyuan
Dong, Xingye
Han, Sheng
[J]. 2020 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE (PHM-BESANCON 2020), 2020, : 349 - 353

← 1 2 3 4 5 →