Solving the Vehicle Routing Problem with Stochastic Travel Cost Using Deep Reinforcement Learning

被引:0
|
作者
Cai, Hao [1 ]
Xu, Peng [1 ]
Tang, Xifeng [1 ]
Lin, Gan [1 ]
机构
[1] Hohai Univ, Coll Civil & Transportat Engn, Xikang Rd, Nanjing 210024, Peoples R China
关键词
VRP-STC; graph attention networks; multi-head attention mechanism; deep reinforcement learning; GO; SHOGI; CHESS; GAME;
D O I
10.3390/electronics13163242
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Vehicle Routing Problem (VRP) is a classic combinatorial optimization problem commonly encountered in the fields of transportation and logistics. This paper focuses on a variant of the VRP, namely the Vehicle Routing Problem with Stochastic Travel Cost (VRP-STC). In VRP-STC, the introduction of stochastic travel costs increases the complexity of the problem, rendering traditional algorithms unsuitable for solving it. In this paper, the GAT-AM model combining Graph Attention Networks (GAT) and multi-head Attention Mechanism (AM) is employed. The GAT-AM model uses an encoder-decoder architecture and employs a deep reinforcement learning algorithm. The GAT in the encoder learns feature representations of nodes in different subspaces, while the decoder uses multi-head AM to construct policies through both greedy and sampling decoding methods. This increases solution diversity, thereby finding high-quality solutions. The REINFORCE with Rollout Baseline algorithm is used to train the learnable parameters within the neural network. Test results show that the advantages of GAT-AM become greater as problem complexity increases, with the optimal solution generally unattainable through traditional algorithms within an acceptable timeframe.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Dynamic stochastic electric vehicle routing with safe reinforcement learning
    Basso, Rafael
    Kulcsar, Balazs
    Sanchez-Diaz, Ivan
    Qu, Xiaobo
    [J]. TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2022, 157
  • [32] Solving vehicle routing problems with stochastic and correlated travel times and makespan objectives
    Bakach, Iurii
    Campbell, Ann Melissa
    Ehmke, Jan Fabian
    Urban, Timothy L.
    [J]. EURO JOURNAL ON TRANSPORTATION AND LOGISTICS, 2021, 10
  • [33] Solving vehicle routing problems with stochastic and correlated travel times and makespan objectives
    Bakach, Iurii
    Campbell, Ann Melissa
    Ehmke, Jan Fabian
    Urban, Timothy L.
    [J]. EURO Journal on Transportation and Logistics, 2021, 10
  • [34] Reliable vehicle routing problem in stochastic networks with correlated travel times
    Mojtaba Rajabi-Bahaabadi
    Afshin Shariat-Mohaymany
    Mohsen Babaei
    Daniele Vigo
    [J]. Operational Research, 2021, 21 : 299 - 330
  • [35] Reliable vehicle routing problem in stochastic networks with correlated travel times
    Rajabi-Bahaabadi, Mojtaba
    Shariat-Mohaymany, Afshin
    Babaei, Mohsen
    Vigo, Daniele
    [J]. OPERATIONAL RESEARCH, 2021, 21 (01) : 299 - 330
  • [36] Applying genetic algorithm to vehicle routing problem with stochastic travel times
    Xie, Binglei
    An, Shi
    [J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES A-MATHEMATICAL ANALYSIS, 2006, 13 : 693 - 697
  • [37] A Robust Optimization Approach for the Vehicle Routing Problem with Uncertain Travel Cost
    Solano-Charris, Elyn L.
    Prins, Christian
    Santos, Andrea Cynthia
    [J]. 2014 INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT), 2014, : 98 - 103
  • [38] Deep Reinforcement Learning Algorithm for Fast Solutions to Vehicle Routing Problem with Time-Windows
    Gupta, Abhinav
    Ghosh, Supratim
    Dhara, Anulekha
    [J]. PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022, : 236 - 240
  • [39] Solving a Vehicle Routing Problem with Ant Colony Optimisation and Stochastic Ranking
    Haemmerle, Alexander
    Ankerl, Martin
    [J]. COMPUTER AIDED SYSTEMS THEORY, PT 1, 2013, 8111 : 259 - 266
  • [40] Solving Permutation Flowshop Problem with Deep Reinforcement Learning
    Pan, Ruyuan
    Dong, Xingye
    Han, Sheng
    [J]. 2020 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE (PHM-BESANCON 2020), 2020, : 349 - 353