A Hybrid of Deep Reinforcement Learning and Local Search for the Vehicle Routing Problems

Cited by: 74
Authors
Zhao, Jiuxia [1]
Mao, Minjia [2]
Zhao, Xi [3]
Zou, Jianhua [1]
Affiliations
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Peoples R China
[3] Xi An Jiao Tong Univ, Sch Management, Xian 710049, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Routing; Adaptation models; Heuristic algorithms; Search problems; Training; Optimization; VRP; VRPTW; routing simulator; deep reinforcement learning; adaptive critic; local search; LARGE NEIGHBORHOOD SEARCH; OPTIMIZATION; ALGORITHMS; DELIVERY;
DOI
10.1109/TITS.2020.3003163
Chinese Library Classification number
TU [Building Science];
Discipline classification code
0813;
Abstract
Different variants of the Vehicle Routing Problem (VRP) have been studied for decades. State-of-the-art methods based on local search have been developed for VRPs, but they still suffer from slow running times and poor solution quality on large problem instances. To overcome these problems, we first propose a novel deep reinforcement learning (DRL) model, which is composed of an actor, an adaptive critic and a routing simulator. The actor, based on the attention mechanism, is designed to generate routing strategies. The adaptive critic is devised to change the network structure adaptively, in order to accelerate convergence and improve the solution quality during training. The routing simulator is developed to provide graph information and rewards to the actor and the adaptive critic. Then, we combine this DRL model with a local search method to further improve the solution quality. The output of the DRL model serves as the initial solution for the subsequent local search method, from which the final solution of the VRP is obtained. Tested on three datasets with 20, 50 and 100 customer points respectively, experimental results demonstrate that the DRL model alone finds better solutions than construction algorithms and previous DRL approaches, while enabling a 5- to 40-fold speedup. We also observe that combining the DRL model with various local search methods yields excellent solutions at a superior generation speed, compared with other initial solutions.
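The two-stage idea in the abstract (a constructed solution handed to local search as its starting point) can be illustrated with a minimal sketch. This is not the authors' method: a greedy nearest-neighbor heuristic stands in for the trained actor, 2-opt stands in for the unspecified local search, and the problem is reduced to a single vehicle with no capacity or time-window constraints. All function names and the random instance are hypothetical.

```python
import math
import random


def route_length(route, coords):
    """Total length of a closed tour that starts and ends at route[0]."""
    return sum(
        math.dist(coords[route[i]], coords[route[(i + 1) % len(route)]])
        for i in range(len(route))
    )


def nearest_neighbor_route(coords, depot=0):
    """Stand-in for the learned actor (assumption): greedy construction."""
    unvisited = set(range(len(coords))) - {depot}
    route = [depot]
    while unvisited:
        last = route[-1]
        nxt = min(unvisited, key=lambda j: math.dist(coords[last], coords[j]))
        route.append(nxt)
        unvisited.remove(nxt)
    return route


def two_opt(route, coords):
    """2-opt local search: reverse a segment whenever that shortens the tour."""
    best = route[:]
    best_len = route_length(best, coords)
    improved = True
    while improved:
        improved = False
        # Index 0 (the depot) is never moved.
        for i in range(1, len(best) - 1):
            for j in range(i + 1, len(best)):
                candidate = best[:i] + best[i:j + 1][::-1] + best[j + 1:]
                cand_len = route_length(candidate, coords)
                if cand_len < best_len - 1e-12:
                    best, best_len = candidate, cand_len
                    improved = True
    return best


# Hypothetical instance: depot plus 19 customers in the unit square.
random.seed(0)
coords = [(random.random(), random.random()) for _ in range(20)]

initial = nearest_neighbor_route(coords)   # stage 1: initial solution
refined = two_opt(initial, coords)         # stage 2: local search refinement

print(f"initial length: {route_length(initial, coords):.3f}")
print(f"refined length: {route_length(refined, coords):.3f}")
```

Because the local search only accepts strictly shorter tours, the refined route is never longer than the initial one; a better starting solution generally leaves the local search less work to do, which is the intuition behind the hybrid approach described above.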
Pages: 7208-7218
Number of pages: 11